JWAT

Clone Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
POM cleanup. Minor change in WarcRecord to accommodate WarcModule.
    • -2
    • +4
    /jwat-distribution/src/main/assembly/bin.xml
POM cleanup and modified to accomodate sonatype release. jwat-parent removed. Removed some tabs.
    • -13
    • +0
    /jwat-parent/src/main/assembly/resources.xml
    • -216
    • +0
    /jwat-parent/src/main/resources/jwat/checkstyle.xml
    • -1
    • +0
    /jwat-parent/src/main/resources/test.txt
Review changes. Javadoc and comments.
Removed a new batch of tab. Found 8562 occurrence(s) in 64 file(s)
  1. … 49 more files in changeset.
Found and fixed a digest compare bug while running my jwat vs. heritrix benchmark.
Digest unit test coverage. Initial gzip unit test.
    • -0
    • +57
    /jwat-gzip/src/test/java/org/jwat/gzip/TestInvalid.java
    • binary
    /jwat-gzip/src/test/resources/invalid-entries.gz
    • binary
    /jwat-gzip/src/test/resources/invalid-magic.gz
    • binary
    /jwat-gzip/src/test/resources/sample.txt.gz
    • binary
    /jwat-gzip/src/test/resources/three-files.gz
Digest validation in WARC reader and optional digest computation in ARC reader.
Fixed a serious bug in FixedLengthInputStream and renamed another to MaxLength from FixedLength.
  1. … 3 more files in changeset.
Renamed packages to org.jwat.*, Mikis fixed assembly pomming, broke alot of unit tests.
  1. … 149 more files in changeset.
changes from review of last changeset. More unit test coverage of common. Found a serious bug which will be fixed in next changeset.
  1. … 17 more files in changeset.
Jenkins failed on missing key for gpg signing and license check.
Fixes per review. Digest support in payload and httpresponse. Removed licenses.
  1. … 39 more files in changeset.
Fixes from review and checktyle. Added Base<x> code with unittests. Added content-type parser/validator. Various unittests in common and arc. Fixed tasks WARC-30, WARC-38, WARC-22, WARC-32)
    • -0
    • +144
    /jwat-common/src/test/java/dk/netarkivet/common/TestBase16.java
    • -0
    • +272
    /jwat-common/src/test/java/dk/netarkivet/common/TestBase2.java
    • -0
    • +221
    /jwat-common/src/test/java/dk/netarkivet/common/TestBase32.java
    • -0
    • +165
    /jwat-common/src/test/java/dk/netarkivet/common/TestBase64.java
  1. … 5 more files in changeset.
Added loads of parameter testing etc. for the warc package. Added some parameter testing on arc. Started on common unit tests and a ContentType parser/validator.
    • -0
    • +115
    /jwat-common/src/main/java/dk/netarkivet/common/ContentType.java
    • -0
    • +395
    /jwat-warc/src/test/java/dk/netarkivet/warclib/TestParams.java
Fixed a bunch of review issues, loads of checkstyle issues, added javadoc, etc.
  1. … 13 more files in changeset.
Fixes WARC-28 and WARC-29. Tried DigestInputStream. Also added Base16, Base32 and Base64 code.
Minor javadoc change.
Corrected a few javadocs.
Added some javadocs and also some reviewed issues.
  1. … 7 more files in changeset.
Removed some todos, fixed some todos, fixed arcversion code, fixed payload trunaction detectionm, fixed som javadoc, changed the main pom slightly.
Fixed the valid utf-8 test file in relation to cr-lf and utf-8 content.
    • -51
    • +51
    /jwat-warc/src/test/resources/test-utf8.warc
Forgot to un-uncomment a method used somewhere else.
Fixed some gzip support in arc and warc which should hopefully work now. Added test to arcreadercompressed.
  1. … 3 more files in changeset.
Added compressed arcreader. Changed some minor stuff. Renamed some stuff to make arc/warc ressemble more.
  1. … 16 more files in changeset.
Added some javadoc here and there. Fixed some review issues. Moved WarcInputStream to common and part 1 of arc factory conversion.
Added test for accessibility to non warc headers and fixed a bug in the process (WARC-17).
    • -0
    • +26
    /jwat-warc/src/test/resources/test-non-warc-headers.warc
Official fix for WARC-23, WARC-15, WARC-14. Pom voyage.
Added javadoc here and there, added content-header availability, move payload to common and almost integrated into warc reader.

Also fixed a few review comments.

    • -0
    • +17
    /jwat-parent/.project
  1. … 4 more files in changeset.
Fixed a serious bug in warcinputstream, added some factory test, added more complete factory integration, loads of pom gymnastics, etc.
    • -0
    • +252
    /jwat-parent/pom.xml
    • -0
    • +13
    /jwat-parent/src/main/assembly/resources.xml
    • -0
    • +20
    /jwat-parent/src/main/resources/jwat/.checkstyle
    • -0
    • +1
    /jwat-parent/src/main/resources/jwat/LICENSE.txt
    • -0
    • +2
    /jwat-parent/src/main/resources/jwat/checkstyle.properties
    • -0
    • +216
    /jwat-parent/src/main/resources/jwat/checkstyle.xml
    • -0
    • +1
    /jwat-parent/src/main/resources/test.txt
  1. … 6 more files in changeset.
Fixed a test, removed some test with large test data, added some new files and a new warc factory test. Minor refactoring.
    • binary
    /jwat-arc/src/test/resources/IAH-20080430204825-00000-blackbook.arc.gz
    • binary
    /jwat-warc/src/test/resources/IAH-20080430204825-00000-blackbook.warc
    • binary
    /jwat-warc/src/test/resources/IAH-20080430204825-00000-blackbook.warc.gz