JWAT

Clone Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
Removed some tabs and pom cleanup again. Switch license plugin.
Wrote a new gzip reader/validator. Wrote a gzip writer. Unittest covered almost all the gzip code.

Old gzip reader will be moved to test and phased out.

    • -0
    • +1
    /jwat-distribution/src/main/assembly/bin.xml
    • -0
    • +431
    /jwat-gzip/src/main/java/org/jwat/gzip/GzipReader.java
    • -0
    • +165
    /jwat-gzip/src/main/java/org/jwat/gzip/GzipReaderEntry.java
    • -0
    • +277
    /jwat-gzip/src/main/java/org/jwat/gzip/GzipWriter.java
    • -0
    • +189
    /jwat-gzip/src/test/java/org/jwat/gzip/TestEncoding.java
    • -0
    • +321
    /jwat-gzip/src/test/java/org/jwat/gzip/TestFlagged.java
    • -0
    • +98
    /jwat-gzip/src/test/java/org/jwat/gzip/TestGzipReader.java
    • -0
    • +82
    /jwat-gzip/src/test/java/org/jwat/gzip/TestGzipWriter.java
    • -0
    • +231
    /jwat-gzip/src/test/java/org/jwat/gzip/TestGzipWriterCloning.java
    • -0
    • +153
    /jwat-gzip/src/test/java/org/jwat/gzip/TestInputStream.java
  1. … 5 more files in changeset.
Back to work on the snapshot.
Added tag v0.8.0 for changeset 7df5e9d5631d
v0.8.0
v0.8.0 aka Maven is madness.
Added Apache 2 License.
  1. … 80 more files in changeset.
Tabs removed again. Cleanup in WarcWriter test.
Review changes, javadocs, magic number unit tests.
    • -0
    • +52
    /jwat-arc/src/test/java/org/jwat/arc/TestMagic.java
    • -0
    • +51
    /jwat-gzip/src/test/java/org/jwat/gzip/TestMagic.java
  1. … 13 more files in changeset.
Review changes. Stripped license changeset since it was incorrect.
Revised the license header, enabled the license plugin and added the header to all java files.
  1. … 75 more files in changeset.
Started refactoring parts of HttpResponse and added simple magic detection to arc and warc factories.
Unit tests and coverage of readLine methods across all stream implementations (JWAT-25), Unit tests and coverage of RandomFileAccessI/OStreams in common (JWAT-24).
  1. … 9 more files in changeset.
A bunch of small changes to accommodate the jhove2 modules.

GZip changes as per old review.

Expose ComputedDigests,DigestEncoding,HttpResponse headers exposed,ValidDigests,Offset,Consumed.

    • -0
    • +23
    /jwat-common/src/main/java/org/jwat/common/Digest.java
  1. … 3 more files in changeset.
fixed serious double close bug, added encoding info to digest and changed some javadoc.
POM cleanup. Minor change in WarcRecord to accommodate WarcModule.
    • -2
    • +4
    /jwat-distribution/src/main/assembly/bin.xml
POM cleanup and modified to accomodate sonatype release. jwat-parent removed. Removed some tabs.
    • -13
    • +0
    /jwat-parent/src/main/assembly/resources.xml
    • -216
    • +0
    /jwat-parent/src/main/resources/jwat/checkstyle.xml
    • -1
    • +0
    /jwat-parent/src/main/resources/test.txt
Review changes. Javadoc and comments.
Removed a new batch of tab. Found 8562 occurrence(s) in 64 file(s)
  1. … 49 more files in changeset.
Found and fixed a digest compare bug while running my jwat vs. heritrix benchmark.
Digest unit test coverage. Initial gzip unit test.
    • -0
    • +57
    /jwat-gzip/src/test/java/org/jwat/gzip/TestInvalid.java
    • binary
    /jwat-gzip/src/test/resources/invalid-entries.gz
    • binary
    /jwat-gzip/src/test/resources/invalid-magic.gz
    • binary
    /jwat-gzip/src/test/resources/sample.txt.gz
    • binary
    /jwat-gzip/src/test/resources/three-files.gz
Digest validation in WARC reader and optional digest computation in ARC reader.
Fixed a serious bug in FixedLengthInputStream and renamed another to MaxLength from FixedLength.
  1. … 3 more files in changeset.
Renamed packages to org.jwat.*, Mikis fixed assembly pomming, broke alot of unit tests.
  1. … 149 more files in changeset.
changes from review of last changeset. More unit test coverage of common. Found a serious bug which will be fixed in next changeset.
  1. … 17 more files in changeset.
Jenkins failed on missing key for gpg signing and license check.
Fixes per review. Digest support in payload and httpresponse. Removed licenses.
  1. … 39 more files in changeset.
Fixes from review and checktyle. Added Base<x> code with unittests. Added content-type parser/validator. Various unittests in common and arc. Fixed tasks WARC-30, WARC-38, WARC-22, WARC-32)
    • -0
    • +144
    /jwat-common/src/test/java/dk/netarkivet/common/TestBase16.java
    • -0
    • +272
    /jwat-common/src/test/java/dk/netarkivet/common/TestBase2.java
    • -0
    • +221
    /jwat-common/src/test/java/dk/netarkivet/common/TestBase32.java
    • -0
    • +165
    /jwat-common/src/test/java/dk/netarkivet/common/TestBase64.java
  1. … 5 more files in changeset.
Added loads of parameter testing etc. for the warc package. Added some parameter testing on arc. Started on common unit tests and a ContentType parser/validator.
    • -0
    • +115
    /jwat-common/src/main/java/dk/netarkivet/common/ContentType.java
    • -0
    • +395
    /jwat-warc/src/test/java/dk/netarkivet/warclib/TestParams.java
Fixed a bunch of review issues, loads of checkstyle issues, added javadoc, etc.
  1. … 13 more files in changeset.