dk

Clone Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
Prepare gzip code for joining and mavenize it.
    • -79
    • +0
    ./netarkivet/gzip/GzipConstants.java
    • -460
    • +0
    ./netarkivet/gzip/GzipEntry.java
    • -444
    • +0
    ./netarkivet/gzip/GzipInputStream.java
  1. … 6 more files in changeset.
Temporary project for cleaning up gzipinputstream classes.
    • -0
    • +79
    ./netarkivet/gzip/GzipConstants.java
    • -0
    • +460
    ./netarkivet/gzip/GzipEntry.java
    • -0
    • +444
    ./netarkivet/gzip/GzipInputStream.java
    • -0
    • +139
    ./netarkivet/gzip/TestGzip.java
  1. … 3 more files in changeset.
Moved files and folders around part 1.
    • -333
    • +0
    ./netarkivet/warclib/WarcConstants.java
    • -64
    • +0
    ./netarkivet/warclib/WarcDateParser.java
    • -40
    • +0
    ./netarkivet/warclib/WarcDigest.java
    • -47
    • +0
    ./netarkivet/warclib/WarcErrorType.java
    • -20
    • +0
    ./netarkivet/warclib/WarcHeader.java
    • -114
    • +0
    ./netarkivet/warclib/WarcInputStream.java
    • -102
    • +0
    ./netarkivet/warclib/WarcParser.java
    • -1095
    • +0
    ./netarkivet/warclib/WarcRecord.java
  1. … 30 more files in changeset.
Big corporate merger!
Added some quoted string parsing. Fixed huge skip bug which was apparent testing with a BufferedInputStream.
    • -6
    • +22
    ./netarkivet/warclib/WarcRecord.java
Minor additions to the read header routine.
Partial quoted string and almost no encoded words.
    • -15
    • +76
    ./netarkivet/warclib/WarcRecord.java
  1. … 1 more file in changeset.
Added utf8 support to header linereader. Seems to works. Tests not conclusive.

Needs tweaking and more unit tests.

    • -2
    • +81
    ./netarkivet/warclib/WarcRecord.java
  1. … 2 more files in changeset.
Wrote a functional readheader line method that now handles multiline headers.

Added some unit test.

    • -0
    • +11
    ./netarkivet/warclib/WarcHeader.java
    • -202
    • +318
    ./netarkivet/warclib/WarcRecord.java
  1. … 4 more files in changeset.
Various stuff.

Moved test folders around.

Fixed trailing newline requirement after record.

Also fixed some incorrect test files.

Added a pushback inputstream for the newline checker and also to be used in header readline routine.

    • -0
    • +114
    ./netarkivet/warclib/WarcInputStream.java
    • -7
    • +49
    ./netarkivet/warclib/WarcRecord.java
  1. … 46 more files in changeset.
Fixed Content-Length in some test warc that were incorrect after checking for excess lines in the parser.

Minor tweaks.

Moved unit tests and test files to seperate folders.

    • -0
    • +9
    ./netarkivet/warclib/WarcHeader.java
    • -0
    • +13
    ./netarkivet/warclib/WarcRecord.java
  1. … 4 more files in changeset.
Added Digest Parser. Started on header readline method.

Added some more unit tests.

    • -0
    • +40
    ./netarkivet/warclib/WarcDigest.java
    • -5
    • +71
    ./netarkivet/warclib/WarcRecord.java
  1. … 3 more files in changeset.
Fixed some more header validation.

Added some matrix checks.

Added some content-type, segment-number checks.

Changed the error types to more types and more meaningful names.

Added some more unit tests to cover most of the current functionality.

    • -1
    • +2
    ./netarkivet/warclib/WarcConstants.java
    • -5
    • +20
    ./netarkivet/warclib/WarcErrorType.java
    • -47
    • +125
    ./netarkivet/warclib/WarcRecord.java
    • -0
    • +88
    ./netarkivet/warclib/WarcValidationError.java
  1. … 13 more files in changeset.
Added detection of duplicate fields.

Finished some more unit-tests.

    • -1
    • +11
    ./netarkivet/warclib/WarcConstants.java
    • -0
    • +32
    ./netarkivet/warclib/WarcErrorType.java
    • -0
    • +33
    ./netarkivet/warclib/WarcParser.java
    • -112
    • +131
    ./netarkivet/warclib/WarcRecord.java
  1. … 13 more files in changeset.
Added some unit tests.

Added some more header parsing code.

Fixed a date case error and case error in magic identifier.

    • -34
    • +44
    ./netarkivet/warclib/TestWarc.java
    • -27
    • +60
    ./netarkivet/warclib/WarcConstants.java
    • -4
    • +3
    ./netarkivet/warclib/WarcDateParser.java
    • -19
    • +64
    ./netarkivet/warclib/WarcRecord.java
  1. … 4 more files in changeset.
Added an iterator to the parser.

Introduced myself to junit and made 2 small tests that compare the number of records with the expected number using both the iterator and nextrecord method.

    • -0
    • +10
    ./netarkivet/warclib/WarcConstants.java
    • -0
    • +37
    ./netarkivet/warclib/WarcParser.java
    • -14
    • +9
    ./netarkivet/warclib/WarcRecord.java
Parser almost validates all fields according to specs.

Policy errors need there own class.

    • -3
    • +137
    ./netarkivet/warclib/WarcConstants.java
    • -33
    • +157
    ./netarkivet/warclib/WarcRecord.java
The warc parser now parses all fields in a simplistic way.

WarcDateParser added. Other parsers were borrowed from the arc package.

    • -12
    • +18
    ./netarkivet/warclib/WarcConstants.java
    • -0
    • +65
    ./netarkivet/warclib/WarcDateParser.java
    • -8
    • +266
    ./netarkivet/warclib/WarcRecord.java
First commit.

CheckMagic and Version.

Primitive WARC field parser.

    • -0
    • +42
    ./netarkivet/warclib/TestWarc.java
    • -0
    • +139
    ./netarkivet/warclib/WarcConstants.java
    • -0
    • +34
    ./netarkivet/warclib/WarcParser.java
    • -0
    • +183
    ./netarkivet/warclib/WarcRecord.java
  1. … 1 more file in changeset.