Improve support for empty files and errors at the end of (W)ARC files in the ArchiveParser/ArchiveParserCallback.
  1. … 3 more files in changeset.
CR-JWAS-33: Follow-up on review.
  1. … 71 more files in changeset.
Fixed some texts. Added some spaces.
  1. … 39 more files in changeset.
The Gzip reader's close() now closes the underlying stream, so the user has fewer streams to keep track of.
Classes with a close() method now implement Closeable.
  1. … 8 more files in changeset.
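
The close() behaviour described in this changeset can be sketched roughly as follows. This is a minimal illustration, not the JWAT code: the class name OwningReader and its methods are hypothetical, and only the standard java.io types are real.

```java
import java.io.Closeable;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical reader (not the JWAT GzipReader): shows close() propagating
// to the wrapped stream so callers only have to close one object.
class OwningReader implements Closeable {
    private final InputStream in;
    private boolean closed;

    OwningReader(InputStream in) {
        this.in = in;
    }

    // Reads one byte from the wrapped stream, or fails if already closed.
    int read() throws IOException {
        if (closed) {
            throw new IOException("reader is closed");
        }
        return in.read();
    }

    // Closes the reader and the underlying stream; repeated calls are no-ops.
    @Override
    public void close() throws IOException {
        if (!closed) {
            closed = true;
            in.close();
        }
    }

    boolean isClosed() {
        return closed;
    }
}
```

Because the class implements Closeable, it can be used in a try-with-resources statement, which closes the wrapped stream as well when the block exits.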
Fixed the javadoc so that the command 'mvn -Psonatype-oss-release clean install -Dgpg.skip=true' works
  1. … 10 more files in changeset.
JWAT-69: Unit tested WARC-Refers-To-Target-URI and WARC-Refers-To-Date in reader.

Fixed some small bugs and omissions with the reading of those new headers.

Removed some tabs.

  1. … 12 more files in changeset.
JWAT-70: Unit test DataFormatException in Gzip reader/writer.

JWAT-71: Found IndexOutOfBoundsException while unit testing DataFormatException in GzipWriter.

Removed some tabs.

  1. … 4 more files in changeset.
Improved Gzip reader getConsumed() and getOffset() unit testing.
  1. … 2 more files in changeset.
JWAT-50: Added support and validation for FEXTRA subfields in reader and writer.

Fixed some old static FEXTRA test data with incorrect endianness.

    • -0
    • +45
    ./jwat/gzip/GzipExtraData.java
  1. … 5 more files in changeset.
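
For context, RFC 1952 lays out the FEXTRA field as a sequence of subfields, each with two ID bytes (SI1, SI2), a two-byte little-endian length, and that many data bytes. The sketch below shows one way such a field might be split and validated. It is an assumption-laden illustration, not the contents of GzipExtraData.java: the class ExtraSubfields, its nested Subfield type, and the parse method are all hypothetical names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper (not JWAT's GzipExtraData): splits a raw FEXTRA
// byte array into subfields following the RFC 1952 layout.
class ExtraSubfields {

    // One subfield: two ID bytes plus its data.
    static final class Subfield {
        final int si1;
        final int si2;
        final byte[] data;

        Subfield(int si1, int si2, byte[] data) {
            this.si1 = si1;
            this.si2 = si2;
            this.data = data;
        }
    }

    // Parses all subfields; throws IllegalArgumentException on truncation.
    static List<Subfield> parse(byte[] extra) {
        List<Subfield> result = new ArrayList<>();
        int pos = 0;
        while (pos < extra.length) {
            if (extra.length - pos < 4) {
                throw new IllegalArgumentException(
                        "truncated subfield header at offset " + pos);
            }
            int si1 = extra[pos] & 0xFF;
            int si2 = extra[pos + 1] & 0xFF;
            // LEN is stored least significant byte first (little-endian).
            int len = (extra[pos + 2] & 0xFF) | ((extra[pos + 3] & 0xFF) << 8);
            pos += 4;
            if (len > extra.length - pos) {
                throw new IllegalArgumentException(
                        "subfield data exceeds extra field at offset " + pos);
            }
            result.add(new Subfield(si1, si2,
                    Arrays.copyOfRange(extra, pos, pos + len)));
            pos += len;
        }
        return result;
    }
}
```

Reading the length with the bytes swapped is the kind of endianness mistake that would make otherwise valid test data fail the bounds check above.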
Follow-up from reviews.

Saving of test data for use in JHOVE2.

'no-type' is ignored when looking for HTTP headers.

Improved detection of possible ARC records.

Minor tweaks.

  1. … 43 more files in changeset.
Zero-length ARC, WARC and GZip files are now reported as non-compliant.
    • -178
    • +211
    ./jwat/gzip/GzipReader.java
  1. … 15 more files in changeset.
startOffset tweaking and unit testing.

Unit testing of those hard to throw exceptions.

  1. … 21 more files in changeset.
A bit more unit testing.
  1. … 10 more files in changeset.
Follow-up on reviews. (CR-JWAS-19, CR-JWAS-20, CR-JWAS-21, CR-JWAS-22, CR-JWAS-23, CR-JWAS-24)
  1. … 44 more files in changeset.
Changes from review (spelling, javadoc, exception handling etc.) and BnF tests (GZip compliant fields/methods). Added missing javadoc to common and warc packages.
  1. … 35 more files in changeset.
Refactoring of gzip test class names and addition of compressed entry size.
  1. … 19 more files in changeset.
Added missing consumed and isValid logic/methods to GZip reader and entry.
  1. … 2 more files in changeset.
More unit testing, consumed now works for ARC/WARC sequential and random methods. Other minor refactoring.
  1. … 21 more files in changeset.
JWAT-48, JWAT-51, JWAT-49, JWAT-52: WarcWriter now validates the length of the payload written, added additional writepayload methods, added diagnostics object directly on reader/writer, fixed serious GZipOutputStream bug.
  1. … 18 more files in changeset.
Added some data-typed add-header methods, including unit tests.

Also fixed some minor stuff with the WARC writer states.

  1. … 18 more files in changeset.
Minor interface changes, unit tests cover warc package except the writer, tabs removed, etc.
  1. … 34 more files in changeset.
Cleanup, comments and javadocs.
  1. … 13 more files in changeset.
Fixed a bug in the compressed WARC writer and in some consumed/offset methods in the WARC readers.
  1. … 12 more files in changeset.
Forgot to add a class. Added some javadoc. Added some unit testing.
  1. … 11 more files in changeset.
Improved ContentType parser, EncodedWords parser, QuotedPrintable parser, Base32/64 decoders.

Improved HeaderLineReader.

Improved junit code coverage in common package.

Implemented OutputStream wrapper for GZip writer.

Added support for GZip compressed WARC writing.

  1. … 36 more files in changeset.
JWAT-20, JWAT-19: Work in progress on header line reader. Added encoded-words, quoted printables, unit tests etc.
  1. … 27 more files in changeset.
JWAT-27, JWAT-39, JWAT-41: Improved HeaderLineReader and HttpResponse, unit tests, javadocs, minor fixes here and there...
  1. … 22 more files in changeset.
Work in progress on line readers and JWAT-11, JWAT-17, JWAT-20, JWAT-19, JWAT-28, JWAT-39. Review changes etc.
  1. … 79 more files in changeset.
JWAT-42: warc package changed to use common error/warning classes, like gzip package.
  1. … 24 more files in changeset.