JWAT-89: Removed EncodedWords use in HeaderLineParser. Both need to be refactored, and it is not really useful.
    • -1
    • +7
    ./java/org/jwat/common/HeaderLineReader.java
  1. … 1 more file in changeset.
CR-JWAS-33: Follow-up on review.
    • -3
    • +3
    ./java/org/jwat/common/ByteArrayIOStream.java
    • -1
    • +1
    ./java/org/jwat/common/EncodedWords.java
    • -1
    • +1
    ./java/org/jwat/common/HeaderLine.java
    • -3
    • +3
    ./java/org/jwat/common/HeaderLineReader.java
  1. … 59 more files in changeset.
ArcReader and WarcReader now implement Iterable<..> interface.
    • -1
    • +1
    ./java/org/jwat/common/EncodedWords.java
  1. … 3 more files in changeset.
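To illustrate what implementing Iterable makes possible for a reader class, here is a minimal sketch. RecordReader and its String records are hypothetical stand-ins chosen for illustration; they are not the ArcReader/WarcReader API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in for a record reader; not the JWAT API.
final class RecordReader implements Iterable<String> {
    private final List<String> records;

    RecordReader(List<String> records) {
        this.records = records;
    }

    // Implementing iterator() is all that Iterable requires.
    @Override
    public Iterator<String> iterator() {
        return records.iterator();
    }

    // Collects every record using the enhanced for loop that Iterable enables.
    static List<String> readAll(RecordReader reader) {
        List<String> out = new ArrayList<>();
        for (String record : reader) {
            out.add(record);
        }
        return out;
    }

    public static void main(String[] args) {
        RecordReader reader = new RecordReader(Arrays.asList("record-1", "record-2"));
        System.out.println(readAll(reader)); // prints [record-1, record-2]
    }
}
```

The practical benefit is that callers can write a plain for-each loop instead of repeatedly calling a "get next record" method and checking for null.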
Use default charset in case of bad charset and handle bad encoding in WARC-Target-URI header (add a simple test case)
    • -0
    • +6
    ./java/org/jwat/common/EncodedWords.java
    • -0
    • +10
    ./java/org/jwat/common/HeaderLineReader.java
  1. … 3 more files in changeset.
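One way to fall back to the default charset when a header names a bad charset is sketched below. This is an assumption about the general technique, not the code from this changeset; CharsetFallback and resolve are hypothetical names.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;
import java.nio.charset.UnsupportedCharsetException;

// Hypothetical helper; not taken from EncodedWords or HeaderLineReader.
final class CharsetFallback {
    // Returns the named charset, or the platform default when the name is
    // missing, syntactically illegal, or not supported by this JVM.
    static Charset resolve(String name) {
        if (name == null) {
            return Charset.defaultCharset();
        }
        try {
            return Charset.forName(name);
        } catch (IllegalCharsetNameException | UnsupportedCharsetException e) {
            return Charset.defaultCharset();
        }
    }

    public static void main(String[] args) {
        System.out.println(resolve("UTF-8"));
        System.out.println(resolve("no-such-charset-xyz"));
    }
}
```

Catching the two specific exceptions rather than a broad Exception keeps unrelated failures visible.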
UriProfile throw clauses modified so that the invalid character gets hex encoded and the message becomes more meaningful.
    • -3
    • +3
    ./java/org/jwat/common/UriProfile.java
  1. … 1 more file in changeset.
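The idea of hex encoding the offending character in the exception message can be sketched as follows. The validity rule used here (reject anything at or below 0x20 or at or above 0x7f) and the names UriCharCheck, validate and reasonFor are assumptions for illustration only; the actual UriProfile rules and message text are not shown in this listing.

```java
import java.net.URISyntaxException;

// Hypothetical checker; the real UriProfile rules are not reproduced here.
final class UriCharCheck {
    // Throws if the input contains a space, control, or non-ASCII character,
    // naming the character in hex so unprintable values stay readable.
    static void validate(String uri) throws URISyntaxException {
        for (int i = 0; i < uri.length(); i++) {
            char c = uri.charAt(i);
            if (c <= 0x20 || c >= 0x7f) {
                String reason = String.format("Invalid character 0x%02x", (int) c);
                throw new URISyntaxException(uri, reason, i);
            }
        }
    }

    // Convenience wrapper: returns the failure reason, or null if the input passes.
    static String reasonFor(String uri) {
        try {
            validate(uri);
            return null;
        } catch (URISyntaxException e) {
            return e.getReason();
        }
    }

    public static void main(String[] args) {
        System.out.println(reasonFor("http://a b/"));
    }
}
```

A hex value in the message is useful because the raw character may be invisible or may corrupt log output.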
JWAT-77: Unit tests and bug fixes for newly implemented ArcFileWriter/WarcFileWriter and related classes.

JWAT-76: Fix for archiveLengthStr/contentLengthStr set and archiveLength/contentLength null when using payload length validation.

Removed a lot of tags and replaced with spaces. (Company policy)

Minor code cleanup.

    • -47
    • +47
    ./java/org/jwat/common/ANVLRecord.java
    • -259
    • +259
    ./java/org/jwat/common/ArrayUtils.java
    • -158
    • +158
    ./java/org/jwat/common/ByteArrayIOStream.java
  1. … 93 more files in changeset.
Unit test/javadoc of merged classes in jwat-common. Trivial full unit test of old class.
    • -6
    • +77
    ./java/org/jwat/common/ArrayUtils.java
    • -0
    • +43
    ./java/org/jwat/common/ByteArrayIOStream.java
  1. … 3 more files in changeset.
Merge in files from jwat-tools with history.
    • -1
    • +22
    ./java/org/jwat/common/ArrayUtils.java
    • -4
    • +42
    ./java/org/jwat/common/ByteArrayIOStream.java
  1. … 13 more files in changeset.
ANVLRecord adds space after ":" to make output pretty.

Made constant in WarcFileWriter public.

    • -1
    • +8
    ./java/org/jwat/common/ANVLRecord.java
  1. … 2 more files in changeset.
JWAT-78: PayloadManager in JWAT-Tools seems to have a bug related to the closing of the RandomAccessFile and a non null tmpfile object.

Added some unit tests of most common classes.

Tweaked some constant definitions.

    • -3
    • +11
    ./java/org/jwat/common/ByteArrayIOStream.java
  1. … 6 more files in changeset.
Changed some method signatures.
    • -4
    • +12
    ./java/org/jwat/common/ANVLRecord.java
  1. … 1 more file in changeset.
Added initial ANVLRecord class.
    • -0
    • +77
    ./java/org/jwat/common/ANVLRecord.java
  1. … 1 more file in changeset.
WWM-160: Be lenient with broken protocol version field in status line.
    • -0
    • +6
    ./java/org/jwat/common/HttpHeader.java
  1. … 1 more file in changeset.
Make buffer sizes configurable in PayloadManager and ByteArrayIOStream.

Added new ManagedPayloadManager to support this.

    • -8
    • +22
    ./java/org/jwat/common/ByteArrayIOStream.java
  1. … 2 more files in changeset.
Fixed some texts. Added some spaces.
    • -1
    • +1
    ./java/org/jwat/common/EncodedWords.java
    • -1
    • +1
    ./java/org/jwat/common/HeaderLine.java
    • -1
    • +1
    ./java/org/jwat/common/HeaderLineReader.java
  1. … 30 more files in changeset.
Stuff with close() now implements Closeable
  1. … 9 more files in changeset.
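A short sketch of why implementing Closeable is useful: since Java 7, Closeable extends AutoCloseable, so such classes can be used in a try-with-resources statement. The Resource class below is a hypothetical example, not one of the classes touched by this changeset.

```java
import java.io.Closeable;

// Hypothetical demo class; not part of the JWAT code base.
final class CloseableDemo {
    // A class with a close() method that now declares Closeable.
    static final class Resource implements Closeable {
        boolean closed = false;

        // Narrowed to throw no checked exception, which an override may do.
        @Override
        public void close() {
            closed = true;
        }
    }

    // Uses the resource in try-with-resources and reports whether it was closed.
    static boolean useAndClose() {
        Resource resource = new Resource();
        try (Resource r = resource) {
            // Work with r here; close() runs automatically on exit.
        }
        return resource.closed;
    }

    public static void main(String[] args) {
        System.out.println(useAndClose()); // prints true
    }
}
```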
Fixed the javadoc so that the command 'mvn -Psonatype-oss-release clean install -Dgpg.skip=true' works
    • -1
    • +1
    ./java/org/jwat/common/FieldValidator.java
    • -0
    • +1
    ./java/org/jwat/common/HeaderLine.java
    • -1
    • +1
    ./java/org/jwat/common/InputStreamNoSkip.java
  1. … 6 more files in changeset.
JWAT-72: Scheme class is not case insensitive

Unit test of isArcRecord().

  1. … 1 more file in changeset.
Fixed javadoc mistake.
    • -1
    • +1
    ./java/org/jwat/common/IPAddressParser.java
Work in progress:

Git style help command.

Improved multithreading for some tasks.

Support for linux file identification.

Rewrote arc2warc to support multiple filedesc records and payload repair.

ManagedPayload added for reloading of payload in different validators.

Fully implemented 2 step XML validation.

Improved FileIdent based on file name and stream peeking.

    • -0
    • +240
    ./java/org/jwat/common/ArrayUtils.java
    • -0
    • +188
    ./java/org/jwat/common/ByteArrayIOStream.java
  1. … 5 more files in changeset.
Missing javadoc, improved UriProfile for extending profiles.

Added some TODOs for some missing unit tests.

    • -1
    • +14
    ./java/org/jwat/common/UriProfile.java
  1. … 2 more files in changeset.
Fix for JWAT-65 and JWAT-66.

JWAT-65: HttpHeader digest not calculated when using getInputStream on payload.

JWAT-66: ArcRecords with invalid Urls do not get their HttpHeader parsed.

    • -15
    • +53
    ./java/org/jwat/common/Scheme.java
  1. … 6 more files in changeset.
Followup from reviews.

Saving of test data for use in JHOVE2.

'no-type' is ignored when looking for http headers.

Improved detection of possible arc record.

Minor tweaks.

    • -0
    • +83
    ./java/org/jwat/common/Scheme.java
    • -0
    • +1
    ./java/org/jwat/common/UriProfile.java
  1. … 39 more files in changeset.
Zero length ARC, WARC and GZip files are now reported as non compliant.
    • -3
    • +3
    ./java/org/jwat/common/InputStreamNoSkip.java
  1. … 15 more files in changeset.
Generation of ARC/WARC test files based on unittests, reorganizing of test files. Minor tweaks.
    • -0
    • +63
    ./java/org/jwat/common/InputStreamNoSkip.java
  1. … 67 more files in changeset.
Raw ARC record line now stored.

Removed some tabs.

    • -0
    • +35
    ./java/org/jwat/common/HeaderLine.java
  1. … 5 more files in changeset.
Strict validation of <> encapsulating some URIs.

Tying up loose ends.

    • -2
    • +2
    ./java/org/jwat/common/HttpHeader.java
  1. … 14 more files in changeset.
Added some getters.
    • -10
    • +2
    ./java/org/jwat/common/HttpHeader.java
  1. … 2 more files in changeset.
Minor refactoring of API, unittests, etc.
    • -2
    • +3
    ./java/org/jwat/common/HttpHeader.java
    • -0
    • +1
    ./java/org/jwat/common/UriProfile.java
  1. … 8 more files in changeset.
Changed existing spaces to tabs even though personal preference is for tabs.
    • -21
    • +21
    ./java/org/jwat/common/UriProfile.java
  1. … 4 more files in changeset.