Clone Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
CR-JWAS-33: Follow-up on review.
  1. … 71 more files in changeset.
UriProfile throw clauses modified so that the invalid character gets hex encoded and the message becomes more meaningful.
    • -0
    • +20
    ./org/jwat/common/TestUriProfile.java
  1. … 1 more file in changeset.
JWAT-77: Unit tests and bug fixes for newly implemented ArcFileWriter/WarcFileWriter and related classes.

JWAT-76: Fix for archiveLengthStr/contentLengthStr set and archiveLength/contentLength null when using payload length validation.

Removed alot of tags and replaced with spaces. (Company policy)

Minor code cleanup.

    • -73
    • +74
    ./org/jwat/common/TestANVLRecord.java
    • -507
    • +507
    ./org/jwat/common/TestArrayUtils.java
    • -453
    • +453
    ./org/jwat/common/TestByteArrayIOStream.java
    • -20
    • +20
    ./org/jwat/common/TestDiagnosisType.java
    • -0
    • +127
    ./org/jwat/common/TestHelpers.java
    • -2
    • +1
    ./org/jwat/common/TestIPAddressParser.java
  1. … 85 more files in changeset.
Unit test/javadoc of merged classes in jwat-common. Trivial full unit test of old class.
    • -8
    • +456
    ./org/jwat/common/TestArrayUtils.java
    • -0
    • +50
    ./org/jwat/common/TestDiagnosisType.java
  1. … 2 more files in changeset.
Merge in files from jwat-tools with history.
    • -1
    • +18
    ./org/jwat/common/TestArrayUtils.java
    • -3
    • +28
    ./org/jwat/common/TestByteArrayIOStream.java
  1. … 13 more files in changeset.
ANVLRecord adds space after ":" to make output pretty.

Made constant in WarcFileWriter public.

    • -0
    • +18
    ./org/jwat/common/TestANVLRecord.java
  1. … 2 more files in changeset.
JWAT-78: PayloadManager in JWAT-Tools seems to have a bug related to the closing of the RandomAccessFile and a non null tmpfile object.

Added some unit tests of most common classes.

Tweaked some constant definitions.

    • -0
    • +473
    ./org/jwat/common/TestByteArrayIOStream.java
  1. … 5 more files in changeset.
Changed some method signtures.
  1. … 1 more file in changeset.
Added initial ANVLRecord class.
    • -0
    • +86
    ./org/jwat/common/TestANVLRecord.java
  1. … 1 more file in changeset.
WWM-160: Be lenient with broken protocol version field in status line.
  1. … 1 more file in changeset.
JWAT-69: Unit tested WARC-Refers-To-Target-URI and WARC-Refers-To-Date in reader.

Fixed some small bugs and omissions with the reading of those new headers.

Removed some tabs.

    • -34
    • +34
    ./org/jwat/common/TestInputStreamNoSkip.java
  1. … 12 more files in changeset.
Added some unit tests.
    • -0
    • +31
    ./org/jwat/common/TestByteCountingPushbackInputStreamPeek.java
    • -1
    • +77
    ./org/jwat/common/TestInputStreamNoSkip.java
  1. … 6 more files in changeset.
Work in progress on unified ARC/WARC reader. Module not included yet.
    • -0
    • +32
    ./org/jwat/common/TestInputStreamNoSkip.java
  1. … 7 more files in changeset.
Work in progress:

Git style help command.

Improved multithreading for some tasks.

Support for linux file identification.

Rewrote arc2warc to support multiple filedesc records and payload repair.

ManagedPayload added for reloading of payload in different validators.

Fully implemented 2 step XML validation.

Improved FileIdent based on file name and stream peeking.

    • -0
    • +126
    ./org/jwat/common/TestArrayUtils.java
  1. … 6 more files in changeset.
Missing javadoc, improved UriProfile for extending profiles.

Added some TODOs for some missing unit tests.

    • -0
    • +31
    ./org/jwat/common/TestUriProfile.java
    • -0
    • +5
    ./org/jwat/common/Test_AdditionalReadMethods.java
  1. … 3 more files in changeset.
Fix for JWAT-65 and JWAT-66.

JWAT-65: HttpHeader digest not calculated when using getInputStream on payload.

JWAT-66: ArcRecords with invalid Urls do not get their HttpHeader parsed.

  1. … 8 more files in changeset.
Followup from reviews.

Saving of test data for use in JHOVE2.

'no-type' is ignored when looking for http headers.

Improved detection of possible arc record.

Minor tweaks.

    • -0
    • +61
    ./org/jwat/common/TestScheme.java
  1. … 42 more files in changeset.
Raw ARC record line now stored.

Removed some tabs.

    • -0
    • +96
    ./org/jwat/common/TestHeaderLine.java
  1. … 5 more files in changeset.
Strict validation of <> encapsulating some URIs.

Tying up loose ends.

  1. … 15 more files in changeset.
Added an even more relaxed Uri profile for Heritrix written data.

Warc-Profile treated as an URI, oversight fixed (JWAT-61).

Minor review stuff.

Refactored Test classes file names.

    • -206
    • +0
    ./org/jwat/common/TestAdditionalReadMethods.java
    • -1
    • +207
    ./org/jwat/common/Test_AdditionalReadMethods.java
    • -335
    • +0
    ./org/jwat/common/TestStreamsInStreams.java
    • -2
    • +337
    ./org/jwat/common/Test_StreamsInStreams.java
  1. … 53 more files in changeset.
Uri methods added with profile parameter, additional uri profiles added, minor unittesting, javadocs and review changes.
    • -73
    • +216
    ./org/jwat/common/TestUri.java
  1. … 4 more files in changeset.
Unittest for UriProfile (JWAT-59).
    • -2
    • +83
    ./org/jwat/common/TestIPAddressParser.java
    • -2
    • +126
    ./org/jwat/common/TestUriProfile.java
  1. … 1 more file in changeset.
URI and URI profile split into separate classes. Currently only includes a strict RFC3986 profile.
    • -35
    • +130
    ./org/jwat/common/TestUri.java
    • -0
    • +67
    ./org/jwat/common/TestUriProfile.java
  1. … 2 more files in changeset.
JWAT-59: Good progress on JWAt Uri implementation. Almost ready and tested.
  1. … 3 more files in changeset.
Followup to review CR-JWAS-25. Experimental Uri implementation.
  1. … 31 more files in changeset.
Somewhat conclusion of the following issues:

JWAT-46: ARC reader refactoring

JWAT-8: Unit tests and coverage of ARCRecordBase, ArcRecord and ArcVersionBlock

JWAT-45: ARC writer

Partial lenient Uri implementation.

    • -0
    • +67
    ./org/jwat/common/TestUri.java
  1. … 16 more files in changeset.
Followup on reviews. (CR-JWAS-19, CR-JWAS-20, CR-JWAS-21, CR-JWAS-22, CR-JWAS-23, CR-JWAS-24)
  1. … 44 more files in changeset.
JWAT-57: Added workaround for test using toString() on ContentType.
    • -1
    • +5
    ./org/jwat/common/TestContentType.java
    • -163
    • +166
    ./org/jwat/common/TestHttpHeader_Response.java
Fixed a bug introduced with http request support in HttpHeader parser.

Added some unit tests for absolute resources in http request.

Added some more unit testing here and there.

Changed the ARC Writer slightly, still not 100% functional nor tested.

    • -0
    • +3
    ./org/jwat/common/TestContentType.java
    • -0
    • +71
    ./org/jwat/common/TestDiagnosis.java
    • -501
    • +2
    ./org/jwat/common/TestHttpHeader.java
    • -0
    • +312
    ./org/jwat/common/TestHttpHeader_Request.java
    • -0
    • +301
    ./org/jwat/common/TestHttpHeader_Response.java
  1. … 8 more files in changeset.
Added number of information strings comparison to diagnosis constructor.
    • -27
    • +35
    ./org/jwat/common/TestDiagnosis.java
  1. … 6 more files in changeset.