JWAT

[maven-release-plugin] copy for tag jwat-1.0.5
[maven-release-plugin] prepare release jwat-1.0.5
ArcReader and WarcReader now implement Iterable<..> interface.
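As a rough illustration of what an Iterable reader offers callers, the sketch below wraps a pull-style "get next record" method in an iterator so the reader works in a for-each loop. `LineRecordReader`, its `getNextRecord` method and `readAll` are hypothetical stand-ins written for this example; they are not the JWAT classes, and the actual ArcReader/WarcReader implementation may differ.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical stand-in for a record reader; NOT a JWAT class.
class LineRecordReader implements Iterable<String> {
    private final BufferedReader in;

    LineRecordReader(BufferedReader in) {
        this.in = in;
    }

    // Pull-style access: returns the next record, or null at end of input.
    String getNextRecord() throws IOException {
        return in.readLine();
    }

    @Override
    public Iterator<String> iterator() {
        return new Iterator<String>() {
            private String next;      // record fetched ahead by hasNext()
            private boolean fetched;  // true while 'next' has not been handed out

            @Override
            public boolean hasNext() {
                if (!fetched) {
                    try {
                        next = getNextRecord();
                    } catch (IOException e) {
                        // Iterator methods cannot throw checked exceptions.
                        throw new UncheckedIOException(e);
                    }
                    fetched = true;
                }
                return next != null;
            }

            @Override
            public String next() {
                if (!hasNext()) {
                    throw new NoSuchElementException();
                }
                fetched = false;
                return next;
            }
        };
    }

    // Usage: the for-each loop is what implementing Iterable enables.
    static List<String> readAll(String text) {
        List<String> out = new ArrayList<>();
        for (String record : new LineRecordReader(new BufferedReader(new StringReader(text)))) {
            out.add(record);
        }
        return out;
    }
}
```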
Merged in tledouxfr/jwat/unknown_charset (pull request #6)

Use default charset in case of bad charset and handle bad encoding in WARC-Target-URI header (add a simple test case)
    • binary
    /jwat-warc/src/test/resources/invalid-warcfile-encoding-headers.warc.gz
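The charset fallback described in this change can be sketched with the standard library alone. This is a minimal illustration, not the code from the pull request: `CharsetFallback.resolve` is a hypothetical helper, and the fallback charset is passed in by the caller because the log does not say which default JWAT uses.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;
import java.nio.charset.StandardCharsets;
import java.nio.charset.UnsupportedCharsetException;

// Hypothetical helper; NOT part of JWAT.
class CharsetFallback {
    // Returns the named charset, or 'fallback' when the name is missing,
    // syntactically illegal, or not supported by this JVM.
    static Charset resolve(String name, Charset fallback) {
        if (name == null) {
            return fallback;
        }
        try {
            return Charset.forName(name.trim());
        } catch (IllegalCharsetNameException | UnsupportedCharsetException e) {
            return fallback;
        }
    }

    public static void main(String[] args) {
        Charset fallback = StandardCharsets.ISO_8859_1; // assumed default, for illustration only
        System.out.println(resolve("utf-8", fallback));
        System.out.println(resolve("bad charset!", fallback));
    }
}
```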
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] copy for tag jwat-1.0.4
[maven-release-plugin] prepare release jwat-1.0.4
POM cleanup.
UriProfile throw clauses modified so that the invalid character gets hex encoded and the message becomes more meaningful.
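One way to picture the effect of hex encoding the offending character: control or whitespace characters become visible in the exception message. The class name, method and message wording below are assumptions made for this sketch; the actual UriProfile messages are not shown in this log.

```java
// Hypothetical illustration; NOT the UriProfile code.
class InvalidCharMessage {
    // Builds a message that shows the offending character as a hex code,
    // so unprintable or whitespace characters are still readable.
    static String describe(String uri, int index) {
        char c = uri.charAt(index);
        return String.format("Invalid URI character 0x%04X at index %d", (int) c, index);
    }

    public static void main(String[] args) {
        // The space at index 1 is reported by its code point rather than as a blank.
        System.out.println(describe("a b", 1));
    }
}
```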
[maven-release-plugin] prepare for next development iteration
[maven-release-plugin] copy for tag jwat-1.0.3
[maven-release-plugin] prepare release jwat-1.0.3
JWAT-77: Unit tests and bug fixes for newly implemented ArcFileWriter/WarcFileWriter and related classes.

JWAT-76: Fix for archiveLengthStr/contentLengthStr set and archiveLength/contentLength null when using payload length validation.

Removed a lot of tabs and replaced them with spaces. (Company policy)

Minor code cleanup.

    • -0
    • +505
    /jwat-arc/src/test/java/org/jwat/arc/TestArcFileWriter.java
    • -0
    • +64
    /jwat-arc/src/test/java/org/jwat/arc/TestArcFileWriterConfig.java
  1. … 82 more files in changeset.
Unit test/javadoc of merged classes in jwat-common. Trivial full unit test of old class.
    • -0
    • +50
    /jwat-common/src/test/java/org/jwat/common/TestDiagnosisType.java
Clean up unwanted head.
Merge in files from jwat-tools with history.
    • -0
    • +41
    /jwat-archive/pom.xml
update tags
ANVLRecord adds space after ":" to make output pretty.

Made constant in WarcFileWriter public.

JWAT-78: PayloadManager in JWAT-Tools seems to have a bug related to the closing of the RandomAccessFile and a non null tmpfile object.

Added some unit tests of most common classes.

Tweaked some constant definitions.

Changed some method signatures.
Added initial ANVLRecord class.
    • -0
    • +77
    /jwat-common/src/main/java/org/jwat/common/ANVLRecord.java
WWM-160: Be lenient with broken protocol version field in status line.
Make buffer sizes configurable in PayloadManager and ByteArrayIOStream.

Added new ManagedPayloadManager to support this.

JWAT-77: Add (W)ArcFileWriter helper classes.
    • -0
    • +26
    /jwat-arc/src/main/java/org/jwat/arc/ArcFileNaming.java
    • -0
    • +44
    /jwat-arc/src/main/java/org/jwat/arc/ArcFileNamingSingleFile.java
    • -0
    • +138
    /jwat-arc/src/main/java/org/jwat/arc/ArcFileWriter.java
    • -0
    • +26
    /jwat-warc/src/main/java/org/jwat/warc/WarcFileNaming.java
    • -0
    • +82
    /jwat-warc/src/main/java/org/jwat/warc/WarcFileNamingDefault.java
    • -0
    • +177
    /jwat-warc/src/main/java/org/jwat/warc/WarcFileWriter.java
Minor tweaks. Changed version to 0.6.0-SNAPSHOT. Deployed to maven central from now on.
Changed manageRecord back from private to public since it was used after all.
Merged in tledouxfr/jwat-tools/containermd_task (pull request #1)

Adding a containermd task to create the containerMD representation of an arc or a warc file.

Use the 1.0.2 version of the jwat core libraries.

Correct the usage of identified payload by closing the handle in the case where a temporary file has to be created (large files) and delete them at the end.

[maven-release-plugin] prepare for next development iteration