
thomas ledoux <tledouxfr@gmail.com> in JWAT

Use default charset in case of bad charset and handle bad encoding in WARC-Target-URI header (add a simple test case)
    • binary
    /jwat-warc/src/test/resources/invalid-warcfile-encoding-headers.warc.gz
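A minimal sketch of the charset fallback described in this commit, not the actual JWAT code. The class `CharsetFallback` and its `resolve` method are hypothetical names, and "default charset" is assumed here to mean the JVM default; the library may use a different default.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;
import java.nio.charset.UnsupportedCharsetException;

class CharsetFallback {
    // Hypothetical helper (not part of JWAT): resolve a declared charset name,
    // falling back to the JVM default charset when the name is missing,
    // syntactically illegal, or not supported by this JVM.
    public static Charset resolve(String name) {
        if (name == null || name.trim().isEmpty()) {
            return Charset.defaultCharset();
        }
        try {
            return Charset.forName(name.trim());
        } catch (IllegalCharsetNameException | UnsupportedCharsetException e) {
            // Bad charset in the header: do not fail the record, use the default.
            return Charset.defaultCharset();
        }
    }
}
```

With this shape, a record that declares a bad charset is still parsed rather than rejected.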
Adding a containermd task to create the containerMD representation of an ARC or a WARC file.

Use the 1.0.2 version of the jwat core libraries.

Correct the usage of the identified payload: close the handle when a temporary file has to be created (large files) and delete the temporary files at the end.
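The close-then-delete pattern this commit describes can be sketched as follows. This is an illustration under assumptions, not the project's code: `TempPayload` and `spoolAndCount` are hypothetical names, and the payload is simply counted byte by byte in place of real identification.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

class TempPayload {
    // Hypothetical helper: spool a payload to a temporary file, read it back,
    // close the handle, and always delete the temporary file afterwards.
    public static long spoolAndCount(byte[] payload) {
        Path tmp = null;
        try {
            tmp = Files.createTempFile("payload-", ".tmp");
            Files.write(tmp, payload);
            long count = 0;
            // try-with-resources closes the handle before the file is deleted,
            // which matters on platforms that refuse to delete open files.
            try (InputStream in = Files.newInputStream(tmp)) {
                byte[] buf = new byte[8192];
                int n;
                while ((n = in.read(buf)) != -1) {
                    count += n;
                }
            }
            return count;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } finally {
            if (tmp != null) {
                try {
                    Files.deleteIfExists(tmp);
                } catch (IOException ignored) {
                    // Best-effort cleanup in this sketch.
                }
            }
        }
    }
}
```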