During the development of the JWAT library it has proved useful to test the library with varying input data.
To this end a small commandline tool was created.
Download binary and source
The tool is currently hosted on bitbucket. Zip archives are also available.
The following options are currently available in JWAT-Tools.
You can supply one or more files. Each file can contain * and/or ? wildcards, but only in the filename part of the path. You can use more wildcards at the same time if you want.
Reads and validates all the files supplied. Files which are not recognized as either GZip, ARC or WARC are skipped. If wildcards are used, files that do not match are also skipped.
Use -e for more than a summary of errors.
Decompress one or more (multi-part) GZip files and write the decompressed data to a new file, one for each input file.
Useful for decompressing ARC and/or WARC files.
Is currently ignored. All operations are currently recursive.
Compress normal and/or WARC files.
-i (interval extract)
Extract an interval from a given file. Interval can be expressed as offset, offset2 or offset,+length. Offset and length can be expressed in hex by pre-pending "$" or "0x".
Unpack a (multi-file)GZip and save each entry as individual files.
Convert ARC files to WARC.