Welcome to the Java Web Archive Toolkit

This wiki gives an overview of the JWAT packages and goes into some detail on how the main classes are implemented.

The JWAT code was originally written for use in a number of JHove2 modules only. Because the classes are also useful outside the JHove2 project, the code was split out into an independent project.

Note: Although the repository is about 80 MB, most of that is test data. The libraries themselves are very small.


  • GZip support.
    • Reader with validation.
    • Writer with validation.
    • (Multi-file) GZip validating decompressor/compressor.
    • GZip Input/Output wrapper streams.
  • ARC support.
    • Reader with validation.
    • Writer with validation.
  • WARC support.
    • Reader with validation.
    • Writer with validation.
  • Encoding support.
    • Base64, Base32 and Base16.
    • ISO8859-1.
    • UTF-8.
    • QuotedString.
    • EncodedWords.
    • URI including profiles.
  • Advanced header line reader.
  • HttpHeader request/response parser/validation.
  • Content-Type parser/validation.
  • Various special purpose stream implementations.
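The "(Multi-file) GZip" item above refers to GZip data made up of several complete members stored back to back, which is how compressed ARC and WARC files usually store their records. The following is a minimal sketch of that layout using only the JDK's `java.util.zip` classes. It does not use the JWAT API, and the class and method names (`MultiMemberGzipSketch`, `gzipMember`, `concatenate`, `decompressAll`) are hypothetical names chosen for this illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Illustration only: plain JDK classes, not the JWAT readers/writers.
public class MultiMemberGzipSketch {

    /** Compresses one payload into a single, complete GZip member. */
    static byte[] gzipMember(String payload) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream gzip = new GZIPOutputStream(buffer)) {
            gzip.write(payload.getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }

    /** Stores the members back to back, forming a multi-member GZip stream. */
    static byte[] concatenate(byte[]... members) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (byte[] member : members) {
            out.write(member, 0, member.length);
        }
        return out.toByteArray();
    }

    /** Decompresses every member in sequence and returns the joined text. */
    static String decompressAll(byte[] multiMember) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPInputStream gzip =
                 new GZIPInputStream(new ByteArrayInputStream(multiMember))) {
            byte[] chunk = new byte[4096];
            int read;
            while ((read = gzip.read(chunk)) != -1) {
                out.write(chunk, 0, read);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        byte[] stream = concatenate(gzipMember("first "), gzipMember("second"));
        System.out.println(decompressAll(stream));
    }
}
```

The JDK stream hides the member boundaries and returns one continuous byte sequence. A validating reader of the kind listed above would presumably need to expose each member separately so that its header and trailer can be checked on their own.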