Child pages
  • jwat-warc
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 6 Next »

warc package

org.jwat.warc:

WarcConstants.java

Most of the constants should be collected in this class, most of which are primarily for internal use.

ReaderFactory and Readers

WarcReaderFactory.java: This factory can be used to create the various types of readers with optional buffering. You can either get compressed or uncompressed readers. There are also methods which can auto-detect whether or not a compressed reader is required.
WarcReader.java: Abstract reader class which is the base for the all the readers. It also defines the options which can be set on a reader. Currently only digest options.
WarcReaderCompressed.java: A reader implementation for reading compressed records.
WarcReaderUncompressed.java: A reader implementation for reading uncompressed records.

WarcRecord.java

ArcRecordBase.java: Abstract class which is the base for both the VersionBlock and ArcRecord classes. Contains most of the fields, validation and methods.
ArcVersionBlock.java: VersionBlock parser including relevant fields, validation and methods.
ArcRecord.java: Record parser including relevant fields, validation and methods.

Auxiliary classes

WarcHeaderLine.java
WarcDateParser.java: Parses and validated an WARC date.
WarcDigest.java
WarcErrorType.java: Defines the different possible error types.
WarcValidationError.java: Defines an WARC validation error using a type, key and value.

Writers

WarcWriter.java
WarcWriterUncompressed.java

  • No labels