Class ArchiveExtractCDX


  • public class ArchiveExtractCDX
    extends java.lang.Object
    Command line tool for extracting CDX information from given ARC/WARC files.

    Usage: java dk.netarkivet.common.tools.ExtractCDX file1.ext [file2.ext ...] > myindex.cdx

    "ext" can be arc, arc.gz, warc or warc.gz

    Note: Does not depend on logging - communicates failures on stderr.

    • Method Summary

      All Methods Static Methods Concrete Methods 
      Modifier and Type Method Description
      static void main​(java.lang.String[] argv)
      Main method.
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Method Detail

      • main

        public static void main​(java.lang.String[] argv)
        Main method. Extracts CDX from all given files and outputs the index on stdout.
        Parameters:
        argv - A list of (absolute paths to) files to index.