...

Linux scripts

jwattools.sh

jwattools_debug.sh

jwattools_debug_suspended.sh

Options

This is the new style usage introduced with v0.5.5.

The command-line interface is a work in progress, so the arguments and options will be refactored at some point.

Unfortunately, I have a small command-line package that also requires refactoring.

The following options are currently available in JWAT-Tools.

...

More refactoring may follow.

Command-line options (v0.5.5)
C:\Java\workspace\jwat-tools>target\jwat-tools-0.5.5-SNAPSHOT\jwattools.cmd
JWATTools v0.5.5
usage: JWATTools [-dte19] command [file ...]
Commands:
   arc2warc     convert ARC to WARC
   cdx          create a CDX index (unsorted)
   compress     compress
   decompress   decompress
   extract      extract ARC/WARC record(s)
   interval     interval extract
   pathindex    create a heritrix path index (unsorted)
   test         test validity of ARC/WARC/GZip file(s)
   unpack       unpack multifile GZip
Options:
   -r      recursive
   -w<x>   thread(s)
Test options:
   -e   show errors
   -l   relaxed URL URI validation
   -x   to validate text/xml payload (eg. mets)
Compress options:
   -1, --fast   compress faster
   -9, --slow   compress better

C:\Java\workspace\jwat-tools>
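
As a rough illustration of the `command [file ...]` form shown in the usage text, the sketch below builds invocations for a few of the listed commands. The `jwat` wrapper function, the `JWATTOOLS` variable and the file names are assumptions made for this example and are not part of JWAT-Tools. The wrapper only prints the command line (a dry run) instead of executing it. Options are left out on purpose, because the usage text does not make their position relative to the command unambiguous.

```shell
#!/bin/sh
# Dry-run sketch: print the JWAT-Tools command line instead of running it.
# JWATTOOLS, jwat and the file names are hypothetical, chosen for illustration.
JWATTOOLS="${JWATTOOLS:-./jwattools.sh}"

jwat() {
  # $1 is a command from the usage listing; the remaining arguments are input files.
  echo "$JWATTOOLS" "$@"
}

jwat test example.warc.gz       # test validity of an ARC/WARC/GZip file
jwat cdx example.warc.gz        # create an (unsorted) CDX index
jwat arc2warc example.arc.gz    # convert ARC to WARC
```

To actually run the tool, the `echo` in `jwat` could be dropped, assuming `jwattools.sh` is in the current directory.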

This is the old-style usage for v0.5.4. (It will be removed once a v0.5.5 binary has been uploaded.)

Command-line options (v0.5.4)
C:\Java\workspace\jwat-tools>target\jwat-tools-0.5.4-SNAPSHOT\jwattools.cmd
JWATTools v0.5.4
usage: JWATTools [-dte19] [file ...]
 -t   test validity of ARC, WARC and/or GZip file(s)
 -r   recursive
 -e   show errors
 -l   relaxed URL URI validation
 -x   to validate text/xml payload (eg. mets)
 -d   decompress
 -1   compress faster
 -9   compress better
 -i   interval extract
 -u   unpack multifile gzip
 -c   convert arc to warc
 -C   output CDX
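
For anyone moving scripts from the old-style flags to the new-style commands, the following sketch maps each v0.5.4 action flag to the v0.5.5 command with the matching description. The mapping is inferred by comparing the two usage listings and has not been confirmed against the tool itself. The function name `old_to_new` is hypothetical. The flags `-r`, `-e`, `-l` and `-x` are not mapped because they remain options in the new style, and `-1`/`-9` remain options of `compress`.

```shell
#!/bin/sh
# Hypothetical helper: translate a v0.5.4 action flag into the v0.5.5 command
# name. The mapping is an assumption based on matching descriptions in the
# two usage listings.
old_to_new() {
  case "$1" in
    -t)    echo "test" ;;        # test validity of ARC, WARC and/or GZip file(s)
    -d)    echo "decompress" ;;  # decompress
    -1|-9) echo "compress" ;;    # compress; -1/-9 stay on as compress options
    -i)    echo "interval" ;;    # interval extract
    -u)    echo "unpack" ;;      # unpack multifile gzip
    -c)    echo "arc2warc" ;;    # convert arc to warc
    -C)    echo "cdx" ;;         # output CDX
    *)     echo "no new-style command for: $1" >&2; return 1 ;;
  esac
}

old_to_new -c
old_to_new -C
```

The helper returns a non-zero status for flags that have no command equivalent, so a migration script can detect them and handle them separately.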

...