|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object dk.netarkivet.wayback.DeduplicateToCDXApplication
public class DeduplicateToCDXApplication
A simple command line application to generate cdx files from local crawl-log files.
Constructor Summary | |
---|---|
DeduplicateToCDXApplication()
|
Method Summary | |
---|---|
void |
generateCDX(java.lang.String[] localCrawlLogs)
Takes an array of file names (relative or full paths) of crawl.log files from which duplicate records are to be extracted. |
static void |
main(java.lang.String[] args)
An application to generate unsorted cdx files from duplicate records present in a crawl.log file. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public DeduplicateToCDXApplication()
Method Detail |
---|
public void generateCDX(java.lang.String[] localCrawlLogs) throws java.io.IOException
localCrawlLogs
- a list of file names
java.io.FileNotFoundException
- if one of the files cannot be found
java.io.IOException
public static void main(java.lang.String[] args) throws java.io.IOException
args
- the file names (relative or absolute paths)
java.io.FileNotFoundException
- if one or more of the files does not exist
java.io.IOException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |