public class DefaultHeritrixLauncher extends HeritrixLauncher
Fields inherited from class HeritrixLauncher: CRAWL_CONTROL_WAIT_PERIOD
Modifier and Type | Method and Description |
---|---|
void | doCrawl() This method launches heritrix in the following way: 1. |
static DefaultHeritrixLauncher | getInstance(HeritrixFiles files) Get instance of this class. |
Methods inherited from class HeritrixLauncher: getControllerArguments, getHeritrixFiles, makeTemplateReadyForHeritrix1, setupOrderfile
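The two methods in the summary are typically used together: obtain a launcher with getInstance, then call doCrawl. The sketch below illustrates that call sequence and the two documented failure modes. It is not the library's code: so that it compiles on its own, the types `HeritrixFiles`, `DefaultHeritrixLauncher`, `ArgumentNotValid` and `IOFailure` are replaced by simplified, hypothetical stand-ins. The one-argument `HeritrixFiles` constructor, its two getters, the modelling of both exceptions as unchecked, and the helper `runCrawl` are assumptions made for illustration only.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

class LauncherUsageSketch {
    // Hypothetical helper: runs one crawl and reports the outcome as text.
    static String runCrawl(HeritrixFiles files) {
        try {
            DefaultHeritrixLauncher launcher = DefaultHeritrixLauncher.getInstance(files);
            launcher.doCrawl();
            return "crawl finished";
        } catch (ArgumentNotValid e) {
            // Documented for getInstance: files is null, or order.xml/seeds.txt is missing.
            return "invalid setup: " + e.getMessage();
        } catch (IOFailure e) {
            // Documented for doCrawl: invalid order.xml, controller init failure, interruption.
            return "crawl failed: " + e.getMessage();
        }
    }

    public static void main(String[] args) throws IOException {
        Path crawlDir = Files.createTempDirectory("crawldir");
        // Empty crawl directory: getInstance is expected to reject it.
        System.out.println(runCrawl(new HeritrixFiles(crawlDir.toFile())));
        Files.createFile(crawlDir.resolve("order.xml"));
        Files.createFile(crawlDir.resolve("seeds.txt"));
        // Both files present: the stand-in launcher accepts the setup.
        System.out.println(runCrawl(new HeritrixFiles(crawlDir.toFile())));
    }
}

// Stand-in exceptions, assumed here to be unchecked.
class ArgumentNotValid extends RuntimeException {
    ArgumentNotValid(String message) { super(message); }
}

class IOFailure extends RuntimeException {
    IOFailure(String message) { super(message); }
}

// Simplified stand-in: only records where order.xml and seeds.txt should be.
class HeritrixFiles {
    private final File crawlDir;
    HeritrixFiles(File crawlDir) { this.crawlDir = crawlDir; }
    File getOrderXmlFile() { return new File(crawlDir, "order.xml"); }
    File getSeedsTxtFile() { return new File(crawlDir, "seeds.txt"); }
}

// Stand-in launcher that mirrors only the documented argument checks.
class DefaultHeritrixLauncher {
    private final HeritrixFiles files;

    private DefaultHeritrixLauncher(HeritrixFiles files) { this.files = files; }

    static DefaultHeritrixLauncher getInstance(HeritrixFiles files) {
        if (files == null) {
            throw new ArgumentNotValid("files is null");
        }
        if (!files.getOrderXmlFile().isFile() || !files.getSeedsTxtFile().isFile()) {
            throw new ArgumentNotValid("order.xml or seeds.txt is missing");
        }
        return new DefaultHeritrixLauncher(files);
    }

    void doCrawl() {
        // The real method launches Heritrix; this stand-in does nothing.
    }
}
```

Against the real library, only the body of `runCrawl` would carry over; the stand-in classes would be dropped in favour of the library's own types, whose constructors may take different arguments.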
public static DefaultHeritrixLauncher getInstance(HeritrixFiles files) throws ArgumentNotValid
Parameters: files - Object encapsulating location of Heritrix crawldir and configuration files
Returns: DefaultHeritrixLauncher object
Throws: ArgumentNotValid - If either order.xml or seeds.txt does not exist, or argument files is null.
public void doCrawl() throws IOFailure
doCrawl in class HeritrixLauncher
Throws: IOFailure - if the order.xml is invalid, if unable to initialize the Heritrix CrawlController, or if the Heritrix process is interrupted.
Copyright © 2005–2016 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library. All rights reserved.