public class BnfHeritrixLauncher extends HeritrixLauncher
BnfHeritrixController
. Every turn of the crawl control
loop, asks the Heritrix controller to generate a progress report as a CrawlProgressMessage
and then send this
message on the JMS bus to be consumed by the HarvestMonitor
instance.CRAWL_CONTROL_WAIT_PERIOD
Modifier and Type | Method and Description |
---|---|
void |
doCrawl()
Initializes an Heritrix controller, then launches the Heritrix instance.
|
static BnfHeritrixLauncher |
getInstance(HeritrixFiles files)
Get instance of this class.
|
getControllerArguments, getHeritrixFiles, makeTemplateReadyForHeritrix1, setupOrderfile
public static BnfHeritrixLauncher getInstance(HeritrixFiles files) throws ArgumentNotValid
files
- Object encapsulating location of Heritrix crawldir and configuration filesBnfHeritrixLauncher
objectArgumentNotValid
- If either order.xml or seeds.txt does not exist, or argument files is null.public void doCrawl() throws IOFailure
HarvesterSettings.CRAWL_LOOP_WAIT_TIME
.CrawlProgressMessage
from the Heritrix controllerdoCrawl
in class HeritrixLauncher
IOFailure
Copyright © 2005–2015 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library.. All rights reserved.