java.lang.Object
  dk.netarkivet.harvester.harvesting.HeritrixLauncher
    dk.netarkivet.harvester.harvesting.controller.BnfHeritrixLauncher
public class BnfHeritrixLauncher extends HeritrixLauncher
BnF-specific Heritrix launcher that forces the use of BnfHeritrixController. On every turn of the crawl control loop, it asks the Heritrix controller to generate a progress report as a CrawlProgressMessage and then sends this message on the JMS bus, to be consumed by the HarvestMonitor instance.
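The loop described above can be sketched as follows. This is a minimal illustration and not the NetarchiveSuite implementation: `ProgressReport`, `Controller`, `MessageBus`, and `runLoop` are hypothetical stand-ins for `CrawlProgressMessage`, the Heritrix controller, and the JMS bus. The wait-then-report order and the "stop when the report says finished" condition are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;

public class CrawlLoopSketch {

    /** Hypothetical stand-in for CrawlProgressMessage (not the real class). */
    static final class ProgressReport {
        final int turn;
        final boolean finished;

        ProgressReport(int turn, boolean finished) {
            this.turn = turn;
            this.finished = finished;
        }
    }

    /** Hypothetical stand-in for the Heritrix controller. */
    interface Controller {
        ProgressReport getCrawlProgress();
    }

    /** Hypothetical stand-in for the JMS bus. */
    interface MessageBus {
        void send(ProgressReport report);
    }

    /**
     * Sends one progress report per turn and returns the number of turns.
     * Ending the loop when a report is marked finished is an assumption.
     */
    static int runLoop(Controller controller, MessageBus bus, long waitMillis)
            throws InterruptedException {
        int turns = 0;
        while (true) {
            Thread.sleep(waitMillis);                      // wait between turns
            ProgressReport report = controller.getCrawlProgress();
            bus.send(report);                              // publish for the monitor
            turns++;
            if (report.finished) {
                return turns;
            }
        }
    }

    /** Runs the loop against a fake controller that finishes after the given number of turns. */
    static List<ProgressReport> demo(int turnsUntilFinished) {
        final int[] turn = {0};
        final List<ProgressReport> sent = new ArrayList<>();
        Controller controller = () -> {
            turn[0]++;
            return new ProgressReport(turn[0], turn[0] >= turnsUntilFinished);
        };
        try {
            runLoop(controller, sent::add, 1L);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while waiting", e);
        }
        return sent;
    }

    public static void main(String[] args) {
        List<ProgressReport> sent = demo(3);
        System.out.println("Progress reports sent: " + sent.size());
    }
}
```

Keeping the controller and the bus behind small interfaces lets the loop be exercised without a running Heritrix instance or JMS broker.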
Field Summary | |
---|---|
(package private) static long | FRONTIER_REPORT_GEN_FREQUENCY: Frequency in seconds for generating the full harvest report. |
(package private) static org.apache.commons.logging.Log | log: The class logger. |
Fields inherited from class dk.netarkivet.harvester.harvesting.HeritrixLauncher |
---|
CRAWL_CONTROL_WAIT_PERIOD |
Method Summary | |
---|---|
void | doCrawl(): Initializes a Heritrix controller, then launches the Heritrix instance. |
static BnfHeritrixLauncher | getInstance(HeritrixFiles files): Gets an instance of this class. |
Methods inherited from class dk.netarkivet.harvester.harvesting.HeritrixLauncher |
---|
getControllerArguments, getHeritrixFiles, isDeduplicationEnabledInTemplate, setupOrderfile |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
static final org.apache.commons.logging.Log log
static final long FRONTIER_REPORT_GEN_FREQUENCY
Method Detail |
---|
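getInstance
public static BnfHeritrixLauncher getInstance(HeritrixFiles files) throws ArgumentNotValid
Gets an instance of this class.
Parameters:
files - Object encapsulating the location of the Heritrix crawldir and configuration files
Returns:
a BnfHeritrixLauncher object
Throws:
ArgumentNotValid - If either order.xml or seeds.txt does not exist, or if the argument files is null.

doCrawl
public void doCrawl() throws IOFailure
Initializes a Heritrix controller, then launches the Heritrix instance. The crawl control loop waits for the period set by HarvesterSettings.CRAWL_LOOP_WAIT_TIME and obtains a CrawlProgressMessage from the Heritrix controller.
Overrides:
doCrawl in class HeritrixLauncher
Throws:
IOFailure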
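The argument checks documented for getInstance can be illustrated with a self-contained sketch. Everything here is a hypothetical stand-in rather than the NetarchiveSuite API: `FilesSketch` plays the role of `HeritrixFiles`, `LauncherSketch` the role of `BnfHeritrixLauncher`, and `ArgumentNotValidSketch` the role of `ArgumentNotValid` (modelled as an unchecked exception, which is an assumption). Locating order.xml and seeds.txt directly inside the crawl directory is also an assumption.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

public class LauncherUsageSketch {

    /** Hypothetical stand-in for ArgumentNotValid. */
    static class ArgumentNotValidSketch extends RuntimeException {
        ArgumentNotValidSketch(String message) {
            super(message);
        }
    }

    /** Hypothetical stand-in for HeritrixFiles: only the two files named in the docs. */
    static final class FilesSketch {
        final File orderXml;
        final File seedsTxt;

        FilesSketch(File crawlDir) {
            this.orderXml = new File(crawlDir, "order.xml");
            this.seedsTxt = new File(crawlDir, "seeds.txt");
        }
    }

    /** Hypothetical stand-in for the launcher, with a validating factory method. */
    static final class LauncherSketch {
        final FilesSketch files;

        private LauncherSketch(FilesSketch files) {
            this.files = files;
        }

        static LauncherSketch getInstance(FilesSketch files) {
            if (files == null) {
                throw new ArgumentNotValidSketch("files must not be null");
            }
            if (!files.orderXml.isFile()) {
                throw new ArgumentNotValidSketch("order.xml does not exist");
            }
            if (!files.seedsTxt.isFile()) {
                throw new ArgumentNotValidSketch("seeds.txt does not exist");
            }
            return new LauncherSketch(files);
        }
    }

    /** Returns true if the factory accepts the argument, false if it rejects it. */
    static boolean accepts(FilesSketch files) {
        try {
            LauncherSketch.getInstance(files);
            return true;
        } catch (ArgumentNotValidSketch e) {
            return false;
        }
    }

    /** Returns {null accepted, empty crawldir accepted, complete crawldir accepted}. */
    static boolean[] demo() {
        try {
            Path dir = Files.createTempDirectory("crawldir");
            FilesSketch files = new FilesSketch(dir.toFile());
            boolean emptyDir = accepts(files);
            Files.createFile(dir.resolve("order.xml"));
            Files.createFile(dir.resolve("seeds.txt"));
            boolean completeDir = accepts(files);
            return new boolean[] {accepts(null), emptyDir, completeDir};
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        boolean[] r = demo();
        System.out.println("null=" + r[0] + " empty=" + r[1] + " complete=" + r[2]);
    }
}
```

Validating in the factory method means a launcher object only exists once both files are present, so a later crawl step does not need to repeat the checks.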