BnfHeritrixController
insteadpublic class JMXHeritrixController extends AbstractJMXHeritrixController
Constructor and Description |
---|
JMXHeritrixController(HeritrixFiles files)
Deprecated.
Create a JMXHeritrixController object.
|
Modifier and Type | Method and Description |
---|---|
boolean |
atFinish()
Deprecated.
Query whether Heritrix is in a state where it can finish crawling.
|
void |
beginCrawlStop()
Deprecated.
Tell Heritrix to stop crawling.
|
void |
cleanup()
Deprecated.
Cleanup after an Heritrix process.
|
boolean |
crawlIsEnded()
Deprecated.
Check if the crawl has ended, either because Heritrix finished of its own, or because we terminated it.
|
int |
getActiveToeCount()
Deprecated.
Get the number of currently active ToeThreads (crawler threads).
|
int |
getCurrentProcessedKBPerSec()
Deprecated.
Get an estimate of the rate, in kb, at which documents are currently being processed by the crawler.
|
String |
getHarvestInformation()
Deprecated.
Return the URL for monitoring this instance.
|
String |
getProgressStats()
Deprecated.
Get a human-readable set of statistics on the progress of the crawl.
|
long |
getQueuedUriCount()
Deprecated.
Get the number of URIs currently on the queue to be processed.
|
void |
initialize()
Deprecated.
Initialize a new CrawlController for executing a Heritrix crawl.
|
boolean |
isPaused()
Deprecated.
Returns true if the crawler has been paused, and thus not supposed to fetch anything.
|
void |
requestCrawlStart()
Deprecated.
Request that Heritrix start crawling.
|
void |
requestCrawlStop(String reason)
Deprecated.
Request that crawling stops.
|
getFiles, getGuiPort, getHeritrixFiles, getHostName, getJmxPort, getJobDescription, processHasExited, toString, waitForHeritrixProcessExit
public JMXHeritrixController(HeritrixFiles files)
files
- Files that are used to set up Heritrix.public void initialize()
HeritrixController
IOFailure
- If Heritrix dies before initialization, or we encounter any problems during the initialization.HeritrixController.initialize()
public void requestCrawlStart()
HeritrixController
IOFailure
- if unable to communicate with HeritrixHeritrixController.requestCrawlStart()
public boolean atFinish()
HeritrixController
HeritrixController.atFinish()
public void beginCrawlStop()
HeritrixController
IOFailure
- if unable to communicate with HeritrixHeritrixController.beginCrawlStop()
public int getActiveToeCount()
HeritrixController
HeritrixController.getActiveToeCount()
public void requestCrawlStop(String reason)
HeritrixController
reason
- A human-readable reason the crawl is being stopped.HeritrixController.requestCrawlStop(String)
public long getQueuedUriCount()
HeritrixController
HeritrixController.getQueuedUriCount()
public int getCurrentProcessedKBPerSec()
HeritrixController
HeritrixController.getCurrentProcessedKBPerSec()
public String getProgressStats()
HeritrixController
HeritrixController.getProgressStats()
public boolean isPaused()
HeritrixController
HeritrixController.isPaused()
public boolean crawlIsEnded()
HeritrixController.crawlIsEnded()
public void cleanup()
HeritrixController.cleanup()
public String getHarvestInformation()
Copyright © 2005–2016 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library.. All rights reserved.