|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object dk.netarkivet.harvester.harvesting.controller.AbstractJMXHeritrixController dk.netarkivet.harvester.harvesting.controller.JMXHeritrixController
BnfHeritrixController
instead
public class JMXHeritrixController
This implementation of the HeritrixController interface starts Heritrix as a separate process and uses JMX to communicate with it. Each instance executes exactly one process that runs exactly one crawl job.
Constructor Summary | |
---|---|
JMXHeritrixController(HeritrixFiles files)
Deprecated. Create a JMXHeritrixController object. |
Method Summary | |
---|---|
boolean |
atFinish()
Deprecated. Query whether Heritrix is in a state where it can finish crawling. |
void |
beginCrawlStop()
Deprecated. Tell Heritrix to stop crawling. |
void |
cleanup()
Deprecated. Cleanup after an Heritrix process. |
boolean |
crawlIsEnded()
Deprecated. Check if the crawl has ended, either because Heritrix finished of its own, or because we terminated it. |
int |
getActiveToeCount()
Deprecated. Get the number of currently active ToeThreads (crawler threads). |
int |
getCurrentProcessedKBPerSec()
Deprecated. Get an estimate of the rate, in kb, at which documents are currently being processed by the crawler. |
java.lang.String |
getHarvestInformation()
Deprecated. Return the URL for monitoring this instance. |
java.lang.String |
getProgressStats()
Deprecated. Get a human-readable set of statistics on the progress of the crawl. |
long |
getQueuedUriCount()
Deprecated. Get the number of URIs currently on the queue to be processed. |
void |
initialize()
Deprecated. Initialize a new CrawlController for executing a Heritrix crawl. |
boolean |
isPaused()
Deprecated. Returns true if the crawler has been paused, and thus not supposed to fetch anything. |
void |
requestCrawlStart()
Deprecated. Request that Heritrix start crawling. |
void |
requestCrawlStop(java.lang.String reason)
Deprecated. Request that crawling stops. |
Methods inherited from class dk.netarkivet.harvester.harvesting.controller.AbstractJMXHeritrixController |
---|
getGuiPort, getHeritrixFiles, getHostName, getJmxPort, getJobDescription, processHasExited, toString, waitForHeritrixProcessExit |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Constructor Detail |
---|
public JMXHeritrixController(HeritrixFiles files)
files
- Files that are used to set up Heritrix.Method Detail |
---|
public void initialize()
HeritrixController
IOFailure
- If Heritrix dies before initialization,
or we encounter any problems during the initialization.HeritrixController.initialize()
public void requestCrawlStart()
HeritrixController
IOFailure
- if unable to communicate with HeritrixHeritrixController.requestCrawlStart()
public boolean atFinish()
HeritrixController
HeritrixController.atFinish()
public void beginCrawlStop()
HeritrixController
IOFailure
- if unable to communicate with HeritrixHeritrixController.beginCrawlStop()
public int getActiveToeCount()
HeritrixController
HeritrixController.getActiveToeCount()
public void requestCrawlStop(java.lang.String reason)
HeritrixController
reason
- A human-readable reason the crawl is being stopped.HeritrixController.requestCrawlStop(String)
public long getQueuedUriCount()
HeritrixController
HeritrixController.getQueuedUriCount()
public int getCurrentProcessedKBPerSec()
HeritrixController
HeritrixController.getCurrentProcessedKBPerSec()
public java.lang.String getProgressStats()
HeritrixController
HeritrixController.getProgressStats()
public boolean isPaused()
HeritrixController
HeritrixController.isPaused()
public boolean crawlIsEnded()
HeritrixController.crawlIsEnded()
public void cleanup()
HeritrixController.cleanup()
public java.lang.String getHarvestInformation()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |