|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
public interface HeritrixController
This interface encapsulates the direct access to Heritrix, allowing for accessing in various ways (direct class access or JMX). Heritrix is expected to perform one crawl for each instance of an implementing class.
Method Summary | |
---|---|
boolean |
atFinish()
Query whether Heritrix is in a state where it can finish crawling. |
void |
beginCrawlStop()
Tell Heritrix to stop crawling. |
void |
cleanup()
Release any resources kept by the class. |
boolean |
crawlIsEnded()
Returns true if the crawl has ended, either because Heritrix finished or because we terminated it. |
int |
getActiveToeCount()
Get the number of currently active ToeThreads (crawler threads). |
int |
getCurrentProcessedKBPerSec()
Get an estimate of the rate, in kb, at which documents are currently being processed by the crawler. |
java.lang.String |
getHarvestInformation()
Get harvest information. |
java.lang.String |
getProgressStats()
Get a human-readable set of statistics on the progress of the crawl. |
long |
getQueuedUriCount()
Get the number of URIs currently on the queue to be processed. |
void |
initialize()
Initialize a new CrawlController for executing a Heritrix crawl. |
boolean |
isPaused()
Returns true if the crawler has been paused, and thus not supposed to fetch anything. |
void |
requestCrawlStart()
Request that Heritrix start crawling. |
void |
requestCrawlStop(java.lang.String reason)
Request that crawling stops. |
Method Detail |
---|
void initialize()
void requestCrawlStart() throws IOFailure
IOFailure
- If something goes wrong during startup.void beginCrawlStop()
void requestCrawlStop(java.lang.String reason)
reason
- A human-readable reason the crawl is being stopped.boolean atFinish()
boolean crawlIsEnded()
int getActiveToeCount()
long getQueuedUriCount()
int getCurrentProcessedKBPerSec()
StatisticsTracking.currentProcessedKBPerSec()
java.lang.String getProgressStats()
boolean isPaused()
void cleanup()
java.lang.String getHarvestInformation()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |