Class Summary |
CrawlStatusMessage |
Instances of this class are sent by a HarvestControllerServer to the
THE_SCHED queue to indicate the progress of a heritrix crawl. |
DomainHarvestReport |
Interface to define what kind of statistics, all crawlers
are supposed to deliver to this system. |
DomainStats |
Tuple class to hold domain harvest statistics for a single domain. |
DoOneCrawlMessage |
Container for doOneCrawl request. |
HarvestControllerClient |
Proxy for remote scheduler. |
HarvestControllerServer |
This class responds to JMS doOneCrawl messages from the HarvestScheduler and
launches a Heritrix crawl with the received job description. |
MetadataEntry |
Class used to carry metadata in DoOneCrawl messages, including the URL
and mimetype necessary to write the metadata to metadata ARC files. |
PersistentJobData |
Class PersistentJobData holds information about an ongoing harvest. |