Class CrawlLogDataCache

  • All Implemented Interfaces:
    RawDataCache

    public class CrawlLogDataCache
    extends RawMetadataCache
    This class implements the low-level cache for crawl log Lucene indexing. It will get the crawl logs for individual jobs as files.
    • Constructor Detail

      • CrawlLogDataCache

        public CrawlLogDataCache()
        Create a new CrawlLogDataCache. For a given job ID, this will fetch and cache crawl.log files from metadata files (<ID>-metadata-[0-9]+.arc).