Packages that use ARCBatchJob | |
---|---|
dk.netarkivet.common.utils.cdx | |
dk.netarkivet.viewerproxy.reporting | |
dk.netarkivet.wayback.batch | |
Uses of ARCBatchJob in dk.netarkivet.common.utils.cdx |
---|
Subclasses of ARCBatchJob in dk.netarkivet.common.utils.cdx | |
---|---|
class |
ExtractCDXJob
Batch job that extracts information to create a CDX file. |
class |
GetCDXRecordsBatchJob
Job to get CDX records out of metadata files. |
Uses of ARCBatchJob in dk.netarkivet.viewerproxy.reporting |
---|
Subclasses of ARCBatchJob in dk.netarkivet.viewerproxy.reporting | |
---|---|
class |
CrawlLogLinesMatchingRegexp
Batch job that extracts lines from a crawl log matching a regular expression. The batch job should be restricted to run on metadata files for a specific job only, using the FileBatchJob.processOnlyFilesMatching(String) construct. |
class |
HarvestedUrlsForDomainBatchJob
Batch job that extracts lines referring to a specific domain from a crawl log. |
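
The two-stage filtering described for CrawlLogLinesMatchingRegexp (first restrict the job to one harvest job's metadata files, then keep only the crawl-log lines that match a regular expression) can be illustrated with a stand-alone sketch. It does not use the NetarchiveSuite API: the class `CrawlLogFilterSketch`, its methods, the file names, and the metadata file-name pattern are all hypothetical, and full-string matching is an assumption about how the real filters behave.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

/** Hypothetical stand-alone sketch; not part of NetarchiveSuite. */
class CrawlLogFilterSketch {

    /** Keeps only the file names that fully match the regular expression (assumed semantics). */
    static List<String> filesMatching(List<String> fileNames, String fileRegex) {
        Pattern pattern = Pattern.compile(fileRegex);
        List<String> kept = new ArrayList<>();
        for (String name : fileNames) {
            if (pattern.matcher(name).matches()) {
                kept.add(name);
            }
        }
        return kept;
    }

    /** Keeps only the crawl-log lines that fully match the regular expression (assumed semantics). */
    static List<String> linesMatching(List<String> lines, String lineRegex) {
        Pattern pattern = Pattern.compile(lineRegex);
        List<String> kept = new ArrayList<>();
        for (String line : lines) {
            if (pattern.matcher(line).matches()) {
                kept.add(line);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Made-up file names; the naming scheme is an assumption for illustration.
        List<String> files = List.of(
                "42-metadata-1.arc", "43-metadata-1.arc", "42-117-00001.arc");
        // Stage 1: restrict to the metadata files of (hypothetical) job 42.
        System.out.println(filesMatching(files, "42-metadata-[0-9]+\\.arc"));

        // Made-up crawl-log lines.
        List<String> log = List.of(
                "200 1234 http://example.org/index.html",
                "404 0 http://other.example.net/missing");
        // Stage 2: keep the lines that mention example.org.
        System.out.println(linesMatching(log, ".*example\\.org.*"));
    }
}
```

In the real batch framework the first stage would be configured through FileBatchJob.processOnlyFilesMatching(String) rather than by filtering a list by hand.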
Uses of ARCBatchJob in dk.netarkivet.wayback.batch |
---|
Subclasses of ARCBatchJob in dk.netarkivet.wayback.batch | |
---|---|
class |
ExtractDeduplicateCDXBatchJob
This batch job takes deduplication records from a crawl log in a metadata ARC file and converts them to CDX records for use in Wayback. |
class |
ExtractWaybackCDXBatchJob
Returns a CDX file using the appropriate format for Wayback, including canonicalisation of URLs. |