Modifier and Type | Method and Description |
---|---|
static File |
getCrawlLogForDomainInJob(String domain,
int jobid)
Submit a batch job to extract the part of a crawl log that is associated with the given domain and job.
|
static File |
getCrawlLoglinesMatchingRegexp(int jobid,
String regexp)
Return any crawllog lines for a given jobid matching the given regular expression.
|
static List<String> |
getFilesForJob(int jobid,
String harvestprefix)
Submit a batch job to list all files for a job, and report result in a sorted list.
|
static List<CDXRecord> |
getMetadataCDXRecordsForJob(long jobid)
Submit a batch job to generate cdx for all metadata files for a job, and report result in a list.
|
public static List<String> getFilesForJob(int jobid, String harvestprefix)
jobid
- The job to get files for.harvestprefix
- The harvestprefix for the files produced by heritrixArgumentNotValid
- If jobid is 0 or negative.IOFailure
- On trouble generating the file listpublic static List<CDXRecord> getMetadataCDXRecordsForJob(long jobid)
jobid
- The job to get cdx for.ArgumentNotValid
- If jobid is 0 or negative.IOFailure
- On trouble generating the cdxpublic static File getCrawlLogForDomainInJob(String domain, int jobid)
domain
- The domain to get crawl.log-lines for.jobid
- The jobid to get the crawl.log-lines for.ArgumentNotValid
- On negative jobids, or if domain is null or the empty string.public static File getCrawlLoglinesMatchingRegexp(int jobid, String regexp)
jobid
- The jobidregexp
- A regular expressionCopyright © 2005–2015 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library.. All rights reserved.