public class WaybackCDXExtractionARCBatchJob extends ARCBatchJob
FileBatchJob.ExceptionOccurrence
noOfRecordsProcessed
batchJobTimeout, exceptions, filesFailed, noOfFilesProcessed
Constructor and Description |
---|
WaybackCDXExtractionARCBatchJob()
Constructor which set timeout to one day.
|
WaybackCDXExtractionARCBatchJob(long timeout)
Constructor.
|
Modifier and Type | Method and Description |
---|---|
void |
finish(OutputStream os)
Does nothing except log the end of the job.
|
ARCBatchFilter |
getFilter()
returns a BatchFilter object which restricts the set of arcrecords in the archive on which this batch-job is
performed.
|
void |
initialize(OutputStream os)
Initializes the private fields of this class.
|
void |
processRecord(org.archive.io.arc.ARCRecord record,
OutputStream os)
For each ARCRecord writes one CDX line (including newline) to the output.
|
getExceptionArray, handleException, noOfRecordsProcessed, processFile
addException, addFinishException, addInitializeException, getBatchJobTimeout, getExceptions, getFilenamePattern, getFilesFailed, getNoOfFilesProcessed, maxExceptionsReached, postProcess, processOnlyFileNamed, processOnlyFilesMatching, processOnlyFilesMatching, processOnlyFilesNamed, setBatchJobTimeout
public WaybackCDXExtractionARCBatchJob()
public WaybackCDXExtractionARCBatchJob(long timeout)
timeout
- specific timeout periodpublic void initialize(OutputStream os)
initialize
in class ARCBatchJob
os
- unused argumentpublic void finish(OutputStream os)
finish
in class ARCBatchJob
os
- unused argument.public ARCBatchFilter getFilter()
ARCBatchJob
getFilter
in class ARCBatchJob
public void processRecord(org.archive.io.arc.ARCRecord record, OutputStream os)
processRecord
in class ARCBatchJob
record
- the ARCRecord to be indexed.os
- the OutputStream to which output is written.Copyright © 2005–2016 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library.. All rights reserved.