|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object dk.netarkivet.common.utils.batch.FileBatchJob dk.netarkivet.common.utils.arc.ARCBatchJob dk.netarkivet.viewerproxy.reporting.HarvestedUrlsForDomainBatchJob
public class HarvestedUrlsForDomainBatchJob
Batchjob that extracts lines referring to a specific domain from a crawl log.
The batch job should be restricted to run on metadata files for a specific
job only, using the FileBatchJob.processOnlyFilesMatching(String)
construct.
Nested Class Summary |
---|
Nested classes/interfaces inherited from class dk.netarkivet.common.utils.batch.FileBatchJob |
---|
FileBatchJob.ExceptionOccurrence |
Field Summary | |
---|---|
(package private) java.lang.String |
domain
The domain to extract crawl.log lines for. |
Fields inherited from class dk.netarkivet.common.utils.arc.ARCBatchJob |
---|
noOfRecordsProcessed |
Fields inherited from class dk.netarkivet.common.utils.batch.FileBatchJob |
---|
batchJobTimeout, exceptions, filesFailed, noOfFilesProcessed |
Constructor Summary | |
---|---|
HarvestedUrlsForDomainBatchJob(java.lang.String domain)
Initialise the batch job. |
Method Summary | |
---|---|
void |
finish(java.io.OutputStream os)
Does nothing, no finishing is needed. |
ARCBatchFilter |
getFilter()
returns a BatchFilter object which restricts the set of arcrecords in the archive on which this batch-job is performed. |
void |
initialize(java.io.OutputStream os)
Does nothing, no initialisation is needed. |
void |
processRecord(org.archive.io.arc.ARCRecord record,
java.io.OutputStream os)
Process a record on crawl log concerning the given domain to result. |
java.lang.String |
toString()
Humanly readable representation of this instance. |
Methods inherited from class dk.netarkivet.common.utils.arc.ARCBatchJob |
---|
getExceptionArray, handleException, noOfRecordsProcessed, processFile |
Methods inherited from class dk.netarkivet.common.utils.batch.FileBatchJob |
---|
addException, addFinishException, addInitializeException, getBatchJobTimeout, getExceptions, getFilenamePattern, getFilesFailed, getNoOfFilesProcessed, maxExceptionsReached, postProcess, processOnlyFileNamed, processOnlyFilesMatching, processOnlyFilesMatching, processOnlyFilesNamed, setBatchJobTimeout |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
final java.lang.String domain
Constructor Detail |
---|
public HarvestedUrlsForDomainBatchJob(java.lang.String domain)
domain
- The domain to get crawl.log lines for.Method Detail |
---|
public void initialize(java.io.OutputStream os)
initialize
in class ARCBatchJob
os
- Not used.public ARCBatchFilter getFilter()
ARCBatchJob
getFilter
in class ARCBatchJob
public void processRecord(org.archive.io.arc.ARCRecord record, java.io.OutputStream os)
processRecord
in class ARCBatchJob
record
- The record to process.os
- The output stream for the result.
ArgumentNotValid
- on null parameters
IOFailure
- on trouble processing the record.public void finish(java.io.OutputStream os)
finish
in class ARCBatchJob
os
- Not used.public java.lang.String toString()
toString
in class java.lang.Object
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |