Modifier and Type | Class and Description |
---|---|
class |
HeritrixArchiveRecordWrapper
Heritrix wrapper implementation of the abstract archive record interface.
|
Modifier and Type | Method and Description |
---|---|
static ArchiveRecordBase |
ArchiveRecordBase.wrapArchiveRecord(org.archive.io.ArchiveRecord archiveRecord)
Factory method for creating a wrapped Heritrix record.
|
Modifier and Type | Method and Description |
---|---|
void |
GetMetadataArchiveBatchJob.processRecord(ArchiveRecordBase record,
OutputStream os)
The method for processing the arc-records.
|
abstract void |
ArchiveBatchJob.processRecord(ArchiveRecordBase record,
OutputStream os)
Exceptions should be handled with the handleException() method.
|
Modifier and Type | Method and Description |
---|---|
abstract boolean |
ArchiveBatchFilter.accept(ArchiveRecordBase record)
Check if a given record is accepted (not filtered out) by this filter.
|
Modifier and Type | Method and Description |
---|---|
void |
ArchiveExtractCDXJob.processRecord(ArchiveRecordBase record,
OutputStream os)
Process this entry, reading metadata into the output stream.
|
Modifier and Type | Method and Description |
---|---|
void |
HarvestedUrlsForDomainBatchJob.processRecord(ArchiveRecordBase record,
OutputStream os)
Process a record on crawl log concerning the given domain to result.
|
void |
CrawlLogLinesMatchingRegexp.processRecord(ArchiveRecordBase record,
OutputStream os)
Process a record on crawl log concerning the given domain to result.
|
Modifier and Type | Method and Description |
---|---|
void |
DeduplicationCDXExtractionBatchJob.processRecord(ArchiveRecordBase record,
OutputStream os)
If the ArchiveRecord is a crawl-log entry then any duplicate entries in the crawl log are converted to CDX
entries and written to the output.
|
Copyright © 2005–2015 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library.. All rights reserved.