Package | Description |
---|---|
dk.netarkivet.common.utils.archive | |
dk.netarkivet.common.utils.batch | |
dk.netarkivet.common.utils.cdx | |
dk.netarkivet.viewerproxy.webinterface |
Modifier and Type | Method and Description |
---|---|
ArchiveBatchFilter |
ArchiveBatchJob.getFilter()
Returns an ArchiveBatchFilter object which restricts the set of records in the archive on which this batch-job is
performed.
|
Modifier and Type | Field and Description |
---|---|
static ArchiveBatchFilter |
ArchiveBatchFilter.EXCLUDE_NON_RESPONSE_RECORDS
A default filter: Accepts only response records.
|
static ArchiveBatchFilter |
ArchiveBatchFilter.EXCLUDE_NON_WARCINFO_RECORDS
A default filter: Accepts only response records.
|
static ArchiveBatchFilter |
ArchiveBatchFilter.NO_FILTER
A default filter: Accepts everything.
|
static ArchiveBatchFilter |
ArchiveBatchFilter.ONLY_HTTP_ENTRIES
Filter that only accepts records where the url starts with http.
|
Modifier and Type | Method and Description |
---|---|
static ArchiveBatchFilter |
ArchiveBatchFilter.getMimetypeBatchFilter(String mimetype)
Note that the mimetype of the WARC responserecord is not (necessarily) the same as its payload.
|
Modifier and Type | Method and Description |
---|---|
ArchiveBatchFilter |
ArchiveExtractCDXJob.getFilter()
Filters out the NON-RESPONSE records.
|
Modifier and Type | Method and Description |
---|---|
ArchiveBatchFilter |
HarvestedUrlsForDomainBatchJob.getFilter() |
ArchiveBatchFilter |
CrawlLogLinesMatchingRegexp.getFilter() |
Copyright © 2005–2018 The Royal Danish Library, the National Library of France and the Austrian National Library.. All rights reserved.