|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object dk.netarkivet.harvester.harvesting.MetadataFile
public class MetadataFile
Wraps information for an Heritrix file that should be stored in the metadata ARC. Defines a natural order to sort them.
Field Summary | |
---|---|
static java.lang.String |
CDX_PATTERN
A pattern identifying a CDX metadata entry. |
static java.lang.String |
CRAWL_LOG_PATTERN
A pattern identifying the crawl log metadata entry. |
static java.lang.String |
DOMAIN_SETTINGS_FILE
The name of a domain-specific Heritrix settings file (a.k.a. |
static java.lang.String |
HERITRIX_FILE_PATTERN
The pattern controlling which files in the crawl directory root should be stored in the metadata ARC. |
static java.lang.String |
LOG_FILE_PATTERN
The pattern controlling which files in the logs subdirectory of the crawl directory root should be stored in the metadata ARC as log files. |
static java.lang.String |
REPORT_FILE_PATTERN
The pattern controlling which files in the crawl directory root should be stored in the metadata ARC as reports. |
Constructor Summary | |
---|---|
MetadataFile(java.io.File heritrixFile,
java.lang.Long harvestId,
java.lang.Long jobId,
java.lang.String heritrixVersion)
Creates a metadata file and finds which metadata type it belongs to. |
|
MetadataFile(java.io.File heritrixFile,
java.lang.Long harvestId,
java.lang.Long jobId,
java.lang.String heritrixVersion,
java.lang.String domain)
Creates a metadata file for a domain-specific override file. |
Method Summary | |
---|---|
int |
compareTo(MetadataFile other)
First we compare the type ordinals, then the URLs. |
java.io.File |
getHeritrixFile()
Returns the actual file. |
java.lang.String |
getUrl()
Returns the metadata URL associated to this file. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final java.lang.String CDX_PATTERN
CDXDataCache.CDXDataCache()
,
Constant Field Valuespublic static final java.lang.String CRAWL_LOG_PATTERN
CrawlLogDataCache.CrawlLogDataCache()
,
Constant Field Valuespublic static final java.lang.String HERITRIX_FILE_PATTERN
public static final java.lang.String REPORT_FILE_PATTERN
public static final java.lang.String LOG_FILE_PATTERN
public static final java.lang.String DOMAIN_SETTINGS_FILE
Constructor Detail |
---|
MetadataFile(java.io.File heritrixFile, java.lang.Long harvestId, java.lang.Long jobId, java.lang.String heritrixVersion)
MetadataFile(java.io.File heritrixFile, java.lang.Long harvestId, java.lang.Long jobId, java.lang.String heritrixVersion, java.lang.String domain)
Method Detail |
---|
public java.lang.String getUrl()
public java.io.File getHeritrixFile()
public int compareTo(MetadataFile other)
compareTo
in interface java.lang.Comparable<MetadataFile>
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |