Package | Description |
---|---|
dk.netarkivet.harvester.harvesting |
This module handles defining, scheduling, and execution of harvests.
|
dk.netarkivet.harvester.harvesting.controller | |
dk.netarkivet.harvester.harvesting.report |
Modifier and Type | Method and Description |
---|---|
static HeritrixFiles |
HeritrixFiles.getH1HeritrixFilesWithDefaultJmxFiles(File crawlDir,
JobInfo harvestJob) |
static HeritrixFiles |
HeritrixFiles.getH3HeritrixFiles(File crawlDir,
JobInfo harvestJob) |
protected HeritrixFiles |
HeritrixLauncher.getHeritrixFiles() |
HeritrixFiles |
HarvestController.writeHarvestFiles(File crawldir,
Job job,
HarvestDefinitionInfo hdi,
List<MetadataEntry> metadataEntries)
Writes the files involved with a harvests.
|
Modifier and Type | Method and Description |
---|---|
static void |
HeritrixLauncher.makeTemplateReadyForHeritrix1(HeritrixFiles files)
Updates the diskpath value, archivefile_prefix, seedsfile, and deduplication -information.
|
void |
HarvestController.runHarvest(HeritrixFiles files)
Creates the actual HeritrixLauncher instance and runs it, after the various setup files have been written.
|
void |
HeritrixLauncher.setupOrderfile(HeritrixFiles files) |
HarvestReport |
HarvestController.storeFiles(HeritrixFiles files,
StringBuilder errorMessage,
List<File> failedFiles)
Controls storing all files involved in a job.
|
Constructor and Description |
---|
HeritrixLauncher(HeritrixFiles files)
Private HeritrixLauncher constructor.
|
IngestableFiles(HeritrixFiles files)
Constructor for this class.
|
Modifier and Type | Method and Description |
---|---|
HeritrixFiles |
AbstractJMXHeritrixController.getFiles() |
protected HeritrixFiles |
AbstractJMXHeritrixController.getHeritrixFiles() |
Modifier and Type | Method and Description |
---|---|
static DefaultHeritrixLauncher |
DefaultHeritrixLauncher.getInstance(HeritrixFiles files)
Get instance of this class.
|
static BnfHeritrixLauncher |
BnfHeritrixLauncher.getInstance(HeritrixFiles files)
Get instance of this class.
|
Constructor and Description |
---|
AbstractJMXHeritrixController(HeritrixFiles files)
Create a BnfHeritrixController object.
|
BnfHeritrixController(HeritrixFiles files)
Create a BnfHeritrixController object.
|
JMXHeritrixController(HeritrixFiles files)
Deprecated.
Create a JMXHeritrixController object.
|
Modifier and Type | Method and Description |
---|---|
static DomainStatsReport |
HarvestReportGenerator.getDomainStatsReport(HeritrixFiles files) |
void |
HarvestReportGenerator.preProcess(HeritrixFiles files)
Pre-processing happens when the report is built just at the end of the crawl, before the ARC files upload.
|
Constructor and Description |
---|
HarvestReportGenerator(HeritrixFiles files)
Constructor from Heritrix report files.
|
Copyright © 2005–2016 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library.. All rights reserved.