public class HarvestJob extends Object
Constructor and Description |
---|
HarvestJob(HarvestControllerServer hcs) - Constructor. |
Modifier and Type | Method and Description |
---|---|
File | createCrawlDir() - Create the crawl dir, but make sure a message is sent if there is a problem. |
Heritrix3Files | getHeritrix3Files() |
void | init(Job job, HarvestDefinitionInfo origHarvestInfo, List<MetadataEntry> metadataEntries) - Initialization of the harvestJob. |
void | runHarvest() - Creates the actual HeritrixLauncher instance and runs it, after the various setup files have been written. |
Heritrix3Files | writeHarvestFiles(File crawldir, Job job, HarvestDefinitionInfo hdi, List<MetadataEntry> metadataEntries) - Writes the files needed to start a harvest. |
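
The summary for createCrawlDir() says that the crawl directory is created and that a message is sent if there is a problem. The sketch below illustrates that general "notify, then fail" pattern with plain JDK classes only. It is not the HarvestJob implementation: CrawlDirSketch, DirCreationFailed, and the notifier callback are hypothetical names invented for this illustration, and how the real class builds the directory path or sends its message is not shown on this page.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/** Illustration only: none of these names belong to the HarvestJob API. */
final class CrawlDirSketch {

    /** Hypothetical stand-in for the PermissionDenied exception named in the Javadoc. */
    static final class DirCreationFailed extends RuntimeException {
        DirCreationFailed(String message) {
            super(message);
        }
    }

    /**
     * Creates parent/name and returns it. On failure, reports through the
     * notifier first and only then throws, so the failure is never silent.
     */
    static File createCrawlDir(File parent, String name, Consumer<String> notifier) {
        File crawlDir = new File(parent, name);
        // mkdirs() also returns false when the directory already exists,
        // so an existing directory is accepted via isDirectory().
        if (!crawlDir.mkdirs() && !crawlDir.isDirectory()) {
            String problem = "Unable to create crawl directory " + crawlDir.getAbsolutePath();
            notifier.accept(problem);
            throw new DirCreationFailed(problem);
        }
        return crawlDir;
    }

    public static void main(String[] args) {
        // Collect "sent" messages in a list instead of a real message channel.
        List<String> sent = new ArrayList<>();
        File base = new File(System.getProperty("java.io.tmpdir"),
                "crawl-sketch-" + System.nanoTime());
        File dir = createCrawlDir(base, "job-1", sent::add);
        System.out.println("created: " + dir.isDirectory()
                + ", messages sent: " + sent.size());
    }
}
```

Passing the notifier as a parameter keeps the sketch testable without any messaging infrastructure; a caller would presumably supply whatever channel reports job failures in its own setup.
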
public HarvestJob(HarvestControllerServer hcs)
hcs - a HarvestControllerServer instance

public void init(Job job, HarvestDefinitionInfo origHarvestInfo, List<MetadataEntry> metadataEntries)
job - A job from the jobs table in the harvest database
origHarvestInfo - metadata about the harvest
metadataEntries - entries for the metadata file for the harvest

public Heritrix3Files getHeritrix3Files()

public void runHarvest() throws ArgumentNotValid
ArgumentNotValid - if an argument isn't valid.

public File createCrawlDir()
PermissionDenied - if the directory cannot be created.

public Heritrix3Files writeHarvestFiles(File crawldir, Job job, HarvestDefinitionInfo hdi, List<MetadataEntry> metadataEntries)
crawldir - The directory that the crawl should take place in.
job - The Job object containing various harvest setup data.
hdi - The object encapsulating documentary information about the harvest.
metadataEntries - Any metadata entries sent along with the job that should be stored for later use.

Copyright © 2005–2016 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library. All rights reserved.