public class HarvestReportGenerator extends Object
Modifier and Type | Class and Description |
---|---|
static class |
HarvestReportGenerator.ProgressStatisticsConstants
Strings found in the progress-statistics.log, used to devise the default stop reason for domains.
|
Constructor and Description |
---|
HarvestReportGenerator()
Default constructor that does nothing.
|
HarvestReportGenerator(Heritrix3Files files)
Constructor from Heritrix report files.
|
Modifier and Type | Method and Description |
---|---|
static StopReason |
findDefaultStopReason(File logFile)
Find out whether we stopped normally in progress statistics log.
|
StopReason |
getDefaultStopReason() |
Map<String,DomainStats> |
getDomainStatsMap() |
static DomainStatsReport |
getDomainStatsReport(Heritrix3Files files) |
protected DomainStats |
getOrCreateDomainStats(String domainName)
Attempts to get an already existing
DomainStats object for that domain, and if not found creates one with
zero values. |
void |
preProcess(Heritrix3Files files)
Pre-processing happens when the report is built just at the end of the crawl, before the ARC files upload.
|
public HarvestReportGenerator()
public HarvestReportGenerator(Heritrix3Files files)
files
- the set of Heritrix reports.public void preProcess(Heritrix3Files files)
protected DomainStats getOrCreateDomainStats(String domainName)
DomainStats
object for that domain, and if not found creates one with
zero values.domainName
- the name of the domain to get DomainStats for.public static StopReason findDefaultStopReason(File logFile)
logFile
- A progress-statistics.log file.public StopReason getDefaultStopReason()
public Map<String,DomainStats> getDomainStatsMap()
public static DomainStatsReport getDomainStatsReport(Heritrix3Files files)
files
- A set of Heritrix3 files used to produce a a HarvestReport.Copyright © 2005–2018 The Royal Danish Library, the National Library of France and the Austrian National Library.. All rights reserved.