public class FullHarvest extends HarvestDefinition
audience, channelId, comments, edition, harvestDefName, isActive, numEvents, oid, submissionDate
extendedFieldValues
Constructor and Description |
---|
FullHarvest(String harvestDefName,
String comments,
Long previousHarvestDefinitionOid,
long maxCountObjects,
long maxBytes,
long maxJobRunningTime,
boolean isIndexReady,
javax.inject.Provider<HarvestDefinitionDAO> hdDaoProvider,
javax.inject.Provider<JobDAO> jobDaoProvider,
javax.inject.Provider<ExtendedFieldDAO> extendedFieldDAOProvide,
javax.inject.Provider<DomainDAO> domainDAOProvider)
Create new instance of FullHarvest configured according to the properties of the supplied DomainConfiguration.
|
Modifier and Type | Method and Description |
---|---|
Iterator<DomainConfiguration> |
getDomainConfigurations()
Returns an iterator of domain configurations for this harvest definition.
|
Iterator<DomainConfiguration> |
getDomainConfigurationsForIterativeHarvest() |
boolean |
getIndexReady()
Is index ready.
|
long |
getMaxBytes()
Get the maximum number of bytes that this fullharvest will harvest per domain, 0 for no limit.
|
long |
getMaxCountObjects()
Returns how many objects to harvest per domain, or 0 for no limit.
|
long |
getMaxJobRunningTime() |
HarvestDefinition |
getPreviousHarvestDefinition()
Get the previous HarvestDefinition which is used to base this.
|
boolean |
isSnapShot()
Returns whether this HarvestDefinition represents a snapshot harvest.
|
boolean |
runNow(Date now)
Check if this harvest definition should be run, given the time now.
|
void |
setIndexReady(boolean isIndexReady)
Set the indexReady field.
|
void |
setMaxBytes(long maxBytes)
Set the limit for how many bytes this fullharvest will harvest per domain, or -1 for no limit.
|
void |
setMaxCountObjects(long maxCountObjects) |
void |
setMaxJobRunningTime(long maxJobRunningtime)
Set the limit for how many seconds each crawljob in this fullharvest will run, or 0 for no limit.
|
void |
setPreviousHarvestDefinition(Long prev)
Set the previous HarvestDefinition which is used to base this.
|
createFullHarvest, createPartialHarvest, equals, getActive, getAudience, getChannelId, getComments, getEdition, getExtendedFieldType, getName, getNumEvents, getOid, getSubmissionDate, hashCode, setActive, setAudience, setChannelId, setComments, setEdition, setName, setNumEvents, setOid, setSubmissionDate, toString
addExtendedFieldValue, addExtendedFieldValues, getExtendedFieldValue, getExtendedFieldValues, setExtendedFieldValues, updateExtendedFieldValue
public FullHarvest(String harvestDefName, String comments, Long previousHarvestDefinitionOid, long maxCountObjects, long maxBytes, long maxJobRunningTime, boolean isIndexReady, javax.inject.Provider<HarvestDefinitionDAO> hdDaoProvider, javax.inject.Provider<JobDAO> jobDaoProvider, javax.inject.Provider<ExtendedFieldDAO> extendedFieldDAOProvide, javax.inject.Provider<DomainDAO> domainDAOProvider)
harvestDefName
- the name of the harvest definitioncomments
- commentspreviousHarvestDefinitionOid
- This harvestDefinition is used to create this Fullharvest definition.maxCountObjects
- Limit for how many objects can be fetched per domainmaxBytes
- Limit for how many bytes can be fetched per domainmaxJobRunningTime
- Limit on how much time can be spent on each job. 0 means no limitisIndexReady
- Is the deduplication index ready for this harvest.public HarvestDefinition getPreviousHarvestDefinition()
public void setPreviousHarvestDefinition(Long prev)
prev
- The id of a HarvestDefinitionpublic long getMaxCountObjects()
HarvestDefinition
getMaxCountObjects
in class HarvestDefinition
public void setMaxCountObjects(long maxCountObjects)
maxCountObjects
- The maxCountObjects to set.public long getMaxBytes()
getMaxBytes
in class HarvestDefinition
public void setMaxBytes(long maxBytes)
maxBytes
- Number of bytes to stop harvesting at.public Iterator<DomainConfiguration> getDomainConfigurations()
Domains are also excluded if they are aliases of another domain.
getDomainConfigurations
in class HarvestDefinition
public Iterator<DomainConfiguration> getDomainConfigurationsForIterativeHarvest()
public boolean runNow(Date now)
runNow
in class HarvestDefinition
now
- The current timepublic boolean isSnapShot()
isSnapShot
in class HarvestDefinition
public long getMaxJobRunningTime()
public void setMaxJobRunningTime(long maxJobRunningtime)
maxJobRunningtime
- max number of secondspublic boolean getIndexReady()
public void setIndexReady(boolean isIndexReady)
isIndexReady
- The new value of the indexReady field.Copyright © 2005–2018 The Royal Danish Library, the National Library of France and the Austrian National Library.. All rights reserved.