Package | Description |
---|---|
dk.netarkivet.harvester.datamodel | |
dk.netarkivet.harvester.datamodel.dao | |
dk.netarkivet.harvester.datamodel.extendedfield | |
dk.netarkivet.harvester.distribute | |
dk.netarkivet.harvester.harvesting | This module handles defining, scheduling, and execution of harvests. |
dk.netarkivet.harvester.harvesting.distribute | |
dk.netarkivet.harvester.harvesting.metadata | |
dk.netarkivet.harvester.harvesting.report | |
dk.netarkivet.harvester.heritrix3 | This module handles defining, scheduling, and execution of harvests. |
dk.netarkivet.harvester.scheduler | |
dk.netarkivet.harvester.scheduler.jobgen | |
dk.netarkivet.harvester.webinterface | |

Class and Description |
---|
AliasInfo
Class encapsulating domain alias information.
|
DAO
Interface common to all DAOs.
|
DBSpecifics
Defines database specific implementations used by the Harvester.
|
DerbySpecifics
Derby-specific implementation of DB methods.
|
Domain
Represents known information about a domain. A domain is identified by a domain name (e.g. kb.dk).
|
DomainConfiguration
This class describes a configuration for harvesting a domain.
|
DomainDAO
Persistent storage for Domain objects.
|
DomainHarvestInfo
DomainConfigPair class for extracted information on harvests on a specific domain.
|
DomainHistory
Container for the historical information available for a domain.
|
DomainOwnerInfo
This class manages owner information about a domain.
|
Frequency
This class defines various frequencies at which things can happen, such as midnight every day, 13:45 on the first Monday of a month, etc.
|
FullHarvest
This class contains the specific properties and operations of snapshot harvest definitions.
|
GlobalCrawlerTrapList
Class representing one or more global crawler traps, modelled as a list of regular expressions.
|
GlobalCrawlerTrapListDAO
A Data Access Object for managing persistent collections of global crawler traps.
|
HarvestChannel
Harvest channels are used to dispatch harvest jobs to specific pools of crawlers.
|
HarvestChannelDAO
Abstract class for the DAO handling the persistence of HarvestChannel instances.
|
HarvestDefinition
This abstract class models the general properties of a harvest definition, i.e.
|
HarvestDefinitionDAO
A Data Access Object for harvest definitions.
|
HarvesterDatabaseTables
Enum class defining the tables of the Harvester database and the required versions of the individual tables.
|
HarvestInfo
Summary information about a specific harvest of a domain.
|
HarvestRunInfo
Information on a single run of a harvest.
|
HeritrixTemplate
Abstract class for manipulating Heritrix Templates.
|
Job
This class represents one job to be run by Heritrix.
|
JobDAO
Interface for creating and accessing jobs in persistent storage.
|
JobStatus
Enumeration of the possible states (alt.: status) a Job can be in.
|
JobStatusInfo
A simple tuple to deliver information on the status of jobs.
|
PartialHarvest
This class contains the specific properties and operations of harvest definitions which are not snapshot harvest definitions.
|
Password
Immutable password class.
|
RunningJobsInfoDAO
Abstract class for handling the persistence of running job infos.
|
Schedule
This class implements a schedule that can be either repeating or timed, depending on the subclass.
|
ScheduleDAO
A DAO for reading and writing schedules by name.
|
SeedList
Representation of the list of harvesting seeds.
|
SparseDomain
Reduced version of the Domain class for presentation purposes.
|
SparseDomainConfiguration
Sparse version of the DomainConfiguration class.
|
SparseFullHarvest
Sparse version of FullHarvest to be used for GUI purposes only.
|
SparsePartialHarvest
Sparse version of PartialHarvest to be used for GUI purposes only.
|
StopReason
Class for containing a reason for stopping the harvesting of a domain.
|
TemplateDAO
DAO methods for reading templates only.
|
TimeUnit
Enumeration of the possible time units used for frequencies in schedules.
|
TLDInfo
A container for miscellaneous information about a TLD.
|
Class and Description |
---|
DomainDAO
Persistent storage for Domain objects.
|
HarvestDefinitionDAO
A Data Access Object for harvest definitions.
|
JobDAO
Interface for creating and accessing jobs in persistent storage.
|
Class and Description |
---|
DAO
Interface common to all DAOs.
|
Class and Description |
---|
HarvestChannel
Harvest channels are used to dispatch harvest jobs to specific pools of crawlers.
|
Class and Description |
---|
HarvestDefinitionInfo
Class containing information about a harvest job.
|
HeritrixTemplate
Abstract class for manipulating Heritrix Templates.
|
Job
This class represents one job to be run by Heritrix.
|
Class and Description |
---|
HarvestDefinitionInfo
Class containing information about a harvest job.
|
Job
This class represents one job to be run by Heritrix.
|
JobStatus
Enumeration of the possible states (alt.: status) a Job can be in.
|
StopReason
Class for containing a reason for stopping the harvesting of a domain.
|
Class and Description |
---|
AliasInfo
Class encapsulating domain alias information.
|
Class and Description |
---|
Job
This class represents one job to be run by Heritrix.
|
StopReason
Class for containing a reason for stopping the harvesting of a domain.
|
Class and Description |
---|
HarvestDefinitionInfo
Class containing information about a harvest job.
|
HeritrixTemplate
Abstract class for manipulating Heritrix Templates.
|
Job
This class represents one job to be run by Heritrix.
|
Class and Description |
---|
HarvestChannel
Harvest channels are used to dispatch harvest jobs to specific pools of crawlers.
|
HarvestChannelDAO
Abstract class for the DAO handling the persistence of HarvestChannel instances.
|
HarvestDefinitionDAO
A Data Access Object for harvest definitions.
|
Job
This class represents one job to be run by Heritrix.
|
JobDAO
Interface for creating and accessing jobs in persistent storage.
|
Class and Description |
---|
DomainConfiguration
This class describes a configuration for harvesting a domain.
|
HarvestDefinition
This abstract class models the general properties of a harvest definition, i.e.
|
Job
This class represents one job to be run by Heritrix.
|
Class and Description |
---|
Domain
Represents known information about a domain. A domain is identified by a domain name (e.g. kb.dk).
|
DomainConfiguration
This class describes a configuration for harvesting a domain.
|
DomainDAO
Persistent storage for Domain objects.
|
DomainHarvestInfo
DomainConfigPair class for extracted information on harvests on a specific domain.
|
HarvestDefinitionDAO
A Data Access Object for harvest definitions.
|
JobDAO
Interface for creating and accessing jobs in persistent storage.
|
JobStatus
Enumeration of the possible states (alt.: status) a Job can be in.
|
JobStatusInfo
A simple tuple to deliver information on the status of jobs.
|
SeedList
Representation of the list of harvesting seeds.
|
Copyright © 2005–2015 The Royal Danish Library, the Danish State and University Library, the National Library of France and the Austrian National Library. All rights reserved.