Package dk.netarkivet.wayback.indexer
Retrieves indexes of the ARC files in the repository which are needed by Wayback.
The WaybackIndexer starts a producer thread with uses the populate operation on the IndexerQueue class to fetch a list of all new files added to the central ArcRepository. The resulting list is added to the ArcFileQueue containing the files which should be indexed.
The WaybackIndexer also creates a number of consumer threads which indexes the retrieved ArcFileLists
-
Interface Summary Interface Description GenericDAO<T,PK extends Serializable> A generic class for managing storage and retrieval of persistent objects. -
Class Summary Class Description ArchiveFile This class represents a file in the arcrepository which may be indexed by the indexer.ArchiveFileDAO Data Access Object for ArchiveFile instances.FileNameHarvester GenericHibernateDAO<T,PK extends Serializable> An implementation of Generic DAO which is specialised for hibernate object stores.HibernateUtil This class contains a single static utility method which returns a Hibernate session: HibernateUtil.getSession().IndexerQueue Singleton class which maintains the basic data structure and methods for the indexer.ResetFailedFiles Utility to enable retry of indexing for selected files after they have reached maxFailedAttempts.WaybackIndexer The WaybackIndexer starts threads to find new files to be indexed and indexes them.WaybackIndexerApplication The entry point for the wayback indexer.