Package dk.netarkivet.wayback.indexer

Retrieves indexes of the ARC files in the repository which are needed by Wayback.

See:
          Description

Interface Summary
GenericDAO<T,PK extends Serializable> A generic class for managing storage and retrieval of persistent objects.
 

Class Summary
ArchiveFile This class represents a file in the arcrepository which may be indexed by the indexer.
ArchiveFileDAO Data Access Object for ArchiveFile instances.
FileNameHarvester  
GenericHibernateDAO<T,PK extends Serializable> An implementation of Generic DAO which is specialised for hibernate object stores.
HibernateUtil This class contains a single static utility method which returns a Hibernate session: HibernateUtil.getSession().
IndexerQueue Singleton class which maintains the basic data structure and methods for the indexer.
WaybackIndexer The WaybackIndexer starts threads to find new files to be indexed and indexes them.
WaybackIndexerApplication The entry point for the wayback indexer.
 

Package dk.netarkivet.wayback.indexer Description

Retrieves indexes of the ARC files in the repository which are needed by Wayback. Indexer-overview

The WaybackIndexer starts a producer thread with uses the populate operation on the IndexerQueue class to fetch a list of all new files added to the central ArcRepository. The resulting list is added to the ArcFileQueue containing the files which should be indexed.

The WaybackIndexer also creates a number of consumer threads which indexes the retrieved ArcFileLists