dk.netarkivet.harvester.datamodel
Class Constants

java.lang.Object
  extended by dk.netarkivet.harvester.datamodel.Constants

public class Constants
extends java.lang.Object

Constants used by the datamodel and webinterface packages.


Field Summary
static long ALIAS_TIMEOUT_IN_MILLISECONDS
          Default value for alias timeout, 1 year.
static long DEFAULT_MAX_BYTES
          Default maximum number of bytes harvested.
static long DEFAULT_MAX_OBJECTS
          Default maximum number of harvested objects.
static int DEFAULT_MAX_REQUEST_RATE
          Default maximum request rate.
static java.lang.String GLOBAL_CRAWLER_TRAPS_ELEMENT_NAME
          The name used for the element in order.xml which contains global crawler traps.
static long HERITRIX_MAXBYTES_INFINITY
          Max bytes of -1 means infinity.
static long HERITRIX_MAXOBJECTS_INFINITY
          Max objects of -1 means infinity.
(package private) static int MAX_COMBINED_SEED_LIST_SIZE
          Maximum size of a combined seedlist entry (for a job) in the database.
(package private) static int MAX_COMMENT_SIZE
          Maximum size of comment entries in the database.
(package private) static int MAX_CRAWLER_TRAP_SIZE
          Maximum size of crawlertrap entries in the database.
static int MAX_ERROR_DETAIL_SIZE
          Maximum size of detailed error messages from harvests and uploads.
static int MAX_ERROR_SIZE
          Maximum size of error messages from harvests and uploads.
(package private) static int MAX_NAME_SIZE
          Maximum size of name entries in the database.
(package private) static int MAX_ORDERXML_SIZE
          Maximum size of orderxml entries (stringified XML) in the database.
(package private) static int MAX_OWNERINFO_SIZE
          Maximum size of ownerinfo entries in the database.
(package private) static int MAX_PASSWORD_SIZE
          Maximum size of password entries in the database.
(package private) static int MAX_REALM_NAME_SIZE
          Maximum size of password realm entries in the database.
(package private) static int MAX_SEED_LIST_SIZE
          Maximum size of seedlist entries in the database.
(package private) static int MAX_URL_SIZE
          Maximum size of password url entries in the database.
(package private) static int MAX_USER_NAME_SIZE
          Maximum size of password username entries in the database.
static java.lang.String NEXT_JOB_ID
          Setting used by JobDBDao after a breakdown of the admin machine.
(package private) static java.lang.String PROTOCOL_REGEXP
          Regexp for checking whether a URL contains a protocol, such as ftp:// or http://.
 
Method Summary
static java.util.regex.Matcher getIdMatcher()
          Returns a new matcher that matches harvest definition file names and sets group 1 to be the id part.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

PROTOCOL_REGEXP

static final java.lang.String PROTOCOL_REGEXP
Regexp for checking whether a URL contains a protocol, such as ftp:// or http://.

See Also:
Constant Field Values
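
As an illustration of how a protocol check of this kind is typically applied, the following is a minimal sketch. The pattern shown is a hypothetical stand-in, not the actual value of PROTOCOL_REGEXP (which is listed under Constant Field Values), and the class and method names are invented for this example.

```java
import java.util.regex.Pattern;

class ProtocolCheckSketch {
    // Hypothetical stand-in pattern: a scheme name followed by "://".
    // The real PROTOCOL_REGEXP value may differ.
    static final Pattern HAS_PROTOCOL =
            Pattern.compile("^[a-zA-Z][a-zA-Z0-9+.-]*://.*");

    /** Returns true if the whole string starts with something like "http://". */
    static boolean hasProtocol(String url) {
        return HAS_PROTOCOL.matcher(url).matches();
    }

    /** Prepends "http://" when no protocol is present (an assumed default). */
    static String withDefaultProtocol(String url) {
        return hasProtocol(url) ? url : "http://" + url;
    }
}
```

A seed such as "example.org" would be normalized to "http://example.org" under these assumptions, while "ftp://example.org" is left unchanged.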

MAX_NAME_SIZE

static final int MAX_NAME_SIZE
Maximum size of name entries in the database.

See Also:
Constant Field Values

MAX_COMMENT_SIZE

static final int MAX_COMMENT_SIZE
Maximum size of comment entries in the database.

See Also:
Constant Field Values

MAX_CRAWLER_TRAP_SIZE

static final int MAX_CRAWLER_TRAP_SIZE
Maximum size of crawlertrap entries in the database.

See Also:
Constant Field Values

MAX_URL_SIZE

static final int MAX_URL_SIZE
Maximum size of password url entries in the database.

See Also:
Constant Field Values

MAX_REALM_NAME_SIZE

static final int MAX_REALM_NAME_SIZE
Maximum size of password realm entries in the database.

See Also:
Constant Field Values

MAX_USER_NAME_SIZE

static final int MAX_USER_NAME_SIZE
Maximum size of password username entries in the database.

See Also:
Constant Field Values

MAX_PASSWORD_SIZE

static final int MAX_PASSWORD_SIZE
Maximum size of password entries in the database.

See Also:
Constant Field Values

MAX_OWNERINFO_SIZE

static final int MAX_OWNERINFO_SIZE
Maximum size of ownerinfo entries in the database.

See Also:
Constant Field Values

MAX_SEED_LIST_SIZE

static final int MAX_SEED_LIST_SIZE
Maximum size of seedlist entries in the database.

See Also:
Constant Field Values

MAX_COMBINED_SEED_LIST_SIZE

static final int MAX_COMBINED_SEED_LIST_SIZE
Maximum size of a combined seedlist entry (for a job) in the database.

See Also:
Constant Field Values

MAX_ORDERXML_SIZE

static final int MAX_ORDERXML_SIZE
Maximum size of orderxml entries (stringified XML) in the database.

See Also:
Constant Field Values

MAX_ERROR_SIZE

public static final int MAX_ERROR_SIZE
Maximum size of error messages from harvests and uploads.

See Also:
Constant Field Values

MAX_ERROR_DETAIL_SIZE

public static final int MAX_ERROR_DETAIL_SIZE
Maximum size of detailed error messages from harvests and uploads.

See Also:
Constant Field Values

DEFAULT_MAX_REQUEST_RATE

public static final int DEFAULT_MAX_REQUEST_RATE
Default maximum request rate.

See Also:
Constant Field Values

HERITRIX_MAXBYTES_INFINITY

public static final long HERITRIX_MAXBYTES_INFINITY
Max bytes of -1 means infinity.

See Also:
Constant Field Values

HERITRIX_MAXOBJECTS_INFINITY

public static final long HERITRIX_MAXOBJECTS_INFINITY
Max objects of -1 means infinity.

See Also:
Constant Field Values
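
Because -1 is a sentinel rather than an ordinary number, comparisons against a limit need a special case. The following is a minimal sketch of that handling; the class and method names are hypothetical, and the local constant only mirrors the -1 value described for HERITRIX_MAXBYTES_INFINITY and HERITRIX_MAXOBJECTS_INFINITY.

```java
class LimitSketch {
    // Mirrors the documented sentinel: -1 means infinity (no limit).
    static final long INFINITY = -1L;

    static boolean isUnlimited(long limit) {
        return limit == INFINITY;
    }

    /** True if 'used' has reached the limit; never true for an unlimited limit. */
    static boolean limitReached(long used, long limit) {
        return !isUnlimited(limit) && used >= limit;
    }

    /** Returns the stricter of two limits, treating -1 as "no limit". */
    static long stricter(long a, long b) {
        if (isUnlimited(a)) {
            return b;
        }
        if (isUnlimited(b)) {
            return a;
        }
        return Math.min(a, b);
    }
}
```

Note that a naive Math.min(a, b) would wrongly pick -1 as the stricter limit, which is why the sentinel is checked first.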

DEFAULT_MAX_BYTES

public static final long DEFAULT_MAX_BYTES
Default maximum number of bytes harvested. This is the maximum number of bytes harvested from any domain per harvest, unless explicitly overridden.


DEFAULT_MAX_OBJECTS

public static final long DEFAULT_MAX_OBJECTS
Default maximum number of harvested objects. Note that although this is a long, it is sometimes used as an int, so do not set this value too high. The default max objects should now be infinity, since the byte limit is used instead.
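
The warning about long-to-int use can be illustrated with a small sketch. A plain (int) cast silently wraps around for values above Integer.MAX_VALUE; the hypothetical helper below saturates instead. It is not part of this class and only shows why an overly high value would be risky.

```java
class NarrowingSketch {
    /** Narrows a long to an int, clamping to the int range instead of wrapping. */
    static int toIntSaturated(long value) {
        if (value > Integer.MAX_VALUE) {
            return Integer.MAX_VALUE;
        }
        if (value < Integer.MIN_VALUE) {
            return Integer.MIN_VALUE;
        }
        return (int) value;
    }

    public static void main(String[] args) {
        long tooHigh = 3_000_000_000L;
        // A plain cast wraps around to a negative number.
        System.out.println((int) tooHigh);
        // The saturating helper stays at the largest int instead.
        System.out.println(toIntSaturated(tooHigh));
    }
}
```

Small values, including the -1 infinity sentinel, pass through unchanged.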


ALIAS_TIMEOUT_IN_MILLISECONDS

public static final long ALIAS_TIMEOUT_IN_MILLISECONDS
Default value for alias timeout: 1 year. TODO: make this into a setting in HarvesterSettings.

See Also:
Constant Field Values
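
To show why this constant is a long, here is a small worked sketch of one year expressed in milliseconds. It assumes a 365-day year; the exact value of ALIAS_TIMEOUT_IN_MILLISECONDS is the one listed under Constant Field Values and may be computed differently.

```java
import java.util.concurrent.TimeUnit;

class AliasTimeoutSketch {
    // Assumption: one year taken as 365 days.
    static final long ONE_YEAR_MS = TimeUnit.DAYS.toMillis(365);

    /** True if an alias last updated at 'lastUpdatedMs' has expired at 'nowMs'. */
    static boolean isExpired(long lastUpdatedMs, long nowMs) {
        return nowMs - lastUpdatedMs > ONE_YEAR_MS;
    }
}
```

Under this assumption the timeout is 31,536,000,000 ms, which exceeds Integer.MAX_VALUE (2,147,483,647) and therefore cannot be held in an int.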

NEXT_JOB_ID

public static final java.lang.String NEXT_JOB_ID
Setting used by JobDBDao after a breakdown of the admin machine.

See Also:
Constant Field Values

GLOBAL_CRAWLER_TRAPS_ELEMENT_NAME

public static final java.lang.String GLOBAL_CRAWLER_TRAPS_ELEMENT_NAME
The name used for the element in order.xml which contains global crawler traps.

See Also:
Constant Field Values

Method Detail

getIdMatcher

public static java.util.regex.Matcher getIdMatcher()
Returns a new matcher that matches harvest definition file names and sets group 1 to be the id part.

Returns:
A new matcher instance.
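
A matcher obtained this way is typically pointed at a file name with Matcher.reset(CharSequence) and then queried for group 1. The sketch below shows that usage pattern. The file-name pattern and the local getIdMatcher() are hypothetical stand-ins, since this page does not state the actual pattern used by Constants.getIdMatcher().

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class IdMatcherSketch {
    // Hypothetical pattern: "<anything>_<digits>.xml", with the digits as group 1.
    private static final Pattern ID_PATTERN = Pattern.compile(".*_(\\d+)\\.xml");

    /** Stand-in for Constants.getIdMatcher(): a fresh matcher on empty input. */
    static Matcher getIdMatcher() {
        return ID_PATTERN.matcher("");
    }

    /** Returns the id part of the file name, or null if the name does not match. */
    static Long extractId(String fileName) {
        Matcher m = getIdMatcher().reset(fileName);
        return m.matches() ? Long.valueOf(m.group(1)) : null;
    }
}
```

Returning a new Matcher on each call matters because Matcher instances are not safe for use by multiple concurrent threads, whereas a compiled Pattern can be shared.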