dk.netarkivet.harvester.datamodel
Class Constants

java.lang.Object
  extended by dk.netarkivet.harvester.datamodel.Constants

public class Constants
extends java.lang.Object

Constants used by the datamodel and webinterface packages.


Field Summary
static long ALIAS_TIMEOUT_IN_MILLISECONDS
          The value for alias timeout, in milliseconds.
static long DEFAULT_MAX_BYTES
          This is the default number set as max bytes harvested.
static long DEFAULT_MAX_JOB_RUNNING_TIME
          The default maximum time in seconds available for each harvesting job.
static long DEFAULT_MAX_OBJECTS
          This is the default number set as max harvested objects.
static int DEFAULT_MAX_REQUEST_RATE
          This is the default number set as max request rate.
static java.lang.String GLOBAL_CRAWLER_TRAPS_ELEMENT_NAME
          The name used for the element in order.xml which contains global crawler traps.
static long HERITRIX_MAXBYTES_INFINITY
          Max bytes of -1 means infinity (i.e. other factors will determine when the job ends).
static long HERITRIX_MAXJOBRUNNINGTIME_INFINITY
          Max job running time of 0 means infinite job running time (i.e. other factors will determine when the job ends).
static long HERITRIX_MAXOBJECTS_INFINITY
          Max objects of -1 means infinity (i.e. other factors will determine when the job ends).
(package private) static int MAX_COMBINED_SEED_LIST_SIZE
          Maximum size of a combined seedlist entry (for a job) in the database.
(package private) static int MAX_COMMENT_SIZE
          Maximum size of comment entries in the database.
(package private) static int MAX_CRAWLER_TRAP_SIZE
          Maximum size of crawlertrap entries in the database.
static int MAX_ERROR_DETAIL_SIZE
          Maximum size of detailed error messages from harvests and uploads.
static int MAX_ERROR_SIZE
          Maximum size of error messages from harvests and uploads.
(package private) static int MAX_NAME_SIZE
          Maximum size of name entries in the database.
(package private) static int MAX_ORDERXML_SIZE
          Maximum size of orderxml entries (stringified XML) in the database.
(package private) static int MAX_OWNERINFO_SIZE
          Maximum size of ownerinfo entries in the database.
(package private) static int MAX_PASSWORD_SIZE
          Maximum size of password entries in the database.
(package private) static int MAX_REALM_NAME_SIZE
          Maximum size of password realm entries in the database.
(package private) static int MAX_SEED_LIST_SIZE
          Maximum size of seedlist entries in the database.
(package private) static int MAX_URL_SIZE
          Maximum size of password url entries in the database.
(package private) static int MAX_USER_NAME_SIZE
          Maximum size of password username entries in the database.
static java.lang.String NEXT_JOB_ID
          Setting used in JobDBDao after an admin machine breakdown.
(package private) static java.lang.String PROTOCOL_REGEXP
          Regexp for checking whether a URL contains a protocol, such as ftp:// or http://.
 
Method Summary
static java.util.regex.Matcher getIdMatcher()
          Returns a new matcher that matches harvest definition file names and sets group 1 to be the id part.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

PROTOCOL_REGEXP

static final java.lang.String PROTOCOL_REGEXP
Regexp for checking whether a URL contains a protocol, such as ftp:// or http://.

See Also:
Constant Field Values
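
As an illustration of the kind of check this constant supports, the sketch below tests whether a URL string begins with a protocol. The actual value of PROTOCOL_REGEXP is not shown on this page, so the pattern used here, along with the names ProtocolCheckSketch, ASSUMED_PROTOCOL_REGEXP and hasProtocol, is an assumption made for illustration only.

```java
import java.util.regex.Pattern;

public class ProtocolCheckSketch {
    // Hypothetical stand-in: the real PROTOCOL_REGEXP value may differ.
    // Matches a scheme (letter, then letters/digits/+/./-) followed by "://".
    public static final String ASSUMED_PROTOCOL_REGEXP =
            "^[a-zA-Z][a-zA-Z0-9+.-]*://.*";

    private static final Pattern PROTOCOL =
            Pattern.compile(ASSUMED_PROTOCOL_REGEXP);

    // Returns true if the whole string starts with "<scheme>://".
    public static boolean hasProtocol(String url) {
        return PROTOCOL.matcher(url).matches();
    }

    public static void main(String[] args) {
        System.out.println(hasProtocol("http://www.example.org/"));
        System.out.println(hasProtocol("www.example.org"));
    }
}
```

A caller could use such a check to decide whether a seed needs a default protocol prepended before it is stored; whether the datamodel does this is not stated here.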

MAX_NAME_SIZE

static final int MAX_NAME_SIZE
Maximum size of name entries in the database.

See Also:
Constant Field Values

MAX_COMMENT_SIZE

static final int MAX_COMMENT_SIZE
Maximum size of comment entries in the database.

See Also:
Constant Field Values

MAX_CRAWLER_TRAP_SIZE

static final int MAX_CRAWLER_TRAP_SIZE
Maximum size of crawlertrap entries in the database.

See Also:
Constant Field Values

MAX_URL_SIZE

static final int MAX_URL_SIZE
Maximum size of password url entries in the database.

See Also:
Constant Field Values

MAX_REALM_NAME_SIZE

static final int MAX_REALM_NAME_SIZE
Maximum size of password realm entries in the database.

See Also:
Constant Field Values

MAX_USER_NAME_SIZE

static final int MAX_USER_NAME_SIZE
Maximum size of password username entries in the database.

See Also:
Constant Field Values

MAX_PASSWORD_SIZE

static final int MAX_PASSWORD_SIZE
Maximum size of password entries in the database.

See Also:
Constant Field Values

MAX_OWNERINFO_SIZE

static final int MAX_OWNERINFO_SIZE
Maximum size of ownerinfo entries in the database.

See Also:
Constant Field Values

MAX_SEED_LIST_SIZE

static final int MAX_SEED_LIST_SIZE
Maximum size of seedlist entries in the database.

See Also:
Constant Field Values

MAX_COMBINED_SEED_LIST_SIZE

static final int MAX_COMBINED_SEED_LIST_SIZE
Maximum size of a combined seedlist entry (for a job) in the database.

See Also:
Constant Field Values

MAX_ORDERXML_SIZE

static final int MAX_ORDERXML_SIZE
Maximum size of orderxml entries (stringified XML) in the database.

See Also:
Constant Field Values

MAX_ERROR_SIZE

public static final int MAX_ERROR_SIZE
Maximum size of error messages from harvests and uploads.

See Also:
Constant Field Values

MAX_ERROR_DETAIL_SIZE

public static final int MAX_ERROR_DETAIL_SIZE
Maximum size of detailed error messages from harvests and uploads.

See Also:
Constant Field Values

DEFAULT_MAX_REQUEST_RATE

public static final int DEFAULT_MAX_REQUEST_RATE
This is the default number set as max request rate.

See Also:
Constant Field Values

HERITRIX_MAXBYTES_INFINITY

public static final long HERITRIX_MAXBYTES_INFINITY
Max bytes of -1 means infinity (i.e. other factors will determine when the job ends).

See Also:
Constant Field Values

HERITRIX_MAXOBJECTS_INFINITY

public static final long HERITRIX_MAXOBJECTS_INFINITY
Max objects of -1 means infinity (i.e. other factors will determine when the job ends).

See Also:
Constant Field Values

HERITRIX_MAXJOBRUNNINGTIME_INFINITY

public static final long HERITRIX_MAXJOBRUNNINGTIME_INFINITY
Max job running time of 0 means infinite job running time (i.e. other factors will determine when the job ends).

See Also:
Constant Field Values
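
The three HERITRIX_*_INFINITY constants are sentinels, and they differ: -1 means "no limit" for bytes and objects, while 0 means "no limit" for job running time. The sketch below shows one way a caller might honour them. The sentinel values are taken from the descriptions above; the class HarvestLimitSketch, its method names, and the use of a ">=" comparison are assumptions, not the harvester's actual logic.

```java
public class HarvestLimitSketch {
    // Sentinel values as documented: -1 for bytes/objects, 0 for running time.
    static final long MAXBYTES_INFINITY = -1L;
    static final long MAXOBJECTS_INFINITY = -1L;
    static final long MAXJOBRUNNINGTIME_INFINITY = 0L;

    // True if a finite byte limit is set and has been reached.
    public static boolean byteLimitReached(long maxBytes, long harvestedBytes) {
        if (maxBytes == MAXBYTES_INFINITY) {
            return false; // unlimited: other factors end the job
        }
        return harvestedBytes >= maxBytes;
    }

    // True if a finite object limit is set and has been reached.
    public static boolean objectLimitReached(long maxObjects, long harvestedObjects) {
        if (maxObjects == MAXOBJECTS_INFINITY) {
            return false;
        }
        return harvestedObjects >= maxObjects;
    }

    // True if a finite running time (in seconds) is set and has elapsed.
    public static boolean timeLimitReached(long maxSeconds, long elapsedSeconds) {
        if (maxSeconds == MAXJOBRUNNINGTIME_INFINITY) {
            return false; // 0 means unlimited here, not "stop immediately"
        }
        return elapsedSeconds >= maxSeconds;
    }

    public static void main(String[] args) {
        System.out.println(byteLimitReached(-1L, 5_000_000L));
        System.out.println(timeLimitReached(0L, 86_400L));
        System.out.println(timeLimitReached(3_600L, 7_200L));
    }
}
```

Checking the sentinel before comparing avoids treating a running-time limit of 0 as already exceeded.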

DEFAULT_MAX_BYTES

public static final long DEFAULT_MAX_BYTES
This is the default number set as max bytes harvested. Set to the maximum number of bytes harvested from any domain per harvest, unless explicitly overridden.


DEFAULT_MAX_OBJECTS

public static final long DEFAULT_MAX_OBJECTS
This is the default number set as max harvested objects.


DEFAULT_MAX_JOB_RUNNING_TIME

public static final long DEFAULT_MAX_JOB_RUNNING_TIME
The default maximum time in seconds available for each harvesting job. Set to unlimited (0) in the default settings. Used to restrict the running time for snapshot harvest jobs.


ALIAS_TIMEOUT_IN_MILLISECONDS

public static final long ALIAS_TIMEOUT_IN_MILLISECONDS
The value for alias timeout, in milliseconds.


NEXT_JOB_ID

public static final java.lang.String NEXT_JOB_ID
Setting used in JobDBDao after an admin machine breakdown.

See Also:
Constant Field Values

GLOBAL_CRAWLER_TRAPS_ELEMENT_NAME

public static final java.lang.String GLOBAL_CRAWLER_TRAPS_ELEMENT_NAME
The name used for the element in order.xml which contains global crawler traps.

See Also:
Constant Field Values

Method Detail

getIdMatcher

public static java.util.regex.Matcher getIdMatcher()
Returns a new matcher that matches harvest definition file names and sets group 1 to be the id part.

Returns:
A new matcher instance.
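
Because getIdMatcher() takes no arguments, a caller presumably points the returned matcher at a file name with the standard Matcher.reset(CharSequence) call and then reads group 1. The sketch below illustrates that usage. The file-name pattern is not documented on this page, so the pattern "harvest_(\d+)\.xml", the class IdMatcherSketch, and the methods assumedIdMatcher and extractId are hypothetical stand-ins.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class IdMatcherSketch {
    // Hypothetical file-name pattern; the real one is not shown in this page.
    private static final Pattern ASSUMED_ID_PATTERN =
            Pattern.compile("harvest_(\\d+)\\.xml");

    // Stand-in with the documented shape: no arguments, fresh Matcher each call.
    public static Matcher assumedIdMatcher() {
        return ASSUMED_ID_PATTERN.matcher("");
    }

    // Returns the id captured in group 1, or -1 if the name does not match.
    public static long extractId(String fileName) {
        Matcher m = assumedIdMatcher().reset(fileName);
        return m.matches() ? Long.parseLong(m.group(1)) : -1L;
    }

    public static void main(String[] args) {
        System.out.println(extractId("harvest_42.xml"));
        System.out.println(extractId("readme.txt"));
    }
}
```

Returning a new matcher on each call matters because java.util.regex.Matcher is not safe for use by multiple concurrent threads.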