Class Constants


  • public final class Constants
    extends Object
    Constants used by the datamodel and webinterface packages.
    • Field Detail

      • MAX_ERROR_SIZE

        public static final int MAX_ERROR_SIZE
        Maximum size of error messages from harvests and uploads.
        See Also:
        Constant Field Values
      • MAX_ERROR_DETAIL_SIZE

        public static final int MAX_ERROR_DETAIL_SIZE
        Maximum size of detailed error messages from harvests and uploads.
        See Also:
        Constant Field Values
      • DEFAULT_MAX_REQUEST_RATE

        public static final int DEFAULT_MAX_REQUEST_RATE
        This is the default number set as max request rate.
        See Also:
        Constant Field Values
      • HERITRIX_MAXBYTES_INFINITY

        public static final long HERITRIX_MAXBYTES_INFINITY
        Max bytes of -1 means infinity (i.e other factors will determine when the job ends).
        See Also:
        Constant Field Values
      • HERITRIX_MAXOBJECTS_INFINITY

        public static final long HERITRIX_MAXOBJECTS_INFINITY
        Max objects of -1 means infinity (i.e other factors will determine when the job ends).
        See Also:
        Constant Field Values
      • HERITRIX_MAXJOBRUNNINGTIME_INFINITY

        public static final long HERITRIX_MAXJOBRUNNINGTIME_INFINITY
        Max job running time of 0 means infinite job running time (i.e other factors will determine when the job ends).
        See Also:
        Constant Field Values
      • DEFAULT_MAX_BYTES

        public static final long DEFAULT_MAX_BYTES
        This is the default number set as max bytes harvested. Set to the max number of bytes we harvest from any domain per harvest, unless explicitly deciding otherwise.
      • DEFAULT_MAX_OBJECTS

        public static final long DEFAULT_MAX_OBJECTS
        This is the default number set as max harvested objects.
      • DEFAULT_MAX_JOB_RUNNING_TIME

        public static final long DEFAULT_MAX_JOB_RUNNING_TIME
        The default maximum time in seconds available for each harvesting job. Set to unlimited (0) in the default settings. Used to restrict the running time for snapshot harvest jobs.
      • ALIAS_TIMEOUT_IN_MILLISECONDS

        public static final long ALIAS_TIMEOUT_IN_MILLISECONDS
        The value for alias timeout, in milliseconds.
      • NEXT_JOB_ID

        public static final String NEXT_JOB_ID
        Settings used in JobDBDao after admin machine break down. *
        See Also:
        Constant Field Values
      • GLOBAL_CRAWLER_TRAPS_ELEMENT_NAME

        public static final String GLOBAL_CRAWLER_TRAPS_ELEMENT_NAME
        The name used for the element in order.xml which contains global crawler traps.
        See Also:
        Constant Field Values
      • BYTES_PER_HERITRIX_BYTELIMIT_UNIT

        public static final long BYTES_PER_HERITRIX_BYTELIMIT_UNIT
        See Also:
        Constant Field Values
    • Method Detail

      • getIdMatcher

        public static Matcher getIdMatcher()
        Returns a new matcher that matches harvest definition file names and sets group 1 to be the id part.
        Returns:
        A new matcher instance.