Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-1819

WaybackIndexer reads whole archive every time

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Major
    • None
    • None
    • Wayback
    • None

    Description

      It takes too long to process the output of a complete FileListJob every time. Could just add a single config parameter which is a regexp for file names to match. e.g.

      8\\d{4}-.*|9\\d{4}-.*|\\d{6,}-.*
      

      Attachments

        Activity

          People

            Unassigned Unassigned
            csr Colin Rosenthal
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated: