NetarchiveSuite / NAS-2246

WaybackCDXExtractionARCBatchJob should skip the "filedesc:" record


Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Minor
    • Fix Version/s: 4.4
    • Affects Version/s: 4.0, 4.2
    • Component/s: Wayback
    • Labels: None
    • Test by running the dk.netarkivet.wayback.batch.WaybackCDXExtractionArcAndWarcBatchJobTester.testARCProcess unit test.

    Description

      The batch job WaybackCDXExtractionARCBatchJob should skip the "filedesc:" record. Currently, processing that record fails and a "Could not parse" message is written to the logs, which annoys our administrator.

      23-08-2013 09:11:10 dk.netarkivet.wayback.batch.WaybackCDXExtractionARCBatchJob processRecord
      INFO: Could not parse 'filedesc://4473-20-20130123122636-00144-sb-test-har-001.statsbiblioteket.dk.arc.open 0.0.0.0 20130123122636 text/plain 1283'
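
      The requested behaviour could be sketched roughly as follows. This is only an illustration, not the actual NetarchiveSuite fix: the class FiledescFilter and its methods are hypothetical names, and the sketch assumes that the URL is the first space-separated field of an ARC record header line, as in the log message above.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for illustration; not part of NetarchiveSuite. */
final class FiledescFilter {
    /** URL scheme prefix of the ARC file description record. */
    static final String FILEDESC_PREFIX = "filedesc:";

    /** Returns true when the record URL denotes the ARC file description record. */
    static boolean isFiledescRecord(String recordUrl) {
        return recordUrl != null && recordUrl.startsWith(FILEDESC_PREFIX);
    }

    /** Keeps only the header lines that should be passed on to CDX parsing. */
    static List<String> recordsToIndex(List<String> arcHeaderLines) {
        List<String> kept = new ArrayList<>();
        for (String line : arcHeaderLines) {
            // Assumption: the URL is the first space-separated field of the header line.
            String url = line.split(" ", 2)[0];
            if (!isFiledescRecord(url)) {
                kept.add(line);
            }
        }
        return kept;
    }
}
```

      With a check like this placed before the parsing step, the "filedesc:" record would be dropped silently instead of producing a log entry.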

          People

            svc Søren Vejrup Carlsen (Inactive)