Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-1919

Deduplicate Lines not Matched in Indexer

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • 4.2
    • 3.17.0
    • Wayback
    • None
    • SB/KB

    Description

      Lines like

      2011-06-30T09:49:35.648Z 200 115743 http://www.b.dk/upload/webred/FLASH/2011/jan/opinion/version_2.swf EEE http://www.b.dk/berlingskebarometer application/x-shockwave-flash #002 20110630094935566+74 sha1:EQ4J6HHINKCPAZYVSIKCT4P47IEZK5DE - le:IOException@ExtractorSWF,duplicate:"124179-32-20110627134529-00001-kb-prod-har-003.kb.dk.arc,9350968",content-size:116073
      

      are recognised as deduplication records by DeduplicateToCDXAdapter but deduplicate record cannot be parsed from them.

      Attachments

        Activity

          People

            svc Søren Vejrup Carlsen (Inactive)
            csr Colin Rosenthal
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 5h
                5h
                Remaining:
                Remaining Estimate - 5h
                5h
                Logged:
                Time Spent - Not Specified
                Not Specified