Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-2598

Failure to parse deduplication record

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • 5.3
    • 5.2.1
    • Wayback
    • None
    • NAS 5.3
    • Hide

      I believe this fix is adequately verified in the release test of the compression software, which tests the affected method in a multithreaded environment.

      Show
      I believe this fix is adequately verified in the release test of the compression software, which tests the affected method in a multithreaded environment.

    Description

      Line like this:

      11:51:03.112 [Thread-7] ERROR d.n.w.batch.DeduplicateToCDXAdapter - Could not adapt deduplicate record to CDX line: '2016-12-05T10:11:38.808Z   200       2026 http://www.w3.org/Icons/valid-xhtml10-blue.png EI http://www.w3.org/Icons/valid-xhtml10-blue image/png #041 20161205101137572+1226 sha1:C3PH3IWTSURQ7XILQRHDIDDGAD2ORRPH www.kaarefc.dk duplicate:"2-1-20161205100306320-00000-4320~kb-test-har-004.kb.dk~8173.arc,151298,20161205100315930",content-size:2408'
      java.lang.NumberFormatException: multiple points
      	at sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1890) ~[na:1.8.0_05]
      	at sun.misc.FloatingDecimal.parseDouble(FloatingDecimal.java:110) ~[na:1.8.0_05]
      	at java.lang.Double.parseDouble(Double.java:538) ~[na:1.8.0_05]
      	at java.text.DigitList.getDouble(DigitList.java:169) ~[na:1.8.0_05]
      	at java.text.DecimalFormat.parse(DecimalFormat.java:2056) ~[na:1.8.0_05]
      	at java.text.SimpleDateFormat.subParse(SimpleDateFormat.java:1869) ~[na:1.8.0_05]
      	at java.text.SimpleDateFormat.parse(SimpleDateFormat.java:1514) ~[na:1.8.0_05]
      	at java.text.DateFormat.parse(DateFormat.java:364) ~[na:1.8.0_05]
      	at dk.netarkivet.wayback.batch.DeduplicateToCDXAdapter.adaptLine(DeduplicateToCDXAdapter.java:91) ~[wayback-indexer-5.2.2.jar:86df93186cc364d1486a21cc60158a1eed03f1bf]
      

      Something to with a screwy date format?

      Attachments

        Activity

          People

            Unassigned Unassigned
            csr Colin Rosenthal
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: