[NAS-2582] DeduplicateToCDXAdapter fails to identify new dedup format Created: 22/Nov/16  Updated: 03/Jan/17  Resolved: 23/Nov/16

Status: Closed
Project: NetarchiveSuite
Component/s: Wayback
Affects Version/s: 5.2
Fix Version/s: 5.2.1

Type: Bug Priority: Blocker
Reporter: Colin Rosenthal Assignee: Colin Rosenthal
Resolution: Fixed  
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified


 Description   

The new dedup format in crawl logs includes a timestamp. The DeduplicateToCDXAdapter class has been modified to identify the new format with a new regexp, but the implementation is buggy - as revealed by a simple unit test.


Generated at Thu Apr 25 13:03:52 CEST 2024 using Jira 9.4.15#940015-sha1:bdaa9cbecfb6791ea579749728cab771f0dfe90b.