[NAS-2582] DeduplicateToCDXAdapter fails to identify new dedup format Created: 22/Nov/16 Updated: 03/Jan/17 Resolved: 23/Nov/16 |
|
Status: | Closed |
Project: | NetarchiveSuite |
Component/s: | Wayback |
Affects Version/s: | 5.2 |
Fix Version/s: | 5.2.1 |
Type: | Bug | Priority: | Blocker |
Reporter: | Colin Rosenthal | Assignee: | Colin Rosenthal |
Resolution: | Fixed | ||
Labels: | None | ||
Remaining Estimate: | Not Specified | ||
Time Spent: | Not Specified | ||
Original Estimate: | Not Specified |
Description |
The new dedup format in crawl logs includes a timestamp. The DeduplicateToCDXAdapter class has been modified to identify the new format with a new regexp, but the implementation is buggy - as revealed by a simple unit test. |