Description
The new dedup format in crawl logs includes a timestamp. The DeduplicateToCDXAdapter class has been modified to recognize the new format with a new regexp, but the implementation is buggy, as a simple unit test reveals.
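
To illustrate the kind of check such a unit test might make, here is a minimal sketch of a single regexp that accepts a dedup annotation both with and without the timestamp. It is not the actual DeduplicateToCDXAdapter code. The annotation shape `duplicate:"<file>,<offset>[,<timestamp>]"`, the class name `DedupAnnotationSketch`, the method `parse`, and the sample file names are all assumptions made for illustration.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class DedupAnnotationSketch {
    // ASSUMPTION: the annotation looks like duplicate:"<file>,<offset>[,<timestamp>]".
    // The third field is optional, so old-format lines without a timestamp still match.
    private static final Pattern DUPLICATE = Pattern.compile(
            "duplicate:\"([^,\"]+),(\\d+)(?:,([^,\"]+))?\"");

    /**
     * Returns {file, offset, timestamp}, where timestamp is null for the
     * old format, or an empty Optional if the line has no dedup annotation.
     */
    static Optional<String[]> parse(String crawlLogLine) {
        Matcher m = DUPLICATE.matcher(crawlLogLine);
        if (!m.find()) {
            return Optional.empty();
        }
        return Optional.of(new String[] { m.group(1), m.group(2), m.group(3) });
    }

    public static void main(String[] args) {
        // Hypothetical annotations, one in each format.
        String oldLine = "duplicate:\"example-1.arc,1234\"";
        String newLine = "duplicate:\"example-1.arc,1234,20130101120000\"";
        for (String line : new String[] { oldLine, newLine }) {
            String[] f = parse(line).get();
            System.out.println(f[0] + " " + f[1] + " " + f[2]);
        }
    }
}
```

Making the timestamp an optional group keeps one pattern for both formats, so a test can feed in one line of each kind and compare the extracted fields directly.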