Details
- Sub-task
- Resolution: Fixed
- Major
- None
- None
- Rough
Description
The CDX-generating code must work for both ARC and WARC files. Currently, the method dk.netarkivet.common.utils.cdx.ExtractCDX.generateCDX() ignores all files whose names do not end with .arc. This method is used in the harvest documentation phase to generate CDX files for the ARC files coming from Heritrix.
When generating a single CDX entry for a URL request, information from several WARC records is combined.
Note that Wayback already has code to generate a CDX from WARC files.
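The kind of file selection this requires could be sketched as below. This is only an illustration, not the NetarchiveSuite code: the class name ArchiveFileFilter and the helper isArchiveName are hypothetical, and the sketch assumes that matching on the plain suffixes .arc and .warc is sufficient (compressed variants are not handled).

```java
import java.io.File;
import java.io.FilenameFilter;

/**
 * Hypothetical filter (not part of NetarchiveSuite) that accepts both
 * ARC and WARC files instead of only names ending with ".arc".
 */
final class ArchiveFileFilter implements FilenameFilter {

    /** Accepts a directory entry if its name looks like an ARC or WARC file. */
    @Override
    public boolean accept(File dir, String name) {
        return isArchiveName(name);
    }

    /**
     * Returns true for names ending with ".arc" or ".warc".
     * Assumption: a case-sensitive suffix check is enough.
     */
    static boolean isArchiveName(String name) {
        return name != null && (name.endsWith(".arc") || name.endsWith(".warc"));
    }
}
```

Such a filter could be passed to File.list(FilenameFilter) or File.listFiles(FilenameFilter) to select the files to index from a harvest directory.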