Details
Type: Bug
Resolution: Fixed
Priority: Blocker
Description
07-01-2013 17:40:45 dk.netarkivet.archive.bitarchive.Bitarchive batch
INFO: Finished batch job dk.netarkivet.harvester.indexserver.RawMetadataCache$GetMetadataArchiveBatchJob with result: 1 failures in processing 1 files at 172.17.0.53_BitApp_2KB-TEST-BAR-014 BitarchiveServer BitApp_2 KBN 1
07-01-2013 17:40:45 dk.netarkivet.harvester.indexserver.RawMetadataCache$GetMetadataArchiveBatchJob processRecord
INFO: null - application/warc-fields
This worked in the 3.21 release. The log excerpt above shows that the batch job now also tries to process the warc-info record (which did not exist in 3.21) while extracting the CDX and crawl-log data from the metadata WARC file.
The batch job must be told to ignore warc-info records.
We have already done this in other batch jobs used inside NetarchiveSuite.
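
The proposed fix can be sketched as follows. This is only an illustration, not the actual NetarchiveSuite code: the class WarcInfoSkipSketch, its nested SketchRecord type, and the example URL are hypothetical stand-ins, and the real batch job API may look different. The sketch assumes the record exposes its WARC-Type header; per the WARC format, a warc-info record carries the type "warcinfo" and usually has no target URL, which would match the "null - application/warc-fields" line in the log.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Illustrative sketch only; none of these types are NetarchiveSuite classes. */
final class WarcInfoSkipSketch {

    /** Hypothetical stand-in for one record of a metadata WARC file. */
    static final class SketchRecord {
        final String warcType; // value of the WARC-Type header, e.g. "warcinfo"
        final String mimeType; // content type of the record
        final String url;      // target URL; typically null for warcinfo records

        SketchRecord(String warcType, String mimeType, String url) {
            this.warcType = warcType;
            this.mimeType = mimeType;
            this.url = url;
        }
    }

    /** Returns true if the record is a warc-info record (WARC-Type: warcinfo). */
    static boolean isWarcInfo(SketchRecord record) {
        return "warcinfo".equalsIgnoreCase(record.warcType);
    }

    /**
     * Walks all records and skips warc-info records before any further
     * processing, so they can no longer make the job fail.
     * Returns "url - mimeType" for every record that was processed.
     */
    static List<String> processAll(List<SketchRecord> records) {
        List<String> processed = new ArrayList<>();
        for (SketchRecord record : records) {
            if (isWarcInfo(record)) {
                continue; // ignore the warc-info record entirely
            }
            processed.add(record.url + " - " + record.mimeType);
        }
        return processed;
    }

    public static void main(String[] args) {
        // Hypothetical input: one warc-info record and one crawl-log record.
        List<SketchRecord> records = Arrays.asList(
                new SketchRecord("warcinfo", "application/warc-fields", null),
                new SketchRecord("resource", "text/plain",
                        "metadata://example/crawl.log"));
        for (String line : processAll(records)) {
            System.out.println(line); // prints "metadata://example/crawl.log - text/plain"
        }
    }
}
```

Skipping at the top of the per-record loop keeps the rest of the processing unchanged, which is the least invasive place to put the check.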