Details
-
New Feature
-
Resolution: Fixed
-
Minor
-
5.2.2, 5.3, 5.3.1
-
None
-
NAS 5.4
Description
You typically receive two mail notifications whenever a harvest fails
1)
Host: narcana-webdanica01.statsbiblioteket.dk
Date: Thu Nov 23 06:51:04 CET 2017
dk.netarkivet.harvester.heritrix3.PostProcessing.storeFiles(PostProcessing.java:269)
Probable error in Heritrix job setup. No arcfiles or warcfiles generated by Heritrix for job 1204
2)
Host: narcana-webdanica01.statsbiblioteket.dk
Date: Thu Nov 23 06:51:04 CET 2017
dk.netarkivet.harvester.heritrix3.PostProcessing.doPostProcessing(PostProcessing.java:165)
Trouble during postprocessing of files in '/opt/webdanica/WEBDANICA/harvester_focused/1204_1511416193560'. Errors accumulated during the postprocessing: Metadata file /opt/webdanica/WEBDANICA/harvester_focused/1204_1511416193560/metadata/1204-metadata-1.warc does not exist
dk.netarkivet.common.exceptions.IllegalState: Metadata file /opt/webdanica/WEBDANICA/harvester_focused/1204_1511416193560/metadata/1204-metadata-1.warc does not exist
at dk.netarkivet.harvester.heritrix3.IngestableFiles.getMetadataArcFiles(IngestableFiles.java:183)
at dk.netarkivet.harvester.heritrix3.PostProcessing.storeFiles(PostProcessing.java:281)
at dk.netarkivet.harvester.heritrix3.PostProcessing.doPostProcessing(PostProcessing.java:159)
at dk.netarkivet.harvester.heritrix3.HarvestControllerServer$HarvesterThread.run(HarvestControllerServer.java:457)
The problem is that if Heritrix3 doesn't create any (w)arc files, no metadata-warc is created
And you really should, as the reports are still being written by Heritrix, and they contain valuable information
Attachments
Issue Links
- related to
-
WEBDAN-282 NetarchiveSuite shouldn't fail the postprocessing, when H3 doesn't archive anything
- Resolved