Description
harvestInfo.origHarvestDefinitionComments are not included in warcinfo records of WARC data files.
{{harvestInfo.version: 0.5
harvestInfo.jobId: 18099
harvestInfo.channel: PRESSE
harvestInfo.harvestNum: 1474
harvestInfo.origHarvestDefinitionID: 28
harvestInfo.maxBytesPerDomain: -1
harvestInfo.maxObjectsPerDomain: 10000
harvestInfo.orderXMLName: page+1actu
harvestInfo.origHarvestDefinitionName: BnF actualites quotidienne micro
harvestInfo.scheduleName: quotidienne
harvestInfo.harvestFilenamePrefix: BnF-18099-28
harvestInfo.jobSubmitDate: 2016-03-13T09:00:24Z}}
When trying to fix it, we ran into another bug: Non-ASCII characters are misencoded (for this field and other text field: origHarvestDefinitionName, HarvestTemplate).