Details
- Improvement
- Resolution: Fixed
- Minor
- None
- None
Description
The following attributes of the WARCWriterProcessor should be tied to settings in NetarchiveSuite, and their values should be written into the harvest template when the harvest job (the Job class) is created:
<boolean name="skip-identical-digests">false</boolean>
<boolean name="write-requests">true</boolean>
<boolean name="write-metadata">true</boolean>
<boolean name="write-revisit-for-identical-digests">true</boolean>
<boolean name="write-revisit-for-not-modified">true</boolean>
The same should apply to some or all of the following:
<boolean name="compress">false</boolean>
<string name="prefix">netarkivet</string>
<string name="suffix">HOSTNAME</string>
<long name="max-size-bytes">100000000</long>
<integer name="pool-max-active">5</integer>
<integer name="pool-max-wait">300000</integer>
<long name="total-bytes-to-write">0</long>
The prefix and suffix should probably be derived from a configured naming convention rather than hard-coded in the template.
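As a rough illustration of the idea, the sketch below overwrites the WARCWriterProcessor attributes in a template fragment with values supplied by the caller. It is not the NetarchiveSuite implementation: the class name `WarcTemplateUpdater`, the method `apply`, and the wrapper element used in the usage note are hypothetical, and the sketch assumes only that the processor element carries a `class` attribute ending in `WARCWriterProcessor` and that each setting is an element with a `name` attribute, as in the lines listed above.

```java
import java.io.StringReader;
import java.io.StringWriter;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Hypothetical helper: copies setting values into a harvest template. */
final class WarcTemplateUpdater {

    /**
     * Returns the template with every named attribute inside a
     * WARCWriterProcessor element replaced by the value from `values`.
     * Attributes that are not listed in `values` keep their template value.
     */
    static String apply(String templateXml, Map<String, String> values) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(templateXml)));

            NodeList all = doc.getElementsByTagName("*");
            for (int i = 0; i < all.getLength(); i++) {
                Element candidate = (Element) all.item(i);
                // Assumption: the processor is identified by its class attribute.
                if (!candidate.getAttribute("class").endsWith("WARCWriterProcessor")) {
                    continue;
                }
                // Only touch settings nested inside the processor element.
                NodeList settings = candidate.getElementsByTagName("*");
                for (int j = 0; j < settings.getLength(); j++) {
                    Element setting = (Element) settings.item(j);
                    String name = setting.getAttribute("name");
                    if (values.containsKey(name)) {
                        setting.setTextContent(values.get(name));
                    }
                }
            }

            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            transformer.transform(new DOMSource(doc), new StreamResult(out));
            return out.toString();
        } catch (Exception e) {
            throw new IllegalStateException("Could not update harvest template", e);
        }
    }
}
```

A caller could then pass values read from the NetarchiveSuite settings, for example `WarcTemplateUpdater.apply(template, Map.of("write-requests", "false", "prefix", "42-117"))`. The prefix `42-117` stands for a made-up job-ID/harvest-ID naming convention; the actual convention is left open by this issue.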
Issue Links
- was spawned by
- NAS-1958 Replace the "ARCWriterProcesser" with "WARCWriterProcessor" in our Heritrix templates.
- Resolved