Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-1720 Enable WARC file writing and handling in the NetarchiveSuite
  3. NAS-1962

Store the contents of the metadata-1.arc files as WARC-records

    XMLWordPrintable

Details

    • Rough
    • Hide

      See issue NAS-2061 for verification

      Show
      See issue NAS-2061 for verification

    Description

      Find a way to store the contents of the metadata-1.arc files as WARC-records. This means identifying a way to refer these WARC-Records to the WARC-files harvested by Heritrix in this HarvestJob.
      Idea: Extract the ID of the WARC-info record in the warc-files produced by Heritrix, and insert all these as WARC-Concurrent-To identifiers in the the warc-metadata records.

      Attachments

        Issue Links

          Activity

            People

              nicl@kb.dk Nicholas Clarke (Inactive)
              mss Mikis Seth Sørensen (Inactive)
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - 35h
                  35h
                  Remaining:
                  Remaining Estimate - 35h
                  35h
                  Logged:
                  Time Spent - Not Specified
                  Not Specified