Uploaded image for project: 'WebDanica'
  1. WebDanica
  2. WEBDAN-48

Make wrapper to run hadoop-parsedText on warc-files produced by Heritrix

    XMLWordPrintable

Details

    • Improvement
    • Resolution: Won't Fix
    • Minor
    • None
    • None
    • HADOOP
    • None

    Description

      We want to let Netarchivesuite produce the parsed-text as part of the Documentation phase of the Heritrix3 controller

      Attachments

        1. 881.txt
          4 kB
        2. 898.txt
          8 kB

        Issue Links

          Activity

            People

              Unassigned Unassigned
              svc Søren Vejrup Carlsen (Inactive)
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: