Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-1825

The batchjob HarvestedUrlsForDomainBatchJob returns too little output

    XMLWordPrintable

Details

    • Bug
    • Resolution: Won't Fix
    • Major
    • None
    • 3.14.0
    • Viewerproxy
    • None

    Description

      Working on NAS-1815, which enables you to filter the crawl.log based on a regular expression, it seems that the batchjob HarvestedUrlsForDomainBatchJob doesn't return all the lines that are relevant to the domain in question.

      It seems that this problem exists for any domain argument.

      Attachments

        1. netarkivetdk_log_part_returned_by_batchjob.txt
          1 kB
          Søren Vejrup Carlsen
        2. netarkivetdk.logpart.txt
          24 kB
          Søren Vejrup Carlsen

        Activity

          People

            Unassigned Unassigned
            svc Søren Vejrup Carlsen (Inactive)
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: