NetarchiveSuite / NAS-2414

The H3 crawlsummary reports wrong crawlstatus during crawl

Details

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Minor
    • Fix Version/s: 5.5.1
    • Affects Version/s: 5.0-RC1, 5.1, 5.2.2, 5.3.1
    • Component/s: Heritrix 3
    • Labels: None

    Description

      Curiously, while the crawl is still running, the crawl summary in the H3 GUI reports the crawl status as "Finished - Abnormal exit from crawling":

      crawl name: default_orderxml
      crawl status: Finished - Abnormal exit from crawling
      duration: 1h14m29s478ms
      
      seeds crawled: 17
      seeds uncrawled: 0
      
      hosts visited: 387
      
      URIs processed: 23100
      URI successes: 13666
      URI failures: 9434
      URI disregards: 0
      
      novel URIs: 13666
      
      total crawled bytes: 1291646221 (1.2 GiB) 
      novel crawled bytes: 1291646221 (1.2 GiB)
      
      URIs/sec: 3.06
      KB/sec: 282
      

          People

            svc Søren Vejrup Carlsen (Inactive)