Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-2534

Remove unwanted folders on job restart

    XMLWordPrintable

Details

    • Hide

      A releasetest needs to made to test this case.
      Easiest is use a crashed h3 job as a template for this
      Step 1: stop the Heritrix Controller application
      Step 2: insert a dummy h3 into the crawldir to prompt the PostProcessing.processOldJobs() into action
      Step 3: restart the Heritrix Controller application

      Show
      A releasetest needs to made to test this case. Easiest is use a crashed h3 job as a template for this Step 1: stop the Heritrix Controller application Step 2: insert a dummy h3 into the crawldir to prompt the PostProcessing.processOldJobs() into action Step 3: restart the Heritrix Controller application

    Description

      TLR suggest on internal wikipage https://sbprojects.statsbiblioteket.dk/pages/viewpage.action?pageId=19595346
      that the following files and directories are deleted when cleaning up after crashed jobs:

      archivefiles-report.txt
      metadata 
      tmp-metadata
      

      Attachments

        Issue Links

          Activity

            People

              svc Søren Vejrup Carlsen (Inactive)
              csr Colin Rosenthal
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: