NetarchiveSuite / NAS-2417

Error in default H3 template

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • 5.0
      Install a Quickstart system.
      Make a netarkivet configuration for each template.
      Generate a netarkivet harvest using each configuration.
      Make sure that none of them has Spring configuration issues.
      Note: If Heritrix3 starts and harvests something, the test is deemed successful.

    Description

      During TEST4, I found that the inserted crawler trap had no effect. The default H3 template currently contains:

      <!-- ...and REJECT those from a configurable (initially empty) set of URI regexes... -->
      <bean class="org.archive.modules.deciderules.MatchesListRegexDecideRule">
        <property name="listLogicalOr" value="true" />
        <property name="regexList">
          <list>
      ....
      

      This should instead be:

      <!-- ...and REJECT those from a configurable (initially empty) set of URI regexes... -->
      <bean class="org.archive.modules.deciderules.MatchesListRegexDecideRule">
        <property name="decision" value="REJECT"/>
        <property name="listLogicalOr" value="true" />
        <property name="regexList">
          <list>
      ....
      

      Otherwise, URLs matching the crawler traps are accepted for crawling instead of being rejected.
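
      For reference, a complete version of the corrected bean might look like the sketch below. The snippets in this issue are truncated, so the closing tags are assumed, and the regex entry is a hypothetical placeholder rather than a pattern taken from the NetarchiveSuite template.

      ```xml
      <!-- Sketch only: the regex below is a hypothetical placeholder -->
      <bean class="org.archive.modules.deciderules.MatchesListRegexDecideRule">
        <!-- Explicit decision so that matching URIs are rejected, not accepted -->
        <property name="decision" value="REJECT"/>
        <property name="listLogicalOr" value="true" />
        <property name="regexList">
          <list>
            <!-- hypothetical crawler trap pattern -->
            <value>.*/calendar/.*</value>
          </list>
        </property>
      </bean>
      ```

      With the decision property set explicitly, rerunning the test harvest and checking that URLs matching the trap are no longer crawled should be a reasonable way to confirm the fix.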

          People

            svc Søren Vejrup Carlsen (Inactive)