Details
-
Bug
-
Resolution: Fixed
-
Minor
Description
During TEST4, I found that the inserted crawler trap had no effect. The current configuration reads:
<!-- ...and REJECT those from a configurable (initially empty) set of URI regexes... -->
<bean class="org.archive.modules.deciderules.MatchesListRegexDecideRule">
  <property name="listLogicalOr" value="true" />
  <property name="regexList">
    <list>
      ....
It should instead be:
<!-- ...and REJECT those from a configurable (initially empty) set of URI regexes... -->
<bean class="org.archive.modules.deciderules.MatchesListRegexDecideRule">
  <property name="decision" value="REJECT"/>
  <property name="listLogicalOr" value="true" />
  <property name="regexList">
    <list>
      ....
Otherwise, with no explicit decision set on the rule, URLs matching the crawler traps are accepted for crawling instead of being rejected.
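For reference, a complete version of the corrected bean might look like the sketch below. The regex in the `<value>` element is a hypothetical crawler-trap pattern chosen purely for illustration; it is not taken from this issue, and the actual list contents are elided above.

```xml
<!-- Sketch only: the regex below is a hypothetical crawler-trap pattern. -->
<bean class="org.archive.modules.deciderules.MatchesListRegexDecideRule">
  <!-- Explicit REJECT, so that matching URIs are rejected rather than accepted. -->
  <property name="decision" value="REJECT"/>
  <!-- true: a URI matches if ANY regex in the list matches it. -->
  <property name="listLogicalOr" value="true"/>
  <property name="regexList">
    <list>
      <!-- Hypothetical example: endlessly nested calendar pages. -->
      <value>.*/calendar/\d{4}/\d{2}/.*</value>
    </list>
  </property>
</bean>
```

With `listLogicalOr` set to `true`, a single matching regex is enough to trigger the rule's decision, so each trap pattern can be added as its own `<value>` entry.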