Description
The URL validation done by LoadSeeds is too strict.
The characters '
' are not accepted by isRejectableURL method:
nov. 15, 2017 2:59:36 PM dk.kb.webdanica.core.utils.UrlUtils isRejectableURL WARNING: UriSyntaxException thrown for seed 'http://www.ft.dk/searchResults.aspx?sortln=desc&spokesman={049C3793-4A6C-4146-BAC2-2C257673187E}&'. Seed considered bad: java.net.URISyntaxException: Illegal character in query at index 58: http://www.ft.dk/searchResults.aspx?sortln=desc&spokesman={049C3793-4A6C-4146-BAC2-2C257673187E}& at java.net.URI$Parser.fail(URI.java:2848) at java.net.URI$Parser.checkChars(URI.java:3021) at java.net.URI$Parser.parseHierarchical(URI.java:3111) at java.net.URI$Parser.parse(URI.java:3053) at java.net.URI.<init>(URI.java:588) at dk.kb.webdanica.core.utils.UrlUtils.isRejectableURL(UrlUtils.java:54) at dk.kb.webdanica.core.tools.LoadSeeds.processSeeds(LoadSeeds.java:204) at dk.kb.webdanica.core.tools.LoadSeeds.main(LoadSeeds.java:115)