Uploaded image for project: 'WebDanica'
  1. WebDanica
  2. WEBDAN-277

Url validation too strict

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • 3.0
    • 2.0
    • COMMON
    • None

    Description

      The URL validation done by LoadSeeds is too strict.
      The characters '

      {' and '}

      ' are not accepted by isRejectableURL method:

      nov. 15, 2017 2:59:36 PM dk.kb.webdanica.core.utils.UrlUtils isRejectableURL
      WARNING: UriSyntaxException thrown for seed 'http://www.ft.dk/searchResults.aspx?sortln=desc&spokesman={049C3793-4A6C-4146-BAC2-2C257673187E}&'. Seed considered bad: java.net.URISyntaxException: Illegal character in query at index 58: http://www.ft.dk/searchResults.aspx?sortln=desc&spokesman={049C3793-4A6C-4146-BAC2-2C257673187E}&
      	at java.net.URI$Parser.fail(URI.java:2848)
      	at java.net.URI$Parser.checkChars(URI.java:3021)
      	at java.net.URI$Parser.parseHierarchical(URI.java:3111)
      	at java.net.URI$Parser.parse(URI.java:3053)
      	at java.net.URI.<init>(URI.java:588)
      	at dk.kb.webdanica.core.utils.UrlUtils.isRejectableURL(UrlUtils.java:54)
      	at dk.kb.webdanica.core.tools.LoadSeeds.processSeeds(LoadSeeds.java:204)
      	at dk.kb.webdanica.core.tools.LoadSeeds.main(LoadSeeds.java:115)
      

      Attachments

        Activity

          People

            Unassigned Unassigned
            svc Søren Vejrup Carlsen (Inactive)
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: