Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-399

IDNA converts whole URL to ascii not just hostname

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • None
    • 0.5
    • None

    Description

      Given a URL like
      http://www.danske.dk/pølse
      it is converted to
      http://www.danske.xn--dk/plse-t1a
      Also given pure ASCII seeds like
      http://www.netarkivet.dk/website/links/index-da.htm it chokes on some of the
      characters:
      WARNING: Cannot convert seed http://www.netarkivet.dk/website/links/index-da.htm
      to ASCII
      gnu.inet.encoding.IDNAException: Contains non-LDH characters.
      (which is strange since the web version http://josefsson.org/idn.php is fine
      with that one=.
      NOTE: This bug is originally from Bugzilla bug_id=398.

      Attachments

        Activity

          People

            lars lars [X] (Inactive)
            lars lars [X] (Inactive)
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: