Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-8 Switch to Maven as build tool
  3. NAS-2304

Maven file encoding should explicitly be ASCII/utf8

    XMLWordPrintable

Details

    • Sub-task
    • Resolution: Fixed
    • Minor
    • 5.0-Milestone1
    • 4.4
    • None
    • None
    • 5.0 Milestone1

    Description

      One of the more subtle error sources is how non-ASCII characters are stored in files in the file system and interpreted by programs. A typical example is storing the character "ø" in a file on a Linux machine where the character is represented by two bytes in the default UTF-8 encoding, and loading the same file on Windows machine where the two bytes are converted to two characters "ø" instead of the original single "'ø".

      The only robust solution I have found so far is to explicitly enforce in the pom.xml that all files are encoded in ASCII and then appropriately encode all non-ASCII character found in sources and testdata.

      Attachments

        Activity

          People

            nicl@kb.dk Nicholas Clarke (Inactive)
            tra Thorbjørn Ravn Andersen (Inactive)
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: