Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-2587

Software stated in the metadata files warcinfo records cannot be easily parsed

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • 5.3
    • 5.2
    • WARC
    • None
    • BNF
    • NAS 5.3

    Description

      The software stated in the warcinfo record of the metadata file has changed between NAS 4 and NAS 5.
      It was: software: NetarchiveSuite/Version: 4.5.0 status RELEASE/https://sbforge.org/display/NAS
      And now, it is:
      software: NetarchiveSuite/Version: 5.2 (<a href="https://github.com/netarchivesuite/netarchivesuite/commit/f711a23f1f6efce89d29a0e117932d28fc22042f">f711a23f1f</a>)/https://sbforge.org/display/NAS

      The use of HTML code <a href=""> breaks the file parsing. And since the rest of information at this level is only plain text, we should probably stick to that. Is there a reason for having HTML code in here?

      Could we rather have something like :
      software: NetarchiveSuite/Version: 5.2 status RELEASE/https://sbforge.org/display/NAS (when it is frozen)
      software: NetarchiveSuite/Version: 5.2 status SNAPSHOT/https://github.com/netarchivesuite/netarchivesuite/commit/f711a23f1f6efce89d29a0e117932d28fc22042f (when it is still at development stage)

      or :

      software: NetarchiveSuite/Version: 5.2 status RELEASE/https://sbforge.org/display/NAS/ - https://github.com/netarchivesuite/netarchivesuite/commit/f711a23f1f6efce89d29a0e117932d28fc22042f
      if we really need to have the commit number in all production data (which, I think, is not necessary...).

      Attachments

        Activity

          People

            Unassigned Unassigned
            sara Sara Aubry
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: