  1. NetarchiveSuite
  2. NAS-2481

Inconsistency with new attribute system

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 5.1
    • Component/s: Heritrix 3
    • Labels:
      None
    • Verification:
      Can be tested as part of TEST1.
      Test that the page
      NAS-GUI-ROOT/History/Harveststatus-download-job-harvest-template.jsp?JobID=1&requestedContentType=text/plain
      correctly shows robots.txt as ignored, with
      extract_java_script=true
      max_hops=20


      Description

      When I create a selective harvest with netarkivet.dk as the only domain,
      I get the message

      "Viewtype 1 attribute MAX_HOPS undefined. Using default value '20"
      

      when inserting the attributes into the template of the job.

      Furthermore, when inspecting the resulting template,
      I notice that:

      RobotsPolicy set to obey (should have been ignore)
      MaxHops set to 20 (correct)
      Extract javascript true (correct)
      

      The following is the contents of the attribute tables:

      test1svc_harvestdb=> select * from eav_type_attribute;
       tree_id | id |         name         |            class_namespace            |       class_name        | datatype | viewtype | def_int | def_datetime | def_varchar | def_text 
      ---------+----+----------------------+---------------------------------------+-------------------------+----------+----------+---------+--------------+-------------+----------
             2 |  1 | MAX_HOPS             | dk.netarkivet.harvester.datamodel.eav | ContentAttrType_Generic |        1 |        1 |      20 |              |             | 
             2 |  2 | HONOR_ROBOTS_DOT_TXT | dk.netarkivet.harvester.datamodel.eav | ContentAttrType_Generic |        1 |        6 |       0 |              |             | 
             2 |  3 | EXTRACT_JAVASCRIPT   | dk.netarkivet.harvester.datamodel.eav | ContentAttrType_Generic |        1 |        5 |       1 |              |             | 
      (3 rows)
      
      test1svc_harvestdb=> select * from eav_attribute;
       tree_id | id | entity_id | type_id | val_int | val_datetime | val_varchar | val_text 
      ---------+----+-----------+---------+---------+--------------+-------------+----------
      (0 rows)
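
The behaviour the report expects can be sketched roughly as follows: since eav_attribute has no rows, every attribute should fall back to the def_int of its type in eav_type_attribute, and HONOR_ROBOTS_DOT_TXT = 0 should then yield the policy "ignore". This is only an illustrative sketch. The function names, the tuple layout, and the 0/1 to ignore/obey mapping are assumptions made here, not NetarchiveSuite's actual code.

```python
# Hypothetical sketch; names are illustrative, not NetarchiveSuite's API.

# (id, name, def_int) values copied from the eav_type_attribute output above.
TYPE_ATTRIBUTES = [
    (1, "MAX_HOPS", 20),
    (2, "HONOR_ROBOTS_DOT_TXT", 0),
    (3, "EXTRACT_JAVASCRIPT", 1),
]


def resolve_attributes(type_rows, value_rows):
    """Map attribute name to an int value.

    value_rows holds (type_id, val_int) pairs from eav_attribute.
    A stored val_int wins; otherwise the type's def_int is used.
    """
    stored = {type_id: val_int for (type_id, val_int) in value_rows}
    return {
        name: stored.get(type_id, def_int)
        for (type_id, name, def_int) in type_rows
    }


def robots_policy(honor_robots):
    # Assumed mapping: 0 means "ignore", any other value means "obey".
    return "obey" if honor_robots else "ignore"


# eav_attribute returned 0 rows, so only the defaults apply.
resolved = resolve_attributes(TYPE_ATTRIBUTES, [])
policy = robots_policy(resolved["HONOR_ROBOTS_DOT_TXT"])
```

Under these assumptions the defaults alone give MaxHops 20, Extract javascript true, and RobotsPolicy ignore, which is what the report says the template should have contained.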
      

              People

              • Assignee:
                svc Søren Vejrup Carlsen
                Reporter:
                svc Søren Vejrup Carlsen
              • Watchers:
                2

                  Time Tracking

                  Estimated: Not Specified
                  Remaining: Not Specified
                  Logged: 0.2h