NetarchiveSuite / NAS-2481

Inconsistency with new attribute system

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • 5.1
    • None
    • Heritrix 3
    • None
    • Can be tested as part of TEST1.
      Test that the page
      NAS-GUI-ROOT/History/Harveststatus-download-job-harvest-template.jsp?JobID=1&requestedContentType=text/plain
      correctly shows ignore as the robots.txt policy, along with
      extract_java_script=true
      max_hops=20

    Description

      When I create a selective harvest with netarkivet.dk as the only domain,
      I get the message

      "Viewtype 1 attribute MAX_HOPS undefined. Using default value '20"
      

      when inserting the attributes into the template of the job.

      Furthermore, when inspecting the resulting template,
      I notice that

      RobotsPolicy set to obey (should have been ignore)
      MaxHops set to 20 (correct)
      Extract javascript set to true (correct)
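
To make the expected outcome concrete, here is a minimal sketch of how the defaults could map to template values when a job has no stored attributes. It uses the def_int defaults from the eav_type_attribute table in this report. The function name, the dictionary keys, and the rule that HONOR_ROBOTS_DOT_TXT = 0 means "ignore" are assumptions for illustration, not NetarchiveSuite's actual implementation.

```python
# Hypothetical sketch: names and the 0/1 -> policy mapping are assumptions,
# not NetarchiveSuite's actual code.

# def_int defaults as listed in eav_type_attribute in this report.
DEFAULTS = {"MAX_HOPS": 20, "HONOR_ROBOTS_DOT_TXT": 0, "EXTRACT_JAVASCRIPT": 1}


def expected_template_values(stored, defaults=DEFAULTS):
    """Return the template settings expected for a job.

    `stored` maps attribute names to explicitly stored integer values;
    any attribute missing from it falls back to its default.
    """
    effective = {name: stored.get(name, default) for name, default in defaults.items()}
    return {
        "max_hops": effective["MAX_HOPS"],
        # Assumption: 0 means "do not honor robots.txt", i.e. policy "ignore".
        "robots_policy": "obey" if effective["HONOR_ROBOTS_DOT_TXT"] else "ignore",
        "extract_javascript": bool(effective["EXTRACT_JAVASCRIPT"]),
    }


# No stored attributes, as in this report (eav_attribute is empty).
print(expected_template_values({}))
```

Under these assumptions an empty attribute set yields the policy ignore, which is the value the template was expected to contain.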
      

      The following is the contents of the attribute tables:

      test1svc_harvestdb=> select * from eav_type_attribute;
       tree_id | id |         name         |            class_namespace            |       class_name        | datatype | viewtype | def_int | def_datetime | def_varchar | def_text 
      ---------+----+----------------------+---------------------------------------+-------------------------+----------+----------+---------+--------------+-------------+----------
             2 |  1 | MAX_HOPS             | dk.netarkivet.harvester.datamodel.eav | ContentAttrType_Generic |        1 |        1 |      20 |              |             | 
             2 |  2 | HONOR_ROBOTS_DOT_TXT | dk.netarkivet.harvester.datamodel.eav | ContentAttrType_Generic |        1 |        6 |       0 |              |             | 
             2 |  3 | EXTRACT_JAVASCRIPT   | dk.netarkivet.harvester.datamodel.eav | ContentAttrType_Generic |        1 |        5 |       1 |              |             | 
      (3 rows)
      
      test1svc_harvestdb=> select * from eav_attribute;
       tree_id | id | entity_id | type_id | val_int | val_datetime | val_varchar | val_text 
      ---------+----+-----------+---------+---------+--------------+-------------+----------
      (0 rows)
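
The two query results above can be combined into one view of the effective integer value per attribute: the stored val_int if a row exists, otherwise the type's def_int. The following is a rough sketch only. It uses an in-memory SQLite database in place of the PostgreSQL harvest database, a reduced set of columns, and a hypothetical helper effective_values. Treating entity_id as the identifier of the harvested configuration is also an assumption.

```python
import sqlite3


def effective_values(conn, entity_id):
    """Return {attribute name: stored val_int, or def_int when nothing is stored}."""
    rows = conn.execute(
        """
        SELECT t.name, COALESCE(a.val_int, t.def_int)
        FROM eav_type_attribute AS t
        LEFT JOIN eav_attribute AS a
          ON a.type_id = t.id AND a.tree_id = t.tree_id AND a.entity_id = ?
        ORDER BY t.id
        """,
        (entity_id,),
    ).fetchall()
    return dict(rows)


# Simplified copies of the two tables (only the columns used here).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE eav_type_attribute (
        tree_id INTEGER, id INTEGER, name TEXT,
        datatype INTEGER, viewtype INTEGER, def_int INTEGER);
    CREATE TABLE eav_attribute (
        tree_id INTEGER, id INTEGER, entity_id INTEGER,
        type_id INTEGER, val_int INTEGER);
    INSERT INTO eav_type_attribute VALUES
        (2, 1, 'MAX_HOPS', 1, 1, 20),
        (2, 2, 'HONOR_ROBOTS_DOT_TXT', 1, 6, 0),
        (2, 3, 'EXTRACT_JAVASCRIPT', 1, 5, 1);
    """
)

# eav_attribute is empty, as in this report, so every value is a default.
print(effective_values(conn, entity_id=1))
```

With no rows in eav_attribute, every attribute resolves to its default, so HONOR_ROBOTS_DOT_TXT resolves to 0.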
      

            People

              svc Søren Vejrup Carlsen (Inactive)
              svc Søren Vejrup Carlsen (Inactive)

                Time Tracking

                  Estimated: Not Specified
                  Remaining: Not Specified
                  Logged: 0.2h