Uploaded image for project: 'NetarchiveSuite'
  1. NetarchiveSuite
  2. NAS-2496

Redirect for jp.dk fails in test wayback

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 5.0, 5.2
    • Fix Version/s: 5.3
    • Component/s: Wayback
    • Labels:
      None
    • Verification:
      Hide

      Believed solved as part of NAS-2585. See the description for the verification.

      Show
      Believed solved as part of NAS-2585 . See the description for the verification.

      Description

      In TEST12, using the standard set of warcfiles, I can find two "hits" for jp.dk, but clicking on them produces an error like this in the the wayback tomcat log:

      11:06:47.090 [Thread 41 proxy handling: ] INFO  d.n.wayback.NetarchiveResourceStore - Received request for resource from file '5-1-20130117172315-00000-kb-test-har-002.kb.dk.warc' at offset '2804'
      11:06:47.090 [Thread 41 proxy handling: ] DEBUG d.n.a.a.d.JMSArcRepositoryClient - Requesting get of record '5-1-20130117172315-00000-kb-test-har-002.kb.dk.warc:2804'
      11:06:47.113 [Thread 41 proxy handling: ] DEBUG d.n.common.distribute.Synchronizer - Received reply for message: ID:139-130.226.228.10(bf:2a:1f:8a:82:85)-43110-1454666807098: To TEST12_COMMON_THE_REPOS ReplyTo TEST12_COMMON_THIS_REPOS_CLIENT_130_226_228_10_WIA_WAYBACKWEBAPPTEST12 OK Arcfile: 5-1-20130117172315-00000-kb-test-har-002.kb.dk.warc Offset: 2804
      11:06:47.113 [Thread 41 proxy handling: ] DEBUG d.n.a.a.d.JMSArcRepositoryClient - Reply received after 0 seconds
      11:06:47.114 [Thread 41 proxy handling: ] INFO  d.n.wayback.NetarchiveResourceStore - Retrieved resource from file '5-1-20130117172315-00000-kb-test-har-002.kb.dk.warc' at offset '2804'
      11:06:47.114 [Thread 41 proxy handling: ] DEBUG d.n.c.d.a.BitarchiveRecord - Reading 303 bytes from objectBuffer
      11:06:47.114 [Thread 41 proxy handling: ] DEBUG d.n.wayback.NetarchiveResourceStore - Setting response code '301'
      11:06:47.114 [Thread 41 proxy handling: ] INFO  d.n.wayback.NetarchiveResourceStore - Setting redirect Location header to 'http://jyllands-posten.dk/'
      11:06:47.114 [Thread 41 proxy handling: ] DEBUG d.n.wayback.NetarchiveResourceStore - ARCRecord created with code '-1'
      11:06:47.114 [Thread 41 proxy handling: ] INFO  d.n.wayback.NetarchiveResourceStore - Returning resource 'dk.netarkivet.wayback.NetarchiveResourceStore$1@24e1f49f'
      WARNING Premature EOF before end-of-record: {statuscode=301, subject-uri=jp.dk/, ip-address=www.jp.dk, absolute-offset=2804, length=303, creation-date=Thu Jan 17 18:23:16 CET 2013, content-type=application/http, version=301, Location=http://jyllands-posten.dk/}
      

      Searching directly for jyllands-posten.dk works fine.

      The exact same behaviour is seen in both ia wayback and OpenWayback.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                csr Colin Rosenthal
              • Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: