[NAS-2192] TEST12: Unable to browse properly in the material Created: 17/May/13  Updated: 22/May/13  Resolved: 22/May/13

Status: Resolved
Project: NetarchiveSuite
Component/s: Test, Wayback
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Søren Vejrup Carlsen (Inactive) Assignee: Colin Rosenthal
Resolution: Fixed  
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified


 Description   

With the current wayback setup I'm unable to browse in the material.
It could be a problem with the index, it often is.



 Comments   
Comment by Søren Vejrup Carlsen (Inactive) [ 22/May/13 ]

The TEST12 test description has now been updated to avoid this problem to occur

Comment by Colin Rosenthal [ 21/May/13 ]

It appears that the test description needs to amended to include instructions on how one can be certain that the indexer is finished before one moves the index file.

Comment by Colin Rosenthal [ 21/May/13 ]

Some domains are fairly browsable: drive-badmintonklub.dk, news.dk, kb.dk, trineogkaare.dk, but others are missing the front page. These appear to be all redirected domains so the arcfiles have some links from jp.dk and some from jyllands-posten.dk but not any whole pages from jyllands-posten.dk. I think the correct procedure would be to create a better arc/warc testsuite for future runs, using explicitly the redirected domains and with a rather higher bytelimit for the harvests.

Comment by Søren Vejrup Carlsen (Inactive) [ 17/May/13 ]

I can, however, lookup URLs with the advanced search using the "beginning with" mode.

Generated at Tue Apr 16 10:01:34 CEST 2024 using Jira 9.4.15#940015-sha1:bdaa9cbecfb6791ea579749728cab771f0dfe90b.