Agenda for the joint NetarchiveSuite tele-conference 2018-11-06, 13:00-14:00.
The Release Test for NetarchiveSuite 5.5 is under way and expected to be finished this week, followed by a formal release as soon as possible after that. See NetarchiveSuite 5.5 Release Notes for more details about the content of the release.
We are beginning a new development phase concentrating on updating our backend storage and processing architecture. We are moving to http://bitrepository.org software and also investigating a more modern mass-processing platform - for example index-generation via hadoop instead of our own NAS batch framework. For this reason we will be surveying our NAS partners to find out which parts of the NAS package are actually in use in different institutions.
2019 NAS workshop
We launched step 2 of our 3rd broad crawl this year (with a limit of 14GB per domain) on 2018/10/23
We looked at all our open issues and grouped them thematically:
The aim was to find the most urgent problems, which we cannot solve without developers help
We are working on the implementation of SOLR wayback to search in Netarchive. By now SOLR Wayback still is a protoype. Amongst others we need to clarify UX, security questions, how to do the logging and to chose a platform for the user access.
But the display of the results is much better than Blacklight
We are working on a procedure for a new type of usage from the archive: data extraction for research project from the archive. The data to be extracted are determined by a search string – hopefully this would be rather easy with SOLR-wayback
We are going to prepare a mini-event harvest “Week 46”. In week 46 the Royal Library collects local broadcast stations’ (both radio and television) productions. For a couple of years ago Netarchive started collecting there home pages.