Day 1 (Thursday 24) - 14:00 - 17:00 Technical Discussions
Location: Tower 3, Level 4, Meeting Room // Chair: Mikis
- See below ideas for technical discussions.
Day 2 (Friday 25) - 09:00 - 12:30 Common Curator/Technical Discussions
Location: Tower 3, Level 4, Meeting Room // Chairs: Mikis, Sara
- See Curator Agenda
Ideas for Technical Discussions
Using the Bit Repository as archive (Mikis)
We are in DK hoping to move to the The Bit Repository project, Mikis will talk a bit about this.
First point of the afternoon, common with Jhonas Track.
Possiblities from Spring (Nicolas)
Nicolas has as part of the work with the BnF curator front-end worked with the Spring Framework. Is there any obvious places in the NetarchoveSuite system we could benefit from using Spring?
Let's share any Wayback experiences
- Are BnF or ONB using Wayback for access.
- ONB is using Wayback for Access and QA - see Slides from Andreas.
- Are they using the NAS Wayback module.
Free text search (Mikis)
In Denmark we are starting to look into using SOLR for free text searching of our web archives. Has BnF or ONB any experience with free text searching?
Development process (Mikis)
We have had som problems up to codefreezes where is has been difficult to establish a properly QA'ed codebase. How do we work towards a more robust codefreeze?
Let's try to define a prioritized list of improvements we would like to include in the development in the near future.
- Introduce Spring.
- Convert to Maven build.
- Harvest data model refactoring (NAS-1833).
- Refactor system state monitor JMX code (NAS-1829).
- NAS-1859 Create automatic and continuous sanity test of the NAS system.
Virtualisation (Bert, Christophe)
At BnF, we have set up virtual servers to run harvesters et indexers. Has DK and ONB experience in or want to move to virtualisation?
Deduplication (Bert, Christophe)
Deduplication processes have been hard to work through and are still a black blox at BnF. At BnF, we would like to share about it.
At BnF, we have developped a tool to create a domain/seed list for our snapshot harvest which analyses DNS and redirections. Is it of interest to anyone? Could it be officially part of NetarchiveSuite?