Page tree
Skip to end of metadata
Go to start of metadata

Agenda for the joint BNF, ONB, SB and KB NetarchiveSuite tele-conference October 28th 2014, 13:00-14:00.

Practical information

  • TDC tele-conference:
    • Dial in number (+45) 70 26 50 45
    • Dial in code 9064479#
  • BridgeIT: BridgeIT conference will be available about 5 min. before start of meeting. The Bridgit url is The Bridgit password is sbview.


  • BNF: Clément, Lam and Annick
  • ONB: Michaela and Andreas
  • KB: Tue, Søren and Nicholas
  • SB: Colin, Sabine, Mikis and Ditte


  • H3 development status

NetarchiveSuite workshop 2014-2015

Status of the production sites


We are preparing an event harvest for parliamentary elections in 2015. The election campaign already started a little bit. The coming event harvest is based on the evaluation of our last big event harvest on the ESC 2014 and on experiences from former event harvests on parliamentary elections.

1st of july 2015 Netarchivet will have its 10th anniversary. We have started the preparation of the celebration in conjunction to an international RESAW conference in Aarhus in june 2015.

We just finished our last broad crawl for 2014 – with a limit of 100 MB for each domain (because we reached the TB budget limit) 2 developers are analyzing the content of our archive –we hope we can find ways to save TB with their help.


The broad crawl is almost finished!

The team is now starting to think about Heritrix 3. We have planned several work packages in December and January to be ready for the London and Tallinn workshops: gathering documentation, configuration in the BnF IT environment, demonstration of functionalities, preparation of short tests... We aim to be fully prepared to discuss this project with you!



We are currently working on our World War I collection. Recently, we have developed some small tools for our administrative tasks, e.g. e-mail transmission to the site-owners. Additionally, we are planning a QA tool for the comparison of screenshots und a new online search interface.

Next year we have an annual budget of 10 TB for following crawls:

  • broad crawl including new TLDs .wien and .tirol
  • event harvesting about Eurovision song contest
  • politics collection will be extended by four local elections in 2015


Next meeting

Any other business?




  • No labels