Page tree
Skip to end of metadata
Go to start of metadata

Agenda for the joint NetarchiveSuite tele-conference 2020-05-05, 13:00-14:00.


  • BNF: Clara, Sara, Géraldine
  • ONB: Andreas
  • KB/DK - Copenhagen: Tue, Stephen, Anders 
  • KB/DK - Aarhus: Sabine, Kristian, Colin
  • BNE: Alicia, Nuria
  • KB/Sweden: Pär, Peter

Join from PC, Mac, Linux, iOS or Android:

Or an H.323/SIP room system:

    Meeting ID: 104 443 571

    SIP: 104443571@

Or Skype for Business (Lync):

Or Telephone:

Denmark: +45 89 88 37 88 or +45 32 71 31 57
United Kingdom: +44 203 051 2874 or +44 203 481 5237 or +44 203 966 3809 or +44 131 460 1196
Finland: +358 9 4245 1488 or +358 3 4109 2129
Sweden: +46 850 539 728 or +46 8 4468 2488
Norway: +47 7349 4877 or +47 2396 0588
US: +1 669 900 6833 or +1 646 558 8656
    Meeting ID: 104 443 571

    International numbers available:

You can join a meeting by using apps from a pc, a tablet or a smartphone, but you can also use the browser based version (it works with newer versions of Chrome or Firefox)

Update on NAS latest tests and developments

Feedback on latest developments and tests.

BnF PR on H3 have been accepted:

Status of the production sites


  • Preparation for implementation of SolR wayback for external users are nearly finished. We had to clear security issues and are now waiting for the final go from our legal experts
  • We have send a recommendation for decision to our directors: We want to implement Warc-files created by crawling with into our preservation system 
  • Our broad crawl is ongoing, some of us are busy with the job follow up 
  • We are negociating with our IT department: we want them to allot time to implement the new Heritrix release, which hopefully – among others – will solve our problems with harvesting “lazy load”
  • The event crawl "Coronavirus in Denmark" is ongoing, of cause. We get help from people outside the Netarchive Team, among others, colleagues who are not able to do their ordinary work


We have finished our tests on the new Heritrix IIPC version and plan to put it into production before the end of June. This version will integrate also the migration to Postgresql 11.

After this deployment, we will be ready to launch a crawl of YouTube channels about the coronavirus. To enrich this collection, we will also launch an Instagram crawl : we are targeting a selection of 150 instagram profiles. Images and text will be crawled from



  • We are focused on our coronavirus collection. We are collecting proposals from the regional web curators because they don’t have access to the tools due to the situation. We are also accepting public nominations using a web form. We have more than 5 Tb of information and almost 2,000 seeds.
  • We had planned to launch our annual broad crawl in April but we postpone it until the situation is normalized.
  • We have already installed the version 6.1 of BCWeb in a test environment. He will test it these days before uploading it to the production environment


Next meetings

  • June 9, 2020
  • July 7, 2020
  • September 8, 2020
  • October 6, 2020
  • November 3, 2020
  • December 8, 2020
  • January 5, 2021

Any other business?


  • No labels