Page tree
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Next »

Agenda for the joint BNF, ONB, SB, KB and BNE NetarchiveSuite tele-conference 2016-09-20, 13:00-14:00.

Practical information


  • BNF: Lam, Annick, Sara
  • ONB: Michaela, Andreas
  • KB/DK: Søren, Stephen, Nicholas
  • SB: Sabine, Colin, Niels
  • BNE: Juan Carlos, Fernando, Elena
  • KB/Sweden: Bengt

IIPC crawler hackathon in London

September 22-23. Søren, Colin, Bert will attend.

Topics, attendees:

Common questions/interests to bring?

NAS 5.2 Developement Update

On BnF side: some bugfixes:

  NAS-2544 - Getting issue details... STATUS

  NAS-2545 - Getting issue details... STATUS

  NAS-2546 - Getting issue details... STATUS

  NAS-2553 - Getting issue details... STATUS

Translation of new keys in French and German.

Considering the adoption of WARC revisit records for duplicates.

NAS workshop in Vienna

January 30th 2017 - February 1st 2017 - Vienna


NetarchiveSuite Curator Issues

Should we "reanimate" our curator roadmap/backlog, revise it and discuss it in Vienna?


Status of the production sites


Broad crawl

  • Last week we launched the third broad crawl 2016. The crawl limit per domaine will be max. 100 MB. There will be special crawls for ministeries and government bodies, and for ultra big sites (e.g.
  • We will try to get in touch with the webpage owneers/web hotels who are blocking our crawler (about 11% are blocking us)

Event crawl

  • The event collection for the Olympics in Rio 2016 will go on until the end of the Paralympics 2016

Selctive crawls

  • We are working on the configuration of the regional/local news media crawls.
  • Facebook
    • We have test-crawled about 60 Danish Facebook profiles with Archive-IT. We are analyzing how much we get from the profiles. We have to renew our account with Archive-IT after the end of November and we are trying to negotiate a good prize.
    • We made a special crawl of Prime Minister Lars Løkkes Facebook profile on 2016.08.30, the day he published his 2025 plan.

Compression of the archive

  • We are preparing for the compression, but this awaits NAS release 5.3

Last not least

Last week we learned, that the ministry of culture wants KB and SB to merge: From January 2017 we will be “Nationalbiblioteket” with two locations, in Copenhagen and Aarhus






  • We switched to NAS 5.2 already because we had severe problems with https websites with the former version. It went smooth so far. We are still using the arc format, because we have to refactor all our tools before we switch to warc.
  • The crawl about our presidential elections still running, we have a new election date beginning of December and hope to be able to finish the crawl soon.
  • Apart from one small, additional thematic crawl we will only have ongoing crawls until the end of the year. Next domain crawl is scheduled for 2017.





Next meetings

  • October 25
  • November 29
  • January 3, 2017

Any other business?




  • No labels