Page tree

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.



The broad crawl is almost finished!

The team is now starting to think about Heritrix 3. We have planned several work packages in December and January to be ready for the London and Tallinn workshops: gathering documentation, configuration in the BnF IT environment, demonstration of functionalities, preparation of short tests... We aim to be fully prepared to discuss this project with you!




We are currently working on our World War I collection. Recently, we have developed some small tools for our administrative tasks, e.g. e-mail transmission to the site-owners. Additionally, we are planning a QA tool for the comparison of screenshots und a new online search interface.

Next year we have an annual budget of 10 TB for following crawls:

  • broad crawl including new TLDs .wien and .tirol
  • event harvesting about Eurovision song contest
  • politics collection will be extended by four local elections in 2015


Next meeting

Any other business?