The broad crawl is almost finished!
The team is now starting to think about Heritrix 3. We have planned several work packages for December and January to be ready for the London and Tallinn workshops: gathering documentation, configuring Heritrix 3 in the BnF IT environment, demonstrating its functionality, preparing short tests... We aim to be fully prepared to discuss this project with you!
We are currently working on our World War I collection. Recently, we have developed some small tools for our administrative tasks, e.g. sending e-mails to site owners. Additionally, we are planning a QA tool for comparing screenshots and a new online search interface.
Next year we have an annual budget of 10 TB for the following crawls:
Any other business?