



First of all, we are pleased to welcome Kevin Locoh-Donou to our team as data engineer for the LIFRANUM research project, for a period of 9 months.

On the occasion of "Fantastic Futures", the 3rd international conference on Artificial Intelligence (AI) for libraries, archives and museums, which took place at the BnF on December 10th 2021, the digital legal deposit service is highlighting its collection of AI websites:

Finally, our 2021 broad crawl ended on November 14th, after a little less than 5 weeks. In total, 2.5 billion URLs were crawled, amounting to 114 TB.




  • Ending the broad crawl of the .gal domain (the regional domain of Galicia): 2,500 domains and 315 GB of information.
  • We are studying the creation of a new comics collection to harvest webcomics and comics published on the internet, and all of this short-lived, freely accessible production.
  • The update to NAS 7.2 is ready in preproduction. We are waiting for new, powerful hardware in January to carry it out.



Next meetings