Page tree

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.



This month we're opening an experimental access interface, Archives de l'internet Labs. This interface provides full-text searching of a small part of our collections, with the possibility to export results and save searches and selections in a personal workspace. It also provides access to statistics and metadata on the collections.

This interface builds on the work we have done over the past year or so on data mining and full text indexing. It is part of a four-year project at the BnF studying the creation of a service to provide researchers with corpora from the digital collections of the BnF, the web archives having been chosen as the case study for the first year of the project. For the moment this interface will only be available to researchers working on two specific projects who have signed a convention with the BnF, but as part of the overall project we will be looking at how this kind of service can be offered to more researchers.


  • Please complete the doodle poll for the NAS meeting in Vienna by end of July:
    The number of participants is needed to calculate the budget. Thank you!
  • End of May presidential elections took place in Austria, the crawl continues until the new president is sworn in. As mentioned during one of the last calls, one of the political parties is blocking our crawlers. We captured the content with, but still had no time to investigate how to include the warc-files into our archive.
  • The new online search interface will be launched soon and we look forward to your feedback. We are currently waiting for a security check of our IT-department to be completed.



Next meeting