Update on NAS latest tests and developments
NetarchiveSuite 7.0 has been released: NetarchiveSuite 7.0 Release Notes
For the rest of the spring, the Core Development Team (i.e. Colin and Rasmus) will concentrate on support tasks related to the migration to and deployment of NetarchiveSuite 7.0, so there will be very limited resources for development work on the NetarchiveSuite codebase.
Status of the production sites
We are pleased to announce that last month we published the seed lists for our selective crawls on the new version of the BnF website dedicated to APIs and datasets. These lists are generated from BCWeb exports and include some crawl settings and descriptive elements such as themes and keywords.
For the second consecutive year, we have launched an Instagram crawl. We plan to run five Instagram crawls, some of which will focus on specific subjects such as the Olympic Games or the regional and departmental elections in France.
Finally, our in-house harvesting workshop on Flash is coming to an end. It proved difficult to find a way to automatically harvest some of the websites with Flash animations, because some URLs are dynamically generated or relative and are therefore inaccessible to Heritrix. We will therefore try to identify all the URLs manually and launch the harvest in a second step.
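As an illustration only, the manual step could feed a small script that turns the collected URLs into an absolute, deduplicated seed list before the second-step harvest. This is a minimal sketch, not the workflow actually used at BnF: the function name `build_seed_list`, the base page address and the sample URLs are all hypothetical, and only Python's standard `urllib.parse` module is used.

```python
from urllib.parse import urljoin, urldefrag


def build_seed_list(base_url, collected_urls):
    """Resolve manually collected URLs against base_url and deduplicate them.

    base_url is the page on which the URLs were observed (hypothetical here);
    collected_urls may mix relative and absolute forms.
    """
    seeds = []
    seen = set()
    for raw in collected_urls:
        raw = raw.strip()
        if not raw:
            continue  # skip blank entries from the manual list
        # Make the URL absolute, then drop any #fragment part.
        absolute, _fragment = urldefrag(urljoin(base_url, raw))
        if absolute not in seen:
            seen.add(absolute)
            seeds.append(absolute)
    return seeds


# Hypothetical example: URLs noted by hand while browsing a Flash site.
base = "http://example.org/site/index.html"
collected = [
    "movies/intro.swf",
    "../data/menu.xml",
    "http://example.org/site/movies/intro.swf#top",
    "  ",
    "/assets/clip.flv",
]
seeds = build_seed_list(base, collected)
for url in seeds:
    print(url)
```

The resulting list contains one absolute URL per line and could then be pasted into a harvest definition as seeds.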