The NetarchiveSuite software was originally developed by the two national deposit libraries in Denmark, the Royal Library (KB) and the State and University Library (SB), and has been running in production, harvesting the Danish part of the World Wide Web, since 2005. The French National Library (BnF) and the Austrian National Library (ONB) joined the project in 2008.

The NetarchiveSuite is a complete web archiving software package that has been under development since 2004. Its primary function is to plan, schedule and run web harvests of parts of the Internet. It scales to a wide range of tasks, from small thematic harvests (e.g. related to special events or special domains) to harvesting and archiving the content of an entire national domain. The software has built-in bit preservation functionality. The system architecture allows the software to be distributed among several machines, possibly in more than one geographical location. The NetarchiveSuite is built around the Heritrix web crawler, which it uses to harvest the web.


Issue tracker
Continuous integration
Browse source code
Code analysis (Pending Sonar enablement)
Releases/Nightly (Pending Maven enablement)
Maven site (Pending Maven enablement)
Code coverage
Maven reports

Want to join the NetarchiveSuite project?

Sign up to SBForge (top left corner) and send a request for participation.

