...

Status of the production sites

Netarkivet


We are still reorganizing the selective crawls. The strategy is:

  • Extension of the selective crawls and smaller broad crawls:
    • We now collect all national Danish news media selectively, both newspaper websites and online-only news media.
    • We are investigating all local news media in order to decide the frequency and depth of future crawls.
    • We made a first crawl of university repositories (with OAI extraction).

Heritrix 3 is not able to archive Facebook profiles, but Archive-It is able to collect them through an API. We will therefore collect about 100 representative open Facebook profiles with Archive-It. At the moment we are selecting the profiles.

We are working on compressing our archive.

BnF


ONB


BNE


...