Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Status of the production sites

Netarkivet

Panel
  1. Systems
    1. Overall, our systems work
    2. Firewall / network upgrade and consequent crashes in the Network Archive have filled a lot. ITD is working at high pressure to get a stable infrastructure in KBH, but we will probably have to enter 2021 before everything has been renewed and updated.
    3. Heritrix IIPC standard is in production
    4. SolrWayback is soon on its way into production and in a new updated version.

  2. Who uses our systems
    1. Browsing in the Online Archive:
      1. Statistics 6 months back: At least 1 external (but issues with seeing how many) and correspondingly 13 internal (for QA, development and much more).
    2. 40+ external user have access to our systems
    3. Delivery of data from Netarkivet takes place on an ongoing basis and I only expect it to be more comprehensive in the future.

  3. Collection
    1. Netarkivet has made a great effort in relation to Corona event harvesting
    2. Heritrix standard version may mean more efficient harvests from the 2nd cross-sectional harvest 2020, which is underway.
    3. Much of the interesting content at the moment. requires manual flows: Facebook (especially when it comes to comments)
    4. Still an issue to get certain types of dynamically generated content. Until we have other solutions e.g. browser-based harvesting uses Archive-IT and various work arounds (eg use of XML sitemaps that give us the URLs Heritrix does not immediately see).
    5. We are looking at how / if we can get Warc files from webrecorder / conifer.org in Netarkivet. It looks promising.

  4. Preservation
    1. We are well underway with the major projects in relation to DKM-077 - one online copy (Closed and part of DKM-085) and DKM-085 - Bitmagasine. Schedules have been made and special work is being done to refine the cutover process.
  5. Access
    1. Solrwayback on the way in production. Internal and external rejoice. It looks promising.

  6. Organization
    1. It goes well. Our more agile approach with daily stand-up meetings, review, retrospective and planning with a small group of Netarkiv people, provides added value.

  7. Cooperation
    1. More and more interest from external. Several from the Netarkivet have participated in a seminar on research in the Netarkivet for researchers at KU (KUB)
    2. I think there will be a higher demand for Netarkivet's content in the future.
    3. 40+ external users the last years

  8. The future
    1. NAS development and/vs. external solutions (decision proposals must be made)
    2. Webdanica analysis
    3. BcWeb
    4. SolrWayback further development / project setting.
    5. Workshop with ITU / DKU in relation to our access solutions - PyWb for playback of the Netarkivet in tandem with SolrWayback - IIPC will in future support 1 solution: PyWb (instead of Open Wayback as before)
    6. More dialogue with researchers about their wishes in relation to e.g. access solutions
    7. A long way to go to get enough resources / competencies in relation to the Network Archive's tasks (ITU + DKU). Part of 5-year plan.

BnF

Panel


ONB

Panel


BNE

Panel


KB-Sweden

...