...

Our main focus is on the following:

  • Solving problems with the broad crawl: We started step 1 of our third broad crawl for 2018 on 25 August and had many hanging jobs. Developers helped us solve part of the problem, but some jobs are still hanging.
  • Testing Wayback access with different browsers: We found that browsers differ in how much content they replay from the Wayback archive. We tested IE, Chrome, and Edge; Edge replayed content (e.g. images) best.
  • Event harvest with BCWeb: We finished the event crawl for the official commemoration day for Danish soldiers who had been deployed in war or conflict zones; only some documentation remains to be done. We did this crawl in collaboration with archivists from the National Archive. The fact that the hard-coded schedules do not match the figures in the configurations on NAS domain pages caused some confusion. This was our first collection collaboration using BCWeb (besides the pilot project).
  • We have implemented Umbra in our test environment and are looking forward to the results.
  • Special crawl for the manhunt by Danish police: Police closed off almost every part of Denmark on 28 September (no ferries, no flights abroad). We primarily crawled foreign articles on the event.
  • Kim Larsen, a Danish rock musician known by almost every Dane, died last Saturday, 29 September. We added a hashtag to our Twitter crawls; other content on his death is captured by our selective news media crawls.

BnF

ONB

  • We are still running our domain crawl for this year. We are in the middle of the first stage (10 MB).
  • We are also doing test crawls of new seeds for the next run of our half-yearly Woman/Gender Crawl, which starts next month.



BNE

KB-Sweden

Next meetings

...