0-effort Ingest

Problem: Getting stuff recognized by our systems take way to long

 

Current state of most collections: Limbo

Files on disk

 Disk is backed up

Files are checksummed

Metadata reside somewhere and referenced in the collection document

New Way

All known collections are created as Bitrepository collections right now.

Files are put in the Bitrepository, rather than Limbo

In addition to checksumming, they are analysed with Tika or Fits or similar.

Create doms collection corresponding to bitrepository collection

Attach metadata sources to doms collection

Create file object for each file in the collection

 

End result