Started 3 yr 1 mo ago

Not built Build NetarchiveSuite - harvester - core (08-Mar-2021 08:20:32)

Changes
  1. fix for https://sbforge.org/jira/browse/NAS-2790 (commit: 834a5b322c067b1e83204b0865c5f520286dfd1c) (detail)
  2. Get seedlist with/without alpha order (commit: 2e16d80f4f6239ec7a437d60822ea697fde029a6) (detail)
  3. UI fixes (commit: 69c20c76dbe8ca697cb623d14596552c03cd676b) (detail)
  4. Add comment for crawler trap in crawler-beans (commit: 22d62b1629ed0c6b25a4e8589fd87acb5c0caddc) (detail)
  5. [maven-release-plugin] prepare release netarchivesuite-5.6 (commit: 7b15d7abf1ed8a11f000177058f3acbea23ab29f) (detail)
  6. [maven-release-plugin] prepare for next development iteration (commit: 97ebda03303de3f69cd2656ced3de95af6226083) (detail)
  7. Updated version to a unique name (commit: e3b328a3cca883f6396a2955ef28bb0e2c7d2300) (detail)
  8. Added hadoop job for getting metadata lines from archive files and an (commit: 553e20659df3bb62c6d121d50c6effa3fc8947e9) (detail)
  9. Added filehandling for GetMetadataArchiveMapper and small touch ups (commit: 24aaecbc74299fa9fda9191dfe510977aa027b8f) (detail)
  10. Small refactor of ArchiveFile/HadoopUtils, few touch ups and started on (commit: 0d880c6017572102b4bf24e60d87ea35a84e2470) (detail)
  11. 'Start' of https://sbprojects.statsbiblioteket.dk/jira/browse/NARK-1970 (commit: d52a6bfda1ed72ad4fc125356ae274f18e0de8c6) (detail)
  12. Integration of Hadoop dedup indexing with GetMetadataArchiveMapper now (commit: ca2c62d474caf14d80e8e1e8f3970a4582e84672) (detail)
  13. Cleaned up a few things in RawMetadataCache and refactored HadoopUtils (commit: d10211a994d936309d076e87f0ae9699d99f385e) (detail)
  14. Squashed commit of the following: (commit: 8d9adc2b50d996dfaa544528b34a7b6b96947d1e) (detail)
  15. Added pattern configuration constants in GetMetadataMapper (commit: 9c130776d86bffa43170028cad724353348ec8dc) (detail)
  16. Review https://sbforge.org/fisheye/cru/CR-NAS-385 changes (commit: 553c4afcb7ddf654b26cc4c9afa3c4cdc7c79197) (detail)
  17. Fixed up FileResolverRESTClient for review and refactored code to enable (commit: 0a31340c22213cb7707a5188ec83ded5143c22ce) (detail)
  18. Added cdx indexing for metadata files in CDXIndexer and proper testing (commit: 7000ae16f8936955299227260d601c5db7005b81) (detail)
  19. Got Hadoop replacement for ArchiveExtractCDXJob ready, refactored some (commit: 52c718231cc4eb4ba13f6161ace4f701aeb4b738) (detail)
  20. Added setting for new job input/output dirs and more logging (commit: 629996d5c70f12e916878cb48e1c234b932eedcd) (detail)
  21. Setting fix from review https://sbforge.org/jira/browse/NARK-1954 (commit: 5cb2bc46120e35bc9e1074ec1f135efd672d4b2a) (detail)
  22. Review changes https://sbforge.org/fisheye/cru/CR-NAS-393, changes to (commit: bf5e943440e3760f4e25662ffcf603c3a78c0b2e) (detail)
  23. Fixed SimpleFileResolver, refactored how Hadoop jobs can be started, and (commit: 987c230dc013d45aaac9554df0392043965e56a0) (detail)
  24. Added settings for new job and finished last refactoring parts (commit: dcee3b48afde25b3ab1ac42fa65adc64f672d91e) (detail)
  25. Made small fix/cleanup in crawl log mapper and added more documentation (commit: e052b35ecbe8d07c2a88e914d3202d863b57bf50) (detail)
  26. Squashed commit of the following: (commit: ab9b8860ca1f5323ca20cabf8a23c7ee01009bc8) (detail)
  27. Squashed commit of the following: (commit: 9687194f6e849461945a3f75bdd3906f128d71c8) (detail)
  28. First attempt at a kill switch that returns an empty index for dedups (commit: 08d62e8104de4fe99d49b71b4b7933e41987bb56) (detail)
  29. Second attempt using IndexReadyMessage (commit: 3199f61725d3badd01b8e26a1c0c295c7564cb09) (detail)
  30. Added some logging (commit: aea04138a7ce030b1772456dee253c744b10453e) (detail)
  31. Further attempt (commit: 701b2c647c674bc72091877fbd6ab2bd8e989ca9) (detail)
  32. Further attempt using IndexReadyMessage (commit: ca9377f522312bdb2babb2c33e664becd1fbcf81) (detail)
  33. Back to reply (commit: ddb5dd34fc6b691f0318b0965dc09b51f3da9f66) (detail)
  34. Added a bit more logging. (commit: e4c67af253358e93ca41e108cfefff2072a6d9fa) (detail)
  35. Removed potential error when requesting empty cache (commit: f6a7d91cbb5b2f3feaf42fc1b7e83ada5f2bb73a) (detail)
  36. Clean-up (commit: f0f4a71edd0773f7fd35d30bfb5f80abe93057eb) (detail)
  37. Refactoring to make MetadataIndexingApplication closer to a reusable (commit: 48366f7a72262d6f1442b635b466a2b929b9bcd3) (detail)