SuccessChanges

Summary

  1. Updated version to a unique name (commit: e3b328a3cca883f6396a2955ef28bb0e2c7d2300) (details)
  2. Added hadoop job for getting metadata lines from archive files and an (commit: 553e20659df3bb62c6d121d50c6effa3fc8947e9) (details)
  3. Added filehandling for GetMetadataArchiveMapper and small touch ups (commit: 24aaecbc74299fa9fda9191dfe510977aa027b8f) (details)
  4. Small refactor of ArchiveFile/HadoopUtils, few touch ups and started on (commit: 0d880c6017572102b4bf24e60d87ea35a84e2470) (details)
  5. 'Start' of https://sbprojects.statsbiblioteket.dk/jira/browse/NARK-1970 (commit: d52a6bfda1ed72ad4fc125356ae274f18e0de8c6) (details)
  6. Integration of Hadoop dedup indexing with GetMetadataArchiveMapper now (commit: ca2c62d474caf14d80e8e1e8f3970a4582e84672) (details)
  7. Cleaned up a few things in RawMetadataCache and refactored HadoopUtils (commit: d10211a994d936309d076e87f0ae9699d99f385e) (details)
  8. Squashed commit of the following: (commit: 8d9adc2b50d996dfaa544528b34a7b6b96947d1e) (details)
  9. Added pattern configuration constants in GetMetadataMapper (commit: 9c130776d86bffa43170028cad724353348ec8dc) (details)
  10. Review https://sbforge.org/fisheye/cru/CR-NAS-385 changes (commit: 553c4afcb7ddf654b26cc4c9afa3c4cdc7c79197) (details)
  11. Fixed up FileResolverRESTClient for review and refactored code to enable (commit: 0a31340c22213cb7707a5188ec83ded5143c22ce) (details)
  12. Added cdx indexing for metadata files in CDXIndexer and proper testing (commit: 7000ae16f8936955299227260d601c5db7005b81) (details)
  13. Got Hadoop replacement for ArchiveExtractCDXJob ready, refactored some (commit: 52c718231cc4eb4ba13f6161ace4f701aeb4b738) (details)
  14. Added setting for new job input/output dirs and more logging (commit: 629996d5c70f12e916878cb48e1c234b932eedcd) (details)
  15. Setting fix from review https://sbforge.org/jira/browse/NARK-1954 (commit: 5cb2bc46120e35bc9e1074ec1f135efd672d4b2a) (details)
  16. Review changes https://sbforge.org/fisheye/cru/CR-NAS-393, changes to (commit: bf5e943440e3760f4e25662ffcf603c3a78c0b2e) (details)
  17. Fixed SimpleFileResolver, refactored how Hadoop jobs can be started, and (commit: 987c230dc013d45aaac9554df0392043965e56a0) (details)
  18. Added settings for new job and finished last refactoring parts (commit: dcee3b48afde25b3ab1ac42fa65adc64f672d91e) (details)
  19. Made small fix/cleanup in crawl log mapper and added more documentation (commit: e052b35ecbe8d07c2a88e914d3202d863b57bf50) (details)
  20. Squashed commit of the following: (commit: ab9b8860ca1f5323ca20cabf8a23c7ee01009bc8) (details)
  21. Squashed commit of the following: (commit: 9687194f6e849461945a3f75bdd3906f128d71c8) (details)
  22. First attempt at a kill switch that returns an empty index for dedups (commit: 08d62e8104de4fe99d49b71b4b7933e41987bb56) (details)
  23. Second attempt using IndexReadyMessage (commit: 3199f61725d3badd01b8e26a1c0c295c7564cb09) (details)
  24. Added some logging (commit: aea04138a7ce030b1772456dee253c744b10453e) (details)
  25. Further attempt (commit: 701b2c647c674bc72091877fbd6ab2bd8e989ca9) (details)
  26. Further attempt using IndexReadyMessage (commit: ca9377f522312bdb2babb2c33e664becd1fbcf81) (details)
  27. Back to reply (commit: ddb5dd34fc6b691f0318b0965dc09b51f3da9f66) (details)
  28. Added a bit more logging. (commit: e4c67af253358e93ca41e108cfefff2072a6d9fa) (details)
  29. Removed potential error when requesting empty cache (commit: f6a7d91cbb5b2f3feaf42fc1b7e83ada5f2bb73a) (details)
  30. Clean-up (commit: f0f4a71edd0773f7fd35d30bfb5f80abe93057eb) (details)
  31. Refactoring to make MetadataIndexingApplication closer to a reusable (commit: 48366f7a72262d6f1442b635b466a2b929b9bcd3) (details)
  32. Initial version using fileresolver (commit: f212546a060f7271527d9d722308241d60e1b720) (details)
  33. Added explicit jersey-server dep. to GUI. (commit: 06f48b57ed3030f29f067d2abae08d74bfab1f98) (details)
  34. Added a necessary filtering stage to match only current collection (commit: 5501fca44675886868cd1672489a9000f4a15c97) (details)
  35. Tidying up for review. (commit: e68e482d4385d1ac0a0c8aac903055e1b23eaff3) (details)
  36. Forcing HadoopJobStrategy to use hdfs (commit: bafdb18e7a4b7667e765882e8a1205de182f3c91) (details)
  37. Forcing HadoopJobStrategy to use hdfs (commit: 6adbe99384b5754740a0cc4e6359ad3d1cc4e9ea) (details)
  38. Added harvester-core to uber jar (commit: 2214312e8db00065089f6ee1da3811cff20a181f) (details)
  39. Moved Kerberos logins (commit: d022a62c4c855acd8a042db1a240ea515a063d7f) (details)
  40. Small fixes and revert (commit: ba7b6362e8dc63a0711bdc4a89c957ebca15a6bd) (details)
  41. Readded Kerberos login to IndexRequestServer (commit: 9e06be74dd416fc3e0c6036fcc457796b4547e53) (details)
  42. collectionID setting fix to always default to env name when unset (commit: a114fff89a4046d8084f41be0a25f599dd785575) (details)
  43. Follow up to own review comments (commit: 681d1147f7824aa7a18804c4b1d9475ce9c26c19) (details)
Commit e3b328a3cca883f6396a2955ef28bb0e2c7d2300 by Colin Rosenthal (csr)
Updated version to a unique name
(commit: e3b328a3cca883f6396a2955ef28bb0e2c7d2300)
The file was modifieddeploy/pom.xml
The file was modifiedcommon/pom.xml
The file was modifieddeploy/distribution/pom.xml
The file was modifiedintegration-test/pom.xml
The file was modifiedintegration-test/system-test/pom.xml
The file was modifieddeploy/deploy-core/pom.xml
The file was modifiedharvester/heritrix3/heritrix3-controller/pom.xml
The file was modifiedmonitor/pom.xml
The file was modifiedcommon/common-core/pom.xml
The file was modifiedharvester/harvestchannel-gui/pom.xml
The file was modifiedarchive/archive-test/pom.xml
The file was modifiedharvester/heritrix3/heritrix3-monitor/pom.xml
The file was modifiedwayback/pom.xml
The file was modifiedwayback/wayback-test/pom.xml
The file was modifiedcommon/common-test/pom.xml
The file was modifiedmonitor/status-gui/pom.xml
The file was modifiedharvester/harvest-scheduler/pom.xml
The file was modifiedarchive/bitpreservation-gui/pom.xml
The file was modifiedharvester/history-gui/pom.xml
The file was modifiedharvester/heritrix3/heritrix3-extensions/pom.xml
The file was modifiedharvester/heritrix3/pom.xml
The file was modifiedharvester/heritrix3/heritrix3-bundler/pom.xml
The file was modifiedharvester/qa-gui/pom.xml
The file was modifiedarchive/archive-core/pom.xml
The file was modifiedharvester/harvester-test/pom.xml
The file was modifiedmonitor/monitor-test/pom.xml
The file was modifiedarchive/pom.xml
The file was modifiedwayback/wayback-resourcestore/pom.xml
The file was modifieddeploy/deploy-test/pom.xml
The file was modifiedharvester/pom.xml
The file was modifiedcommon/netarchivesuite-test-utils/pom.xml
The file was modifiedmonitor/monitor-core/pom.xml
The file was modifiedwayback/wayback-indexer/pom.xml
The file was modifiedbuild-tools/pom.xml
The file was modifiedharvester/harvestdefinition-gui/pom.xml
The file was modifiedharvester/harvester-core/pom.xml
The file was modifiedpom.xml
Commit 553e20659df3bb62c6d121d50c6effa3fc8947e9 by Rasmus Bohl Kristensen (rbkr)
Added hadoop job for getting metadata lines from archive files and an
integration test to go with it
(commit: 553e20659df3bb62c6d121d50c6effa3fc8947e9)
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/GetMetadataArchiveBatchJobTester.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataArchiveMapper.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedharvester/harvester-test/pom.xml
The file was addedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetaDataArchiveHadoopJobTester.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMap.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/archive/GetMetadataArchiveBatchJob.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJob.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was removedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXJob.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/TestInfo.java
Commit 24aaecbc74299fa9fda9191dfe510977aa027b8f by Rasmus Bohl Kristensen (rbkr)
Added filehandling for GetMetadataArchiveMapper and small touch ups
(commit: 24aaecbc74299fa9fda9191dfe510977aa027b8f)
The file was addedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataArchiveMapperTester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataArchiveMapper.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/harvesting/metadata/MetadataFile.java
The file was removedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetaDataArchiveHadoopJobTester.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/GetMetadataArchiveBatchJobTester.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/RawMetadataCacheTester.java
Commit 0d880c6017572102b4bf24e60d87ea35a84e2470 by Rasmus Bohl Kristensen (rbkr)
Small refactor of ArchiveFile/HadoopUtils, few touch ups and started on
metadata job
(commit: 0d880c6017572102b4bf24e60d87ea35a84e2470)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataArchiveMapper.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/HadoopUtils.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
Commit d52a6bfda1ed72ad4fc125356ae274f18e0de8c6 by Rasmus Bohl Kristensen (rbkr)
'Start' of https://sbprojects.statsbiblioteket.dk/jira/browse/NARK-1970
(commit: d52a6bfda1ed72ad4fc125356ae274f18e0de8c6)
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/SimpleFileResolver.java
The file was removedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/FileResolver.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/GetMetadataArchiveBatchJobTester.java
The file was removedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/SimpleFileResolver.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMap.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/FileResolver.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/HadoopUtils.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataArchiveMapperTester.java
Commit ca2c62d474caf14d80e8e1e8f3970a4582e84672 by Rasmus Bohl Kristensen (rbkr)
Integration of Hadoop dedup indexing with GetMetadataArchiveMapper now
works - still needs few tweaks though
(commit: ca2c62d474caf14d80e8e1e8f3970a4582e84672)
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopFileUtils.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was removedcommon/common-core/src/main/java/dk/netarkivet/common/utils/HadoopUtils.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/SimpleFileResolver.java
Commit d10211a994d936309d076e87f0ae9699d99f385e by Rasmus Bohl Kristensen (rbkr)
Cleaned up a few things in RawMetadataCache and refactored HadoopUtils
into two separate classes
(commit: d10211a994d936309d076e87f0ae9699d99f385e)
The file was removedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMap.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was removedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataArchiveMapper.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataMapper.java
The file was addedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMapper.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopFileUtils.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was removedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataArchiveMapperTester.java
The file was addedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataMapperTester.java
Commit 8d9adc2b50d996dfaa544528b34a7b6b96947d1e by Rasmus Bohl Kristensen (rbkr)
Squashed commit of the following:
commit d10211a994d936309d076e87f0ae9699d99f385e Author: bohlski
<rbkr@kb.dk> Date:   Mon Sep 21 15:08:37 2020 +0200
    Cleaned up a few things in RawMetadataCache and refactored
HadoopUtils into two separate classes
commit ca2c62d474caf14d80e8e1e8f3970a4582e84672 Author: bohlski
<rbkr@kb.dk> Date:   Tue Sep 15 13:07:22 2020 +0200
    Integration of Hadoop dedup indexing with GetMetadataArchiveMapper
now works - still needs few tweaks though
commit d52a6bfda1ed72ad4fc125356ae274f18e0de8c6 Author: bohlski
<rbkr@kb.dk> Date:   Fri Sep 11 09:41:00 2020 +0200
    'Start' of
https://sbprojects.statsbiblioteket.dk/jira/browse/NARK-1970
commit 73ec57e3facac150ad9dccb85f46718a98456bc0 Merge: 0d880c601
57b380f2f Author: bohlski <rbkr@kb.dk> Date:   Thu Sep 3 13:37:28 2020
+0200
    Merge branch 'NARK-1882-hadoop-indexing' of
https://github.com/netarchivesuite/netarchivesuite into
NARK-1882-hadoop-indexing
commit 0d880c6017572102b4bf24e60d87ea35a84e2470 Author: bohlski
<rbkr@kb.dk> Date:   Thu Sep 3 13:37:24 2020 +0200
    Small refactor of ArchiveFile/HadoopUtils, few touch ups and started
on metadata job
commit 24aaecbc74299fa9fda9191dfe510977aa027b8f Author: bohlski
<rbkr@kb.dk> Date:   Tue Sep 1 14:30:03 2020 +0200
    Added filehandling for GetMetadataArchiveMapper and small touch ups
(commit: 8d9adc2b50d996dfaa544528b34a7b6b96947d1e)
The file was addedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataMapperTester.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was removedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMap.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataMapper.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was removedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/FileResolver.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/RawMetadataCacheTester.java
The file was addedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMapper.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/harvesting/metadata/MetadataFile.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/GetMetadataArchiveBatchJobTester.java
The file was removedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetaDataArchiveHadoopJobTester.java
The file was removedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataArchiveMapper.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/SimpleFileResolver.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/FileResolver.java
The file was removedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/SimpleFileResolver.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopFileUtils.java
The file was removedcommon/common-core/src/main/java/dk/netarkivet/common/utils/HadoopUtils.java
Commit 9c130776d86bffa43170028cad724353348ec8dc by Rasmus Bohl Kristensen (rbkr)
Added pattern configuration constants in GetMetadataMapper
(commit: 9c130776d86bffa43170028cad724353348ec8dc)
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMapper.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataMapper.java
Commit 553c4afcb7ddf654b26cc4c9afa3c4cdc7c79197 by Rasmus Bohl Kristensen (rbkr)
Review https://sbforge.org/fisheye/cru/CR-NAS-385 changes
(commit: 553c4afcb7ddf654b26cc4c9afa3c4cdc7c79197)
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopFileUtils.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataMapperTester.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMapper.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/FileResolver.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/SimpleFileResolver.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/GetMetadataMapper.java
The file was addedcommon/common-core/src/test/java/dk/netarkivet/common/utils/SimpleFileResolverTester.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXIndexer.java
Commit 0a31340c22213cb7707a5188ec83ded5143c22ce by Colin Rosenthal (csr)
Fixed up FileResolverRESTClient for review and refactored code to enable
its use via factory method
(commit: 0a31340c22213cb7707a5188ec83ded5143c22ce)
The file was modifiedcommon/common-core/src/test/java/dk/netarkivet/common/utils/FileResolverRESTClientTest.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was modifiedcommon/common-core/src/test/java/dk/netarkivet/common/utils/SimpleFileResolverTester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/FileResolverRESTClient.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/SimpleFileResolver.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/FileResolver.java
Commit 7000ae16f8936955299227260d601c5db7005b81 by Rasmus Bohl Kristensen (rbkr)
Added cdx indexing for metadata files in CDXIndexer and proper testing
that works
(commit: 7000ae16f8936955299227260d601c5db7005b81)
The file was addedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXMapperTester.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMapper.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXIndexer.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedwayback/wayback-indexer/pom.xml
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
Commit 52c718231cc4eb4ba13f6161ace4f701aeb4b738 by Rasmus Bohl Kristensen (rbkr)
Got Hadoop replacement for ArchiveExtractCDXJob ready, refactored some
stuff, fixed old bugs in CDXMapper and added more tests for it
(commit: 52c718231cc4eb4ba13f6161ace4f701aeb4b738)
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopFileUtils.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/batch/UrlCanonicalizerFactory.java
The file was modifiedharvester/harvester-core/pom.xml
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXIndexer.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXMapperTester.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
Commit 629996d5c70f12e916878cb48e1c234b932eedcd by Rasmus Bohl Kristensen (rbkr)
Added setting for new job input/output dirs and more logging
(commit: 629996d5c70f12e916878cb48e1c234b932eedcd)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
Commit 5cb2bc46120e35bc9e1074ec1f135efd672d4b2a by Rasmus Bohl Kristensen (rbkr)
Setting fix from review https://sbforge.org/jira/browse/NARK-1954
(commit: 5cb2bc46120e35bc9e1074ec1f135efd672d4b2a)
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/WaybackIndexer.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/FileNameHarvester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
Commit bf5e943440e3760f4e25662ffcf603c3a78c0b2e by Rasmus Bohl Kristensen (rbkr)
Review changes https://sbforge.org/fisheye/cru/CR-NAS-393, changes to
uber-jar set up and small pom fixes
(commit: bf5e943440e3760f4e25662ffcf603c3a78c0b2e)
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXMapper.java
The file was modifiedharvester/heritrix3/heritrix3-bundler/pom.xml
The file was modifiedharvester/heritrix3/heritrix3-extensions/pom.xml
The file was modifiedpom.xml
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was modifiedharvester/history-gui/pom.xml
The file was modifiedharvester/harvester-core/pom.xml
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXMapperTester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/DedupIndexer.java
The file was addedhadoop-uber-jar/pom.xml
The file was modifiedharvester/harvest-scheduler/pom.xml
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was addedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/MetadataCDXMapper.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/hadoop/CDXIndexer.java
The file was modifiedwayback/wayback-indexer/pom.xml
Commit 987c230dc013d45aaac9554df0392043965e56a0 by Rasmus Bohl Kristensen (rbkr)
Fixed SimpleFileResolver, refactored how Hadoop jobs can be started, and
implemented getCrawlLogLinesMatchingRegexp
(commit: 987c230dc013d45aaac9554df0392043965e56a0)
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXMapperTester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/batch/FileBatchJob.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobTool.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/SimpleFileResolver.java
The file was addedharvester/harvester-test/src/test/java/dk/netarkivet/viewerproxy/webinterface/hadoop/MetadataCDXMapperTester.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/CrawlLogLinesMatchingRegexp.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/viewerproxy/webinterface/TestInfo.java
The file was modifiedcommon/common-core/src/test/java/dk/netarkivet/common/utils/SimpleFileResolverTester.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJob.java
The file was addedharvester/harvester-test/src/test/java/dk/netarkivet/viewerproxy/webinterface/hadoop/CrawlLogExtractionMapperTester.java
The file was addedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/CrawlLogExtractionMapper.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/JobType.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataMapperTester.java
Commit dcee3b48afde25b3ab1ac42fa65adc64f672d91e by Rasmus Bohl Kristensen (rbkr)
Added settings for new job and finished last refactoring parts
(commit: dcee3b48afde25b3ab1ac42fa65adc64f672d91e)
The file was addedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/CrawlLogExtractionStrategy.java
The file was addedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/MetadataCDXExtractionStrategy.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was removedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/JobType.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJob.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobStrategy.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/MetadataExtractionStrategy.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
Commit e052b35ecbe8d07c2a88e914d3202d863b57bf50 by Rasmus Bohl Kristensen (rbkr)
Made small fix/cleanup in crawl log mapper and added more documentation
(commit: e052b35ecbe8d07c2a88e914d3202d863b57bf50)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/CrawlLogExtractionMapper.java
Commit ab9b8860ca1f5323ca20cabf8a23c7ee01009bc8 by Rasmus Bohl Kristensen (rbkr)
Squashed commit of the following:
commit 92109a0ad5ddb9238d593fbce21023ae05804b16 Author: bohlski
<rbkr@kb.dk> Date:   Mon Dec 7 11:41:48 2020 +0100
    Small changes from review https://sbforge.org/fisheye/cru/CR-NAS-395
commit e052b35ecbe8d07c2a88e914d3202d863b57bf50 Author: bohlski
<rbkr@kb.dk> Date:   Wed Dec 2 15:05:48 2020 +0100
    Made small fix/cleanup in crawl log mapper and added more
documentation
commit dcee3b48afde25b3ab1ac42fa65adc64f672d91e Author: bohlski
<rbkr@kb.dk> Date:   Wed Dec 2 14:07:18 2020 +0100
    Added settings for new job and finished last refactoring parts
commit 987c230dc013d45aaac9554df0392043965e56a0 Author: bohlski
<rbkr@kb.dk> Date:   Mon Nov 30 11:56:25 2020 +0100
    Fixed SimpleFileResolver, refactored how Hadoop jobs can be started,
and implemented getCrawlLogLinesMatchingRegexp
(commit: ab9b8860ca1f5323ca20cabf8a23c7ee01009bc8)
The file was addedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/CrawlLogExtractionMapper.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/MetadataExtractionStrategy.java
The file was addedharvester/harvester-test/src/test/java/dk/netarkivet/viewerproxy/webinterface/hadoop/MetadataCDXMapperTester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/batch/FileBatchJob.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/SimpleFileResolver.java
The file was modifiedcommon/common-core/src/test/java/dk/netarkivet/common/utils/SimpleFileResolverTester.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/CrawlLogLinesMatchingRegexp.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was addedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/CrawlLogExtractionStrategy.java
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXMapperTester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJob.java
The file was addedharvester/harvester-test/src/test/java/dk/netarkivet/viewerproxy/webinterface/hadoop/CrawlLogExtractionMapperTester.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataMapperTester.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was addedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/MetadataCDXExtractionStrategy.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/viewerproxy/webinterface/TestInfo.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobStrategy.java
The file was addedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobTool.java
Commit 9687194f6e849461945a3f75bdd3906f128d71c8 by Rasmus Bohl Kristensen (rbkr)
Squashed commit of the following:
commit a738bebf1170c0527ccbbc2925c4e15515ed28ab Author: bohlski
<rbkr@kb.dk> Date:   Mon Jan 4 14:31:28 2021 +0100
    Changes from review https://sbforge.org/fisheye/cru/CR-NAS-396
commit 3e922d974c296333cd469778c91f6bad0174270d Author: bohlski
<rbkr@kb.dk> Date:   Wed Dec 9 11:15:02 2020 +0100
    Small addition to error message on invalid base url
commit 16706d0129abb1caed2a9bdaa6df6b7b087ed895 Author: bohlski
<rbkr@kb.dk> Date:   Tue Dec 8 16:36:59 2020 +0100
    Modified the bitmag arcrepositoryclient get() to now request records
through the warc record service.
commit 9a92eca3bdfd3b6546a9a8569433b270f4f938cf Author: bohlski
<rbkr@kb.dk> Date:   Mon Dec 7 15:00:49 2020 +0100
    Fixed missing settings documentation from NARK-1998
(commit: 9687194f6e849461945a3f75bdd3906f128d71c8)
The file was modifiedintegration-test/system-test/src/test/java/dk/netarkivet/systemtest/NASSystemUtil.java
The file was modifiedcommon/common-core/src/main/resources/dk/netarkivet/common/settings.xml
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedquickstart/deploy_standalone_bitmag.xml
The file was removedarchive/archive-core/src/main/java/dk/netarkivet/archive/arcrepository/distribute/JMSBitmagArcRepositoryClient.java
The file was addedcommon/common-core/src/main/resources/dk/netarkivet/common/distribute/arcrepository/bitrepository/BitmagArcRepositoryClientSettings.xml
The file was removedcommon/common-core/src/main/resources/dk/netarkivet/common/distribute/arcrepository/bitrepository/JmsBitmagArcRepositoryClientSettings.xml
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/MetadataCDXMapper.java
The file was addedarchive/archive-core/src/main/java/dk/netarkivet/archive/arcrepository/distribute/BitmagArcRepositoryClient.java
Commit 08d62e8104de4fe99d49b71b4b7933e41987bb56 by Colin Rosenthal (csr)
First attempt at a kill switch that returns an empty index for dedups
(commit: 08d62e8104de4fe99d49b71b4b7933e41987bb56)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit 3199f61725d3badd01b8e26a1c0c295c7564cb09 by Colin Rosenthal (csr)
Second attempt using IndexReadyMessage
(commit: 3199f61725d3badd01b8e26a1c0c295c7564cb09)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit aea04138a7ce030b1772456dee253c744b10453e by Colin Rosenthal (csr)
Added some logging
(commit: aea04138a7ce030b1772456dee253c744b10453e)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit 701b2c647c674bc72091877fbd6ab2bd8e989ca9 by Colin Rosenthal (csr)
Further attempt
(commit: 701b2c647c674bc72091877fbd6ab2bd8e989ca9)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit ca9377f522312bdb2babb2c33e664becd1fbcf81 by Colin Rosenthal (csr)
Further attempt using IndexReadyMessage
(commit: ca9377f522312bdb2babb2c33e664becd1fbcf81)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit ddb5dd34fc6b691f0318b0965dc09b51f3da9f66 by Colin Rosenthal (csr)
Back to reply
(commit: ddb5dd34fc6b691f0318b0965dc09b51f3da9f66)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit e4c67af253358e93ca41e108cfefff2072a6d9fa by Colin Rosenthal (csr)
Added a bit more logging.
(commit: e4c67af253358e93ca41e108cfefff2072a6d9fa)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit f6a7d91cbb5b2f3feaf42fc1b7e83ada5f2bb73a by Colin Rosenthal (csr)
Removed potential error when requesting empty cache
(commit: f6a7d91cbb5b2f3feaf42fc1b7e83ada5f2bb73a)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit f0f4a71edd0773f7fd35d30bfb5f80abe93057eb by Colin Rosenthal (csr)
Clean-up
(commit: f0f4a71edd0773f7fd35d30bfb5f80abe93057eb)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit 48366f7a72262d6f1442b635b466a2b929b9bcd3 by Colin Rosenthal (csr)
Refactoring to make MetadataIndexingApplication closer to a reusable
real-world case.
(commit: 48366f7a72262d6f1442b635b466a2b929b9bcd3)
The file was modifiedhadoop-uber-jar/src/main/java/MetadataIndexingApplication.java
The file was modifiedhadoop-uber-jar-invoker/src/main/resources/run.sh
The file was addedhadoop-uber-jar-invoker/src/main/resources/input.txt
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was modifiedpom.xml
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was modifiedcommon/common-core/src/main/resources/dk/netarkivet/common/settings.xml
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
Commit f212546a060f7271527d9d722308241d60e1b720 by Colin Rosenthal (csr)
Initial version using fileresolver
(commit: f212546a060f7271527d9d722308241d60e1b720)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
Commit 06f48b57ed3030f29f067d2abae08d74bfab1f98 by Colin Rosenthal (csr)
Added explicit jersey-server dep. to GUI.
(commit: 06f48b57ed3030f29f067d2abae08d74bfab1f98)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedcommon/common-core/pom.xml
Commit 5501fca44675886868cd1672489a9000f4a15c97 by Colin Rosenthal (csr)
Added a necessary filtering stage to match only current collection
(commit: 5501fca44675886868cd1672489a9000f4a15c97)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
Commit e68e482d4385d1ac0a0c8aac903055e1b23eaff3 by Colin Rosenthal (csr)
Tidying up for review.
(commit: e68e482d4385d1ac0a0c8aac903055e1b23eaff3)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/distribute/bitrepository/action/getfile/GetFileAction.java
The file was modifiedarchive/archive-core/src/main/java/dk/netarkivet/archive/arcrepository/distribute/BitmagArcRepositoryClient.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/distribute/bitrepository/action/getfile/GetFileEventHandler.java
Commit bafdb18e7a4b7667e765882e8a1205de182f3c91 by Colin Rosenthal (csr)
Forcing HadoopJobStrategy to use hdfs
(commit: bafdb18e7a4b7667e765882e8a1205de182f3c91)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
Commit 6adbe99384b5754740a0cc4e6359ad3d1cc4e9ea by Colin Rosenthal (csr)
Forcing HadoopJobStrategy to use hdfs
(commit: 6adbe99384b5754740a0cc4e6359ad3d1cc4e9ea)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
Commit 2214312e8db00065089f6ee1da3811cff20a181f by Colin Rosenthal (csr)
Added harvester-core to uber jar
(commit: 2214312e8db00065089f6ee1da3811cff20a181f)
The file was modifiedhadoop-uber-jar/pom.xml
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
Commit d022a62c4c855acd8a042db1a240ea515a063d7f by Rasmus Bohl Kristensen (rbkr)
Moved Kerberos logins
(commit: d022a62c4c855acd8a042db1a240ea515a063d7f)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/ArchiveFile.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedharvester/harvester-test/src/test/java/dk/netarkivet/harvester/indexserver/hadoop/GetMetadataMapperTester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/BasicTwoWaySSLProvider.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/WaybackIndexer.java
Commit ba7b6362e8dc63a0711bdc4a89c957ebca15a6bd by Rasmus Bohl Kristensen (rbkr)
Small fixes and revert
(commit: ba7b6362e8dc63a0711bdc4a89c957ebca15a6bd)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/RawMetadataCache.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/hadoop/MetadataCDXExtractionStrategy.java
Commit 9e06be74dd416fc3e0c6036fcc457796b4547e53 by Rasmus Bohl Kristensen (rbkr)
Readded Kerberos login to IndexRequestServer
(commit: 9e06be74dd416fc3e0c6036fcc457796b4547e53)
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
Commit a114fff89a4046d8084f41be0a25f599dd785575 by Rasmus Bohl Kristensen (rbkr)
collectionID setting fix to always default to env name when unset
(commit: a114fff89a4046d8084f41be0a25f599dd785575)
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/service/CGIRequestBuilder.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/WaybackIndexer.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/harvester/indexserver/distribute/IndexRequestServer.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/distribute/bitrepository/BitmagUtils.java
The file was modifiedarchive/archive-core/src/main/java/dk/netarkivet/archive/arcrepository/distribute/BitmagArcRepositoryClient.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/webinterface/GUIWebServer.java
The file was modifiedwayback/wayback-indexer/src/main/java/dk/netarkivet/wayback/indexer/FileNameHarvester.java
Commit 681d1147f7824aa7a18804c4b1d9475ce9c26c19 by Colin Rosenthal (csr)
Follow up to own review comments
(commit: 681d1147f7824aa7a18804c4b1d9475ce9c26c19)
The file was modifiedcommon/common-core/src/test/java/dk/netarkivet/common/utils/warc/WarcRecordClientTester.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/utils/hadoop/HadoopJobUtils.java
The file was modifiedcommon/common-core/src/main/java/dk/netarkivet/common/CommonSettings.java
The file was modifiedharvester/harvester-core/src/main/java/dk/netarkivet/viewerproxy/webinterface/Reporting.java
The file was modifiedhadoop-uber-jar/pom.xml
The file was modifiedwayback/wayback-indexer/src/test/java/dk/netarkivet/wayback/hadoop/CDXJobTest.java
The file was modifiedcommon/common-core/src/test/java/dk/netarkivet/common/utils/warc/WarcRecordClientTest.java