Changes
#142 (08-Mar-2021 08:16:55)
- fix for https://sbforge.org/jira/browse/NAS-2790 (commit: 834a5b322c067b1e83204b0865c5f520286dfd1c) — andreas.predikaka / githubweb
- Added a bitmagasin client configuration for quickstart. (commit: cbf6b00bf99a3548a3b18fa7dc0d67e53ad42b45) — Colin Rosenthal (csr) / githubweb
- Added a bitmagasin client configuration for quickstart. (commit: 3f8c34f8521b1cfe6d278f322500d1d02fe42970) — Colin Rosenthal (csr) / githubweb
- Added a bunch of logging. (commit: 0050b0b4e64137e3744cf5404e128fdc0b529511) — Colin Rosenthal (csr) / githubweb
- Added a bunch of logging. (commit: 40657dd874dafab228544e408aa6c094f6eb7215) — Colin Rosenthal (csr) / githubweb
- Fixed merge mess-up in pomfile. (commit: ff1c04542317064085a656fa5a13e8616d2f99f1) — Colin Rosenthal (csr) / githubweb
- Moved initialisation of bitrepository connection to first upload, so (commit: 0656feba923374c1de9d2a7267784530a83a6507) — Colin Rosenthal (csr) / githubweb
- Set CollectionID for upload to default to the NAS environment name. (commit: 51496b92545fe793eb11c297ea69e42ce968f0cb) — Colin Rosenthal (csr) / githubweb
- Added a check that the CollectionID is known to the bitrepository. (commit: 3e1e6d17582fc2d7692727e13f63c6477dfa3e6c) — Colin Rosenthal (csr) / githubweb
- Follow ups to https://sbforge.org/fisheye/cru/CR-NAS-370 (commit: 0e15e313cc0ffc98e75f5856fafb9c74acc4e44d) — Colin Rosenthal (csr) / githubweb
- Modified to handle case where bitarchive is a symbolic link. (commit: 7fc4cd52261be9f3332edbafed22e42691621480) — Colin Rosenthal (csr) / githubweb
- Follow-ups following code review: (commit: bcb1233a7ab6325d68fa64e34dfb51bf984e9641) — Colin Rosenthal (csr) / githubweb
- Document about simple test that narcana works (commit: a3bdc818c8afe3560edb176d07af6f4f9baea5c3) — Asger Askov Blekinge (abr) / githubweb
- Documented how to use narcana-suite01 (commit: cd2f3276325aa81f84b0e06e1f2c806e4ad6e564) — Asger Askov Blekinge (abr) / githubweb
- Most of the followup to https://sbforge.org/fisheye/cru/CR-NAS-372 (commit: f3a1ba688da30792c7846d519fc99bc6c448c30d) — Asger Askov Blekinge (abr) / githubweb
- Most of the followup nr 2 to https://sbforge.org/fisheye/cru/CR-NAS-372 (commit: 4cc7008f4620754fa82ca22dd2ae81bbe88625f1) — Asger Askov Blekinge (abr) / githubweb
- Partial followup on CR-NAS-372. (commit: 1197c122a2491839b401dc97122d644b31da1bf5) — Jonas Lindberg Frellesen (jolf) / githubweb
- Fixed method call (commit: d79ad774301a18f159cfa257a9f61c1f322c9d3d) — Jonas Lindberg Frellesen (jolf) / githubweb
- Further followup on CR-NAS-372. (commit: 8d4ebfd77f7f61844663c86e18df4668d4dc2534) — Jonas Lindberg Frellesen (jolf) / githubweb
- Merge branch 'bitmag' of /home/csr/projects/netarchivesuite with (commit: 1767d415911495676652ecf7ee9050cfda9a43a9) — Colin Rosenthal (csr) / githubweb
- Most of the followup nr 3 to https://sbforge.org/fisheye/cru/CR-NAS-372 (commit: b3cd9fcee1116c7eb651328059ef0bc1e7ba1797) — Asger Askov Blekinge (abr) / githubweb
- Some trivial documentation followups to (commit: f86fec56c346c383f94f3f5111be69d97143252e) — Colin Rosenthal (csr) / githubweb
- Followup on TestBitrepository stuff (commit: 374be4d1731ec21db2ccc30da2c6d12fe9285c86) — Jonas Lindberg Frellesen (jolf) / githubweb
- Fixes according to CR-NAS-373 (commit: 08c5a0110ef55abdbc806e49969199c8a3b007a4) — Jonas Lindberg Frellesen (jolf) / githubweb
- DKUPB-237: The structure is about right, but the final implementation is (commit: a6672bf630454d1b5aa1c6228ce0b9d0d6eb6ef0) — Knud Åge Hansen (kaah) / githubweb
- half-finished - should copy client config to all linux machines but (commit: 279e0a534b498de812907c53168597d42c850829) — Colin Rosenthal (csr) / githubweb
- Fully implemented distribution of client configuration with rest of (commit: 54b7309b9e94e39af4f4c32616b8af190381fa01) — Colin Rosenthal (csr) / githubweb
- Follow-ups after review (commit: 8525dd52b5d136bdc869ecc7cad9ff280617c3aa) — Colin Rosenthal (csr) / githubweb
- Fixed issue that -D params should not include dk.netarkivet (commit: 52c8672d5d12c60a7c087eed1dc54f0cd80535e9) — Colin Rosenthal (csr) / githubweb
- Fixed issue so bitmag config is sent to all linux machines. (commit: 61b04d8ab09decfe1b9560031cedb8c975b44149) — Colin Rosenthal (csr) / githubweb
- This class gives repeated problems so we are removing it from the test (commit: 9c6c5e6d325fc7ff931f27e7436be933bb5159b3) — Colin Rosenthal (csr) / githubweb
- Rebuilt complete settings. (commit: b4c4500ce4f060edef53f9029f7d41a70f9af7e2) — Colin Rosenthal (csr) / githubweb
- Disabled the failing tests instead of the entire test. (commit: 762fa2cd9f54cb8d88fa76227f090a67888c6285) — Jonas Lindberg Frellesen (jolf) / githubweb
- Re-added the tests which do not fail... Seems to fail when the test (commit: 7ea1ec2f0ad54d932377166a91d44de0ad5bf62a) — Jonas Lindberg Frellesen (jolf) / githubweb
- Added a startup hook to umbra harvests to allow cleanup of umbra prior (commit: 13599b102fcc614e6dee7c575786f68e22af492d) — Colin Rosenthal (csr) / githubweb
- Added environment variable BITMAGCONF to automatic test environment (commit: fff35a350d5eda32ef5b34484f5abd00b87a88ae) — Colin Rosenthal (csr) / githubweb
- Fixed test to recognise that there is no ChecksumApplication when using (commit: cdf880332ba3f999dbeebf0d1af5c3fe6648959c) — Colin Rosenthal (csr) / githubweb
- fix HarvestController when BindingException occurs (commit: 01b2292e383f418f40372de37f01a461ff177e6d) — clara.wiatrowski / githubweb
- Get seedlist with/without alpha order (commit: 2e16d80f4f6239ec7a437d60822ea697fde029a6) — clara.wiatrowski / githubweb
- UI fixes (commit: 69c20c76dbe8ca697cb623d14596552c03cd676b) — clara.wiatrowski / githubweb
- fix space in domain name (commit: 1cd0a433def0e6d6ea2cb8011b6a370bdd48add0) — clara.wiatrowski / githubweb
- Add comment for crawler trap in crawler-beans (commit: 22d62b1629ed0c6b25a4e8589fd87acb5c0caddc) — clara.wiatrowski / githubweb
- Updated to new matching Heritrix version (commit: dd7bf4369a7a0fb262a9ba00359b7684540c8fe4) — Colin Rosenthal (csr) / githubweb
- [maven-release-plugin] prepare release netarchivesuite-5.6 (commit: 7b15d7abf1ed8a11f000177058f3acbea23ab29f) — Colin Rosenthal (csr) / githubweb
- [maven-release-plugin] prepare for next development iteration (commit: 97ebda03303de3f69cd2656ced3de95af6226083) — Colin Rosenthal (csr) / githubweb
- Actually want to write requests and metadata by default in tests! (commit: 31fe76b19dc9d899c3c26ba8bdcf318806e89c26) — Colin Rosenthal (csr) / githubweb
- Actually want to write requests and metadata by default in tests! (commit: 3ed813811e34e2d2d86d7dfede9d9ea5a318328f) — Colin Rosenthal (csr) / githubweb
- Updated version to a unique name (commit: e3b328a3cca883f6396a2955ef28bb0e2c7d2300) — Colin Rosenthal (csr) / githubweb
- Quick fix to NARK-1819 (commit: 7a4be0771bb6221cfc11689154393b1536dbb940) — Colin Rosenthal (csr) / githubweb
- Non-functional arcrep for use in bitmag development (commit: 143645020442b9cbaa453401829ad7fcbeb1e11f) — Colin Rosenthal (csr) / githubweb
- poms updated with Hadoop and basic settings for it added in (commit: 1360e402889d881aee7e6a964c7847f2a96de0b5) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Small changes to settings (commit: a85a3ad520234cf8cedeeef058911ad47c304474) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Can now at least work with local bitmag, it seems (commit: 91bdce496fa2ebe20ee0eeb8ce97528a868505f1) — Rasmus Bohl Kristensen (rbkr) / githubweb
- FileNameHarvester now grabs list of files directly from bitmag. Added (commit: 6824e324e6ccd74a10740790c11968bd5803a312) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Indexing through hadoop instead of batch should now work for WARC files (commit: 982056795b045ed339987660c2d13dd7f41d1073) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Changes from review (commit: a4871eadd0dcfd71fe3bda54170c5f911ae3da88) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Fixed dependency conflict with hadoop-client package and finished (commit: e8735f0b8fa3800b012a7928e369cf358b5248d8) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Refactored Bitrepository to a singleton. (commit: e494e8d0482fde2bc44ee840b4e675dac00a43d5) — Colin Rosenthal (csr) / githubweb
- Small logging changes (commit: c091c483521bf77ea95b7450dd0aee6449f5ccac) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Bitrepository class changes (commit: 9a39a4b2aacdc9283f4c34b080693ca5a1ddfe92) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Dependency fix to avoid logging loop and small logging changes (commit: a709b310c9f6af51c6312e3ed3fec73a912e2622) — Rasmus Bohl Kristensen (rbkr) / githubweb
- WarcRecordClient.java and ApacheClientReaderFactory.java in (commit: d71da3e20c6df7f04c793f8b55f0fc6a028286d1) — Peter Christiansen (pech) / githubweb
- WarcRecordClient get and getFile changed (commit: 2b5e8b16b240a009f9ae1ee456350d70aaa821ac) — Peter Christiansen (pech) / githubweb
- Possible partial solution (commit: c81836a89060b2194e91860f82947d17a27fe8dc) — Colin Rosenthal (csr) / githubweb
- Fixed data file and added a little documentation (commit: 26339a4799be4c7ecff361b2be29861d8af2e2f8) — Colin Rosenthal (csr) / githubweb
- Added some integration tests for indexing on hadoop (commit: 96dfe55073b04b9cab36dbc4607c7496c877c2fd) — Colin Rosenthal (csr) / githubweb
- Removed the test which was less like the anticipated prod architecture (commit: 1ac65bb88c41f4f60214f728145a0c39ad035d46) — Colin Rosenthal (csr) / githubweb
- Tidied up the hadoop/cdx integration test (commit: b8eaa110fb2b6f1b8e2530776aa5c8d3e901273c) — Colin Rosenthal (csr) / githubweb
- Added an integration test for WarcRecordClient (commit: 64ac9178de086347e9335ba4b98316370ab699b8) — Colin Rosenthal (csr) / githubweb
- Added Readme file in empty directory (commit: c4da8b152e8fa7f10de0bd1fb48ed0868b7664eb) — Colin Rosenthal (csr) / githubweb
- Added Readme file in empty directory (commit: 96b73b68fa7e1301ab0328df8e0210d56a735c52) — Colin Rosenthal (csr) / githubweb
- Added a hdfs setting that seems relevant (commit: 60cd273673637ba210763dda8b107a7b96bef508) — Colin Rosenthal (csr) / githubweb
- WarcRecord fixes for WarcRecordClientTest and Tester (commit: 961cff06f0e4c6241fc550d476add1739b142f7f) — Peter Christiansen (pech) / githubweb
- Error fix (commit: 3cce599a2ccae8aa6ba1c26c75a33cd893ca6220) — Peter Christiansen (pech) / githubweb
- Made method for indexing with Hadoop that assumes direct access to input (commit: b227515615b8ca0bd8c8fe2fd34be189679549c8) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Dedup indexing (commit: f5508c47ebc69bc946ff16ca87218c949a59508a) — Colin Rosenthal (csr) / githubweb
- latest from pc (commit: 9cb0209af9f385e3527307c1dd08c4a78b588c4a) — Peter Christiansen (pech) / githubweb
- Moved getWarc from constructor to get (commit: 4e00feebd87d28c227a323907ab83cdaca0e143d) — Peter Christiansen (pech) / githubweb
- Code-maturation for cdx-indexing (commit: c87a69b57c851c9d90e12982a344da40f614bc74) — Colin Rosenthal (csr) / githubweb
- URI corrected to include filename. Not yet robust for files not in gzip (commit: 3148c07477ae875b0e29021e146090e7487312f7) — Peter Christiansen (pech) / githubweb
- Hardcoded finName for testing (commit: dda3889ee6740e4306cf84f8e0b7bbee557ee8ee) — Peter Christiansen (pech) / githubweb
- Hardcoded finName for testing (commit: dee3eaab03e37446ffd1be94b1783193a68bba4d) — Peter Christiansen (pech) / githubweb
- Attempt to avoid double-indexing (commit: e1c23281deb8bd3055c9f85508c5076076f37c32) — Colin Rosenthal (csr) / githubweb
- Now passes integration test. (commit: 65fd5e068a37d7040f4ef741dd35347d00136ce5) — Colin Rosenthal (csr) / githubweb
- Now returns correct record. (commit: 671d7f64ec0db29802bba897eddfdce56d026b49) — Colin Rosenthal (csr) / githubweb
- After a little cleanup (commit: dd9de321b98a8635ba8dd1948152469046b3985f) — Peter Christiansen (pech) / githubweb
- Initial work on FileResolver (commit: d411beb865c7dace665ae2709953857503a37c27) — Colin Rosenthal (csr) / githubweb
- After a little more cleanup, but before logs (commit: e7504646b6f369f362a96d9ddcf01e4df7266323) — Peter Christiansen (pech) / githubweb
- Added hadoop job for getting metadata lines from archive files and an (commit: 553e20659df3bb62c6d121d50c6effa3fc8947e9) — Rasmus Bohl Kristensen (rbkr) / githubweb
- latest update (commit: 5763a813e2ec0f0eb1e8ad50cefcafc4767f0455) — Peter Christiansen (pech) / githubweb
- Added filehandling for GetMetadataArchiveMapper and small touch ups (commit: 24aaecbc74299fa9fda9191dfe510977aa027b8f) — Rasmus Bohl Kristensen (rbkr) / githubweb
- added null response if http statuscode is not 200 (commit: 4804a0787d97a2a9c3802edb716d3d3c78753259) — Peter Christiansen (pech) / githubweb
- removed printlns and added logging for http exception (commit: db425bdcd25a943fdf3c59f741fb5eeb04da82b1) — Peter Christiansen (pech) / githubweb
- Added pattern-matching method to file-resolver (commit: 57b380f2f8c289544404aecbb81f6cef8a084274) — Colin Rosenthal (csr) / githubweb
- Small refactor of ArchiveFile/HadoopUtils, few touch ups and started on (commit: 0d880c6017572102b4bf24e60d87ea35a84e2470) — Rasmus Bohl Kristensen (rbkr) / githubweb
- added test methods for archive files and negative testing (commit: 085be39423a4881b64793f1328a73ba0286a3f3f) — Peter Christiansen (pech) / githubweb
- Changed test to use paths relative to module root (commit: 0602fcb9458cfa655d0cb39f98d004e720bc9e42) — Colin Rosenthal (csr) / githubweb
- Added tests (commit: ec684feb1b69f99952e68dd4e366289b45710ed6) — Peter Christiansen (pech) / githubweb
- test corrections excludes .gz (commit: aca94d36b025e95458e0f718bfd83b9d6876545a) — Peter Christiansen (pech) / githubweb
- Added a conf flag to switch between standard indexing and dedup indexing (commit: 2e2173b833c5c5ddd879e7548b96d875a06353a1) — Colin Rosenthal (csr) / githubweb
- 'Start' of https://sbprojects.statsbiblioteket.dk/jira/browse/NARK-1970 (commit: d52a6bfda1ed72ad4fc125356ae274f18e0de8c6) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Tiny settings change for NARK-1882 review (commit: 27a3d95258902008eda3d450c7261e3d694a4c10) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Integration of Hadoop dedup indexing with GetMetadataArchiveMapper now (commit: ca2c62d474caf14d80e8e1e8f3970a4582e84672) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Cleaned up a few things in RawMetadataCache and refactored HadoopUtils (commit: d10211a994d936309d076e87f0ae9699d99f385e) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Squashed commit of the following: (commit: 8d9adc2b50d996dfaa544528b34a7b6b96947d1e) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Added pattern configuration constants in GetMetadataMapper (commit: 9c130776d86bffa43170028cad724353348ec8dc) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Cleanup according to review (commit: 5b3c5fbb6202b4be581afacd06be50dbe3e0deb2) — Peter Christiansen (pech) / githubweb
- latest changes in getFile etc. (commit: 2c586ca6fd4c5b62a1df805f10f8bd1ac451b323) — Peter Christiansen (pech) / githubweb
- corrected (commit: 216184c3737c5b895f1eba74be84f5e4beab244e) — Peter Christiansen (pech) / githubweb
- A few final edits. (commit: e7fbf863417a855376671d375fbe0e4332b9f0ca) — Colin Rosenthal (csr) / githubweb
- 'Initial' commit (commit: c4553748c54d383e3f082fc10cf29b2ca4688ab3) — Rasmus Bohl Kristensen (rbkr) / githubweb
- First commit on arc_record branch (commit: d27f60647703e6333b0baa83ece150e2b1c238a5) — Peter Christiansen (pech) / githubweb
- Review https://sbforge.org/fisheye/cru/CR-NAS-385 changes (commit: 553c4afcb7ddf654b26cc4c9afa3c4cdc7c79197) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Added testing (commit: 779f5a0dc37edfd41a0dd30d1dc4dd2167c8e13b) — Peter Christiansen (pech) / githubweb
- added .arc test-files (commit: dfffbac038e84ceb95a38d42024b6eb2fd557fdf) — Peter Christiansen (pech) / githubweb
- Fixed dependency problem and added simple application class to run (commit: 016eb5f4073e02288b2acd38f940e0002db0642f) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Fixed get .arc-record with positive offset (commit: 66c495b026566a13658e39cf40e28120e91a31b8) — Peter Christiansen (pech) / githubweb
- Small refactor and implemented harvestRecentFilenames (commit: 199355bf63aba632f2dcb23c66de7778b794e97e) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Javadoc added to few files https://sbforge.org/fisheye/cru/CR-NAS-387 (commit: 18336a02e91749175a3a19b4a9fafaa181af053c) — Rasmus Bohl Kristensen (rbkr) / githubweb
- More review changes https://sbforge.org/fisheye/cru/CR-NAS-387 (commit: 274b0fb8819d98558b17b3fae93c4885ebb6200c) — Rasmus Bohl Kristensen (rbkr) / githubweb
- minor changes tests (commit: fc4184449c23c26d5a014544479b6673b024164c) — Peter Christiansen (pech) / githubweb
- Initial functioning FileResolverRESTClient (commit: c60029793838e1421e0eb97c3ca8aee8d2b32149) — Colin Rosenthal (csr) / githubweb
- Removed some old bitmag classes (commit: 4674413716dbf6fc8e322c47ab8c2fb89c259664) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Improved handling of try/catch logic (commit: 7a7cda6012b1ba4c6e7d0a1832e5c22b5133bb65) — Colin Rosenthal (csr) / githubweb
- Added some new tests and matured code ready for review (commit: b005ce2942cfab8a18e1389da311c32f2e5ac1ee) — Colin Rosenthal (csr) / githubweb
- Removed more old bitmag classes, refactored parts of some classes for (commit: dbf8703610bf19bfe044cf7f6f59a10710fdf7b4) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Fixed some old imports that made the compiler angry (commit: 48d011475ef9dc0480986dbab35e8266e3207f4f) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Fixed up FileResolverRESTClient for review and refactored code to enable (commit: 0a31340c22213cb7707a5188ec83ded5143c22ce) — Colin Rosenthal (csr) / githubweb
- Added more logging to FileNameHarvester (commit: b7120d70f3adf93ca6a12368f30897084a8a6295) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Small refactor to make ArchiveFile's collectHadoopResults use (commit: 358b6977ce8fc98987a32327d57815bd30c0f34a) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Latest bug fixes on loop testing (commit: f41a6bc3cf46940f48d0dbfca3121cd68679342a) — Peter Christiansen (pech) / githubweb
- Fixed bug with indexing threads sharing same filesystem instance (commit: d8c00a93115685b3357ec4202d7de727d76486fa) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Undo of file-change permissions. (commit: 0019993d36df0e7d3be07594ded1edfe0c2e101b) — Colin Rosenthal (csr) / githubweb
- Fixed bug with indexing threads sharing same filesystem instance (commit: e9969a932d57a7e301ec95365219e3a662c3b0c6) — Colin Rosenthal (csr) / githubweb
- Fixed handling of returning used client to pool (commit: cba38820408a11fa56fe247f7b1c9eab668c6ba2) — Colin Rosenthal (csr) / githubweb
- Added cdx indexing for metadata files in CDXIndexer and proper testing (commit: 7000ae16f8936955299227260d601c5db7005b81) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Got Hadoop replacement for ArchiveExtractCDXJob ready, refactored some (commit: 52c718231cc4eb4ba13f6161ace4f701aeb4b738) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Added setting for new job input/output dirs and more logging (commit: 629996d5c70f12e916878cb48e1c234b932eedcd) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Setting fix from review https://sbforge.org/jira/browse/NARK-1954 (commit: 5cb2bc46120e35bc9e1074ec1f135efd672d4b2a) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Tidied up logic in client and tests (commit: 6996761e42e6f09685269eee0b656c2686cbc2d6) — Colin Rosenthal (csr) / githubweb
- Review changes https://sbforge.org/fisheye/cru/CR-NAS-393, changes to (commit: bf5e943440e3760f4e25662ffcf603c3a78c0b2e) — Rasmus Bohl Kristensen (rbkr) / githubweb
- FileResolverRESTClient now sends collectionId as an extra query (commit: 4327720c7867c82e5f3533a23657a5cd16149eba) — Colin Rosenthal (csr) / githubweb
- Fixed SimpleFileResolver, refactored how Hadoop jobs can be started, and (commit: 987c230dc013d45aaac9554df0392043965e56a0) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Added collectionId parameter to WarcRecordClient (commit: 7aadc2501609e5b0ee956af0928ef4793af85ccb) — Colin Rosenthal (csr) / githubweb
- Added exactfilename parameter to FileResolverRESTClient. (commit: c8272eac493023dd1aaa9ebef671a1e3a42c3742) — Colin Rosenthal (csr) / githubweb
- Added settings for new job and finished last refactoring parts (commit: dcee3b48afde25b3ab1ac42fa65adc64f672d91e) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Made small fix/cleanup in crawl log mapper and added more documentation (commit: e052b35ecbe8d07c2a88e914d3202d863b57bf50) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Squashed commit of the following: (commit: ab9b8860ca1f5323ca20cabf8a23c7ee01009bc8) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Small changes from review https://sbforge.org/fisheye/cru/CR-NAS-395 (commit: 3d6bc39bc70440065a715acfe11a76e0575c5ea3) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Added a default value for setting useBitmagHadoopBackend (commit: d4d44145f82d3a6d425472d62eed5dd070672e8f) — Colin Rosenthal (csr) / githubweb
- Squashed commit of the following: (commit: 9687194f6e849461945a3f75bdd3906f128d71c8) — Rasmus Bohl Kristensen (rbkr) / githubweb
- First attempt at a kill switch that returns an empty index for dedups (commit: 08d62e8104de4fe99d49b71b4b7933e41987bb56) — Colin Rosenthal (csr) / githubweb
- Second attempt using IndexReadyMessage (commit: 3199f61725d3badd01b8e26a1c0c295c7564cb09) — Colin Rosenthal (csr) / githubweb
- Added some logging (commit: aea04138a7ce030b1772456dee253c744b10453e) — Colin Rosenthal (csr) / githubweb
- Further attempt (commit: 701b2c647c674bc72091877fbd6ab2bd8e989ca9) — Colin Rosenthal (csr) / githubweb
- Further attempt using IndexReadyMessage (commit: ca9377f522312bdb2babb2c33e664becd1fbcf81) — Colin Rosenthal (csr) / githubweb
- Back to reply (commit: ddb5dd34fc6b691f0318b0965dc09b51f3da9f66) — Colin Rosenthal (csr) / githubweb
- Added a bit more logging. (commit: e4c67af253358e93ca41e108cfefff2072a6d9fa) — Colin Rosenthal (csr) / githubweb
- Removed potential error when requesting empty cache (commit: f6a7d91cbb5b2f3feaf42fc1b7e83ada5f2bb73a) — Colin Rosenthal (csr) / githubweb
- Clean-up (commit: f0f4a71edd0773f7fd35d30bfb5f80abe93057eb) — Colin Rosenthal (csr) / githubweb
- Bit of refactoring and made SSL provider to work with https (commit: 8f03956481c07a5c8639a847a6a51a30a8827882) — Rasmus Bohl Kristensen (rbkr) / githubweb
- First attempt at a command-line metadata extraction job. (commit: 8183749da409edb2229dabb86b9e2fbaca8a182d) — Colin Rosenthal (csr) / githubweb
- Changed how the SSLContext is built to avoid trusting self-signed certs (commit: 18e8f959abf99005115a5e2521286242e98db82f) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Fixed the error with closed hadoop file system (commit: 87397cff475747f2efdba4befe07276742918bc7) — Asger Askov Blekinge (abr) / githubweb
- Downgraded hadoop to stable 3.2.2 (commit: 2dd04f160af25e4c61fcdc14c697af66e40b3ed7) — Asger Askov Blekinge (abr) / githubweb
- Created an invoker-module to prevent the job from including all the (commit: fffd21b9e21b9572899722516c0c6e10a16db15c) — Asger Askov Blekinge (abr) / githubweb
- Package in libs (commit: 4ca12576421f97206a423b90978a9b684619ae7b) — Asger Askov Blekinge (abr) / githubweb
- Create FileSystem with newInstance and close it afterwards. DO NOT CLOSE (commit: b006660cc04ac3f6c2442dc0d09b74b6c017c9c9) — Asger Askov Blekinge (abr) / githubweb
- Added an extra sanity check in the run.sh script. (commit: dae7ccb17d1e55a49039a7388ff60797959fc609) — Colin Rosenthal (csr) / githubweb
- Added an extra line to show how to customise location of krb5.conf (commit: 6086fdeca97e33163c28c23062593b6f17f23b95) — Colin Rosenthal (csr) / githubweb
- Improved logging (commit: 6486dd243255f8bbf407a34207a76e5a4a27d331) — Colin Rosenthal (csr) / githubweb
- Modified to support dynamic identification of the correct file-system (commit: dd68c05afd42a7a8beb6c10dde0a134b2a9b47a4) — Colin Rosenthal (csr) / githubweb
- A few clarification fixes to Javadoc (commit: 44da8bb849b06dc55392c01eaf7475563bc75313) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Refactoring to make MetadataIndexingApplication closer to a reusable (commit: 48366f7a72262d6f1442b635b466a2b929b9bcd3) — Colin Rosenthal (csr) / githubweb
- Parametrised the script to make it more flexible. (commit: 048313b7a4b5d7b85e5aba5d9e31227a96008016) — Colin Rosenthal (csr) / githubweb
- Refactored to use login mechanism instead of doAs. (commit: dc8b3564c4d580f0a8078d58514296c11b01dbce) — Colin Rosenthal (csr) / githubweb
- Removed all unnecessary configuration overrides. (commit: e1c9cc6202e4f7da4fa340f9d9d84ab4f94ea957) — Colin Rosenthal (csr) / githubweb
- Added default truststore settings (commit: d3522cfe5d45e6eeafeaf740119b5ef9bc7604d1) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Squashed commit of the following: (commit: 73e016f0e681e23ea9b50428fed236cbba4b706c) — Rasmus Bohl Kristensen (rbkr) / githubweb
- Added line 122 with casting (CleanupIF) (commit: 67d5ca5cad1db2c701d9bcf10046c9368895d4bc) — Peter Christiansen (pech) / githubweb
- Commented out TestCorrect, FileChecksumArchiveTester, (commit: 4319452c3343d6cc5aa64da31c9df337b5672688) — Peter Christiansen (pech) / githubweb
- Changed commenting-out to @Ignore (commit: f99ba354e51331301700cc3a95e044d7f53b9af3) — Peter Christiansen (pech) / githubweb
- Tests that fail locally are ignored (commit: ede844af57f06c03032ab97465a1294c881f4a5b) — Peter Christiansen (pech) / githubweb
- switched line 123 with line 122 CleanupIF.. (commit: fd87b5a10b73ec8f3acbf23ad48445cd61ee6662) — Peter Christiansen (pech) / githubweb
- new pom.xml, interfaces and annotations (commit: 1fd4d1d17ef45ac49b8f1d91504b390ecb47a70e) — Peter Christiansen (pech) / githubweb
- Fixed circular dependency (commit: b9239bc6e51ea9fafa652287c25a3a1a76306c30) — Peter Christiansen (pech) / githubweb
#136 (20-May-2020 08:51:30)
- Updated version for h3 and snapshot version number for NAS (commit: 47601c8cbdcdfb34cce19ac232a207ba5c3a16c3) — Colin Rosenthal (csr) / githubweb
#133 (03-Mar-2020 09:42:34)
- Quick fix to NARK-1819 (commit: 7a4be0771bb6221cfc11689154393b1536dbb940) — Colin Rosenthal (csr) / githubweb
#131 (09-Dec-2019 10:40:21)
- Updated version to a unique name (commit: e3b328a3cca883f6396a2955ef28bb0e2c7d2300) — Colin Rosenthal (csr) / githubweb
#128 (14-Nov-2018 15:37:32)
- Fixed merge mess-up in pomfile. (commit: 961c519cc862817b7a1309b3ccc501367c5b69cb) — Colin Rosenthal (csr) / githubweb
#125 (11-Oct-2018 11:19:37)
- Tidying up of presumed-functioning system test. (commit: 7d1d19dd3d812d4d95bea7048a30ab95058362df) — Colin Rosenthal (csr) / githubweb
#124 (10-Oct-2018 11:43:31)
- Deactivated failing cleanup method. (commit: 980e7603b64465fb8ed6848f438c007b38b34239) — Colin Rosenthal (csr) / githubweb
- Fixed an issue with deactivation on shutdown. (commit: 1faae7cba3d62959b5aebece7f0c3d9529ce9922) — Colin Rosenthal (csr) / githubweb
#123 (10-Oct-2018 10:30:17)
- Consistent dependencies for selenium and htmlunit. (commit: cc534c4379b21c0f3f71c4cd0ab28ff3d2bea0aa) — Colin Rosenthal (csr) / githubweb