Up

Changes

#155 (12-May-2022 11:17:34)

  1. Added some fixes so getMetadataMapper works when caching is disabled. (commit: ab09f8ef5eb0c73702fe9598fedc9a6b92f197fc) — Colin Rosenthal (csr) / detail
  2. Testing new crawlrss (commit: c95f4b4e426fcf194116ae8a0e3ead5d2c26ecc0) — Colin Rosenthal (csr) / detail

#148 (07-Feb-2022 08:43:40)

  1. NAS-2874 Increment loaded TLD counter (commit: 9ce76c4c90200e2aab6c837ae8da2ecc45ed58a6) — clara.wiatrowski / detail
  2. [maven-release-plugin] prepare release netarchivesuite-7.3 (commit: 21bc5d6b60808accb511deb6542151803e3fd283) — Colin Rosenthal (csr) / detail
  3. [maven-release-plugin] prepare for next development iteration (commit: ab6a59444fb018007fec0f5ca2b218a72398b382) — Colin Rosenthal (csr) / detail
  4. Added fallback behaviour if hdfs caching fails to cache file (commit: 76ac9cb9eb40bd9fcc9010d8f12d7e64e25ac443) — Colin Rosenthal (csr) / detail

#145 (25-Jan-2022 12:43:52)

  1. Added some logging. (commit: 4f2a7081eba3dbf57dd74356caac68bf394bb4de) — Colin Rosenthal (csr) / detail
  2. Improved logging in PutFileEventHandler (commit: fa6ad587d2ca341a37f1c95a72cc699737b71052) — Colin Rosenthal (csr) / detail
  3. Added handling for IDENTIFY_TIMEOUT and correct handling of out-of-sync (commit: 0fe311646422afdbc3d46bebc0d550af56e558bd) — Colin Rosenthal (csr) / detail
  4. Ensure jobs are closed to prevent threadleak in invoking java process (commit: bf6398619466adaf8f019aae7210544afc6d142c) — Asger Askov Blekinge (abr) / detail
  5. Ensure Filesystem objects are closed after use (commit: 802c4d77c7232bf263fc1b0a534c0ee2944b83b1) — Asger Askov Blekinge (abr) / detail
  6. Include exit code in IOFailure exception (commit: 75bc6fef31051e97fa06da8e58c28c3d508346b7) — Asger Askov Blekinge (abr) / detail
  7. HadoopJobTools logs if the job failes (commit: 22d8f391af675451ab3dbef953113f2d77d968a3) — Asger Askov Blekinge (abr) / detail
  8. Changed log level. (commit: 6d4214fb0e32b897d1bddc72add08d79a5ed0dde) — Colin Rosenthal (csr) / detail
  9. GetMetadataMapper and cacheFile report progress to prevent a (commit: 934d4bb20c17b3e7ed28636af4164857e0fb7705) — Asger Askov Blekinge (abr) / detail
  10. Merged commit (commit: 7a085bb6f6c16325f007b5ff3614731fe928968e) — Colin Rosenthal (csr) / detail
  11. Explicitly create cache file when caching hdfs (commit: 34952e98ab34e3985d9171f85ee4128d6e7f8d29) — Asger Askov Blekinge (abr) / detail
  12. Modified FileResolver to return empty if http response code is not 200. (commit: cbc51994639305fe8a36746c6ba4c00492b3173e) — Colin Rosenthal (csr) / detail
  13. Fixed bitmag getfileids and some cleanup (commit: 520e4de5e267e3af2d40fa6e78803c15a33df510) — Colin Rosenthal (csr) / detail
  14. Added direct output streaming from hdfs (commit: 480e7411841e986bc48e2bc9fa7d5f759f5eead7) — Colin Rosenthal (csr) / detail
  15. Rewritten GetFileIDsAction to use a new handler for each call. (commit: d354945a9730fe2b394f1cc434afefa39d4cf69e) — Colin Rosenthal (csr) / detail
  16. Fixed some issues with holding large hadoop result sets in memory (commit: c5d6c5d38bb0cfe52a15493ccde9fa24a07e6a8d) — Colin Rosenthal (csr) / detail