NetarchiveSuite-Webdanica

Added and fixed drop in hbase-phoenix ddl.

Added some more unit tests. Changed the da language rule threshold to higher than 0.99 instead of 0.95

Update of automatic-workflow scripts

    /automatic-workflow/crontab.webdanica (+8, -0)
    /automatic-workflow/pig16-call-script.sh (+3, -2)
    /automatic-workflow/verify_pig_bootup.sh (+29, -0)
    /automatic-workflow/webdanica-analysis-cron.sh (+41, -12)
Removing cronjobs from their old location

Restructuring the folder for webdanica-cronjobs

Send admin mail whenever workflow has crashed

Implemented new curator LIKELY_DK Danish codes (WEBDAN-213)

Fixes for WEBDAN-209 and WEBDAN-210

Various updates to harvestWorkflow and GUI

  1. … 7 more files in changeset.
Added a setting for max time to wait for a harvest, and started on improving the domain page

Fixed various bugs - finished the SynchronizeCrawlertraps tool

  1. … 3 more files in changeset.
Add reports field to table 'harvests' and remove fields 'seed_report' and 'crawllog'

Removed NPE when showing domain details

Improvement of domain page - disabling of harvestdefinition after job is finished

Added continuous logging to loadSeeds tool

Merge pull request #5 from blekinge/betterErrorHandling

Changes for better exception handling in loadSeeds

Merge branch 'master' of github.com:netarchivesuite/webdanica into betterErrorHandling

# Conflicts:

# webdanica-core/src/main/java/dk/kb/webdanica/core/tools/LoadSeeds.java

Rework of exception handling starting with loadSeeds and moving towards the DAOs

Implemented WEBDAN-205

Bumped version to 1.1-RC5 - added errors.log to LoadSeeds, and added missing fields to seeds table: exported and exported_time

Added the latest script to create the statecache table

Work on WEBDAN-165 - improved the logging

Add comments to the crontab defaults

Added reasonable default schedules for the filtering workflow (every 10 minutes) and the cacheupdate workflow (every 15 minutes)

Only run filtering according to crontab

Finish work on WEBDAN-196

    /webdanica-webapp/src/main/webapp/master.html (+1, -1)
Work on WEBDAN-196 - changed the SeedsResource to use the statecache

Work on WEBDAN-169 - implemented most of the cache code now - and the matching update workflow

Adding script for creating statecache (WEBDAN-196)

Making a 1.1-RC2 build

  1. … 6 more files in changeset.