Jeppe Ravn-Grove

Merge branch 'NAS-2463-crawl-log-filter'

NAS-2463: Fix of unit test

NAS-2463: Fix of closed stream

NAS-2463: Fix of test to work with the Stream

NAS-2463: For scalability, code now uses Stream instead of array to process the crawled URLs.

NAS-2463: Cleanup of unused code, commented-out code plus documentation

Handling invalid domain input from user

Handling invalid domain input from user

Fix of null-pointer exception in case of no running jobs

NAS-2463: Fixing a minor oops.

NAS-2463: Use existing NASEnvironment instead of making a new one.

Merge remote-tracking branch 'origin/master' into NAS-2463-crawl-log-filter

# Conflicts:

# harvester/harvester-core/src/main/java/dk/netarkivet/harvester/webinterface/servlet/Heritrix3JobMonitorThread.java

# harvester/harvester-core/src/main/java/dk/netarkivet/harvester/webinterface/servlet/JobResource.java

# quickstart-vagrant-environment/redeploy.sh

Committing minor changes to sync code linenumbers/breakpoints with running testcode

Got a "Source code does not match the bytecode" while debugging. This might fix it.

Logging for debugging

Fixing the fix. Erhm :)

Fix of synchronized-bug

More fix of logging for debugging.

Another fix of logging for debugging.

Fix of logging for debugging.

Inserted logging for debugging.

Factoring code out of the jsp, to be able to place breakpoints in the code for debugging

Implementation of functionality to get the crawled URLs, along with a unit-test for it.

This should fix one nullpointer exception (bcoz of error in condition)

Attempt at debugging a new nullpointer exception

Debugging null jobId

Domain search functionality, showing only jobs harvesting searched domain

Method for getting the crawllog

Pulled in the newest version of run-vagrant.sh from NAS-2463.

    • -2
    • +6
    /quickstart-vagrant-environment/run-vagrant.sh
Fixed seed urls having no protocol part.