[NAS-2732] Columns Bytes Harvested / Documents Harvested / Stopped due to on page Details for job XXX are always empty after the job termination. Created: 09/Apr/18 Updated: 10/Apr/18 Resolved: 10/Apr/18 |
|
Status: | Closed |
Project: | NetarchiveSuite |
Component/s: | GUI |
Affects Version/s: | None |
Fix Version/s: | 5.4 |
Type: | Bug | Priority: | Blocker |
Reporter: | Søren Vejrup Carlsen (Inactive) | Assignee: | Unassigned |
Resolution: | Rejected | ||
Labels: | None | ||
Remaining Estimate: | Not Specified | ||
Time Spent: | Not Specified | ||
Original Estimate: | Not Specified |
Description |
It seems that BNF has experienced errors in the numbers reported back from the harvestcontroller when using the BnfHarvestReport instead of the default LegacyHarvestReport. |
Comments |
Comment by Colin Rosenthal [ 10/Apr/18 ] |
Ok. Maybe we should have an issue suggesting an improvement in the GUI that flags this possibility. |
Comment by Sara Aubry [ 10/Apr/18 ] |
After further tests, we found that the problems came from (human) errors in domain names that did not match the seeds, so NAS could not associate the seed calculations in the crawl.log with the domains in the harvest definition. |
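A mismatch of this kind could be caught with a quick check of seeds against domain names before a job is started. The following is only a minimal sketch under stated assumptions: `seed_host` and `unmatched_seeds` are hypothetical helper names, not part of NetarchiveSuite, and the plain suffix match used here is a simplification rather than NAS's own domain logic.

```python
from urllib.parse import urlsplit


def seed_host(seed: str) -> str:
    """Return the lower-cased host of a seed URL (the scheme is optional)."""
    # urlsplit only recognises the host when '//' is present, so add it if missing.
    url = seed if "//" in seed else "//" + seed
    return (urlsplit(url).hostname or "").lower()


def unmatched_seeds(seeds, domains):
    """Return the seeds whose host belongs to none of the given domain names.

    Assumption: a host belongs to a domain if it equals the domain or ends
    with '.' + domain. This is a simplification for illustration only.
    """
    known = {d.lower().rstrip(".") for d in domains}
    missing = []
    for seed in seeds:
        host = seed_host(seed)
        if not any(host == d or host.endswith("." + d) for d in known):
            missing.append(seed)
    return missing


if __name__ == "__main__":
    # Hypothetical data: the second seed has a typo in its domain name.
    seeds = ["http://www.example.fr/", "http://exmaple.fr/news"]
    for seed in unmatched_seeds(seeds, ["example.fr"]):
        print("seed does not match any configured domain:", seed)
```

Any seed reported by such a check would be a candidate for the kind of typo described above.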
Comment by Colin Rosenthal [ 10/Apr/18 ] |
I've tried various things but haven't been able to reproduce this:
<harvestReport>
  <class>dk.netarkivet.harvester.harvesting.report.BnfHarvestReport</class>
  <disregardSeedURLInfo>false</disregardSeedURLInfo>
</harvestReport>
. . .
<objectLimitIsSetByQuotaEnforcer>false</objectLimitIsSetByQuotaEnforcer>
Is there anything in your crawler-beans that might be relevant? |