[NAS-2732]  Columns Bytes Harvested / Documents Harvested / Stopped due to on page Details for job XXX are always empty after the job termination. Created: 09/Apr/18  Updated: 10/Apr/18  Resolved: 10/Apr/18

Status: Closed
Project: NetarchiveSuite
Component/s: GUI
Affects Version/s: None
Fix Version/s: 5.4

Type: Bug Priority: Blocker
Reporter: Søren Vejrup Carlsen (Inactive) Assignee: Unassigned
Resolution: Rejected  
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified


 Description   

It seems that BNF has experienced errors in the numbers reported back from the harvestcontroller when using the BnfHarvestReport instead of the default LegacyHarvestReport



 Comments   
Comment by Colin Rosenthal [ 10/Apr/18 ]

Ok. Maybe we should have an issue suggesting an improvement in the GUI that flags this possibility.

Comment by Sara Aubry [ 10/Apr/18 ]

After further tests, we noticed that problems were coming from (human) errors in domain names that did not match the seeds so NAS could not associate seed caclulation in the crawl.log and domains in the harvest definition.
The issue can be closed or deleted.

Comment by Colin Rosenthal [ 10/Apr/18 ]

I've tried various things but haven't been able to reproduce this :

         <harvestReport>
                <class&gt;dk.netarkivet.harvester.harvesting.report.BnfHarvestReport</class&gt;
                <disregardSeedURLInfo>false</disregardSeedURLInfo>
            </harvestReport>
.
.
.
             <objectLimitIsSetByQuotaEnforcer>false</objectLimitIsSetByQuotaEnforcer> 

Is there anything in your crawler-beans that might be relevant?
Can you go directly into your database and see if the HarvestReport info is in there?

Generated at Fri Apr 26 18:17:16 CEST 2024 using Jira 9.4.15#940015-sha1:bdaa9cbecfb6791ea579749728cab771f0dfe90b.