[NAS-2495] NPE in OAIExtractor in bundled Heritrix3 Created: 03/Feb/16 Updated: 19/Feb/16 Resolved: 19/Feb/16 |
|
Status: | Resolved |
Project: | NetarchiveSuite |
Component/s: | None |
Affects Version/s: | None |
Fix Version/s: | 5.1 |
Type: | Bug | Priority: | Minor |
Reporter: | Søren Vejrup Carlsen (Inactive) | Assignee: | Søren Vejrup Carlsen (Inactive) |
Resolution: | Fixed | ||
Labels: | None | ||
Remaining Estimate: | Not Specified | ||
Time Spent: | Not Specified | ||
Original Estimate: | Not Specified |
Issue Links: |
|
||||||||
Verification: | Verified by making a netarkivet.dk harvest with the ExtractorOAI enabled |
Description |
SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder". SLF4J: Defaulting to no-operation (NOP) logger implementation SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details. 2016-02-03 15:40:03.251 INFO thread-12 org.archive.crawler.framework.Engine.addJobDirectory() added crawl job: 11_1454513930810 2016-02-03 15:40:04.572 INFO thread-12 org.archive.crawler.framework.CrawlJob.instantiateContainer() Job instantiated 2016-02-03 15:40:04.832 INFO thread-12 org.archive.crawler.framework.CrawlJob.launch() Job launched 2016-02-03 15:40:06.352 INFO thread-15 org.archive.spring.PathSharingContext.initLaunchId() launch id 20160203154006 2016-02-03 15:40:06.704 INFO thread-15 org.archive.io.WriterPool.<init>() Initial configuration: prefix=11-1, template=${prefix}-${timestamp17}-${serialno}-ciblee_2015_${heritrix.hostname}, compress=false, maxSize=1000000000, maxActive=3, maxWait=500 2016-02-03 15:40:06.756 INFO thread-15 org.archive.crawler.framework.CrawlJob.onApplicationEvent() PREPARING 20160203154006 2016-02-03 15:40:07.732 INFO thread-20 org.archive.crawler.framework.CrawlController.noteFrontierState() Crawl running. 2016-02-03 15:40:07.734 INFO thread-20 org.archive.crawler.framework.CrawlJob.onApplicationEvent() RUNNING 20160203154006 2016-02-03 15:40:16.567 INFO thread-63 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:40:17.231 INFO thread-65 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:40:22.755 INFO thread-65 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:40:31.983 INFO thread-45 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:41:24.145 INFO thread-40 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:41:41.112 INFO thread-26 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:41:44.189 INFO thread-47 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:41:47.180 INFO thread-62 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:41:53.229 INFO thread-67 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:41:57.219 INFO thread-55 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:42:01.249 INFO thread-25 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:42:06.275 INFO thread-36 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:42:14.397 INFO thread-44 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:42:18.307 INFO thread-52 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:42:22.328 INFO thread-63 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:43:46.595 INFO thread-26 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:44:36.524 INFO thread-28 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:46:42.158 INFO thread-23 org.archive.modules.extractor.Extractor.handleException() Exception java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03 15:47:37.808 INFO thread-20 org.archive.crawler.framework.CrawlController.noteFrontierState() Crawl empty. 2016-02-03 15:47:37.808 INFO thread-20 org.archive.crawler.framework.CrawlJob.onApplicationEvent() STOPPING 20160203154006 2016-02-03 15:47:37.809 INFO thread-20 org.archive.crawler.framework.CrawlJob.onApplicationEvent() EMPTY 20160203154006 2016-02-03 15:47:39.360 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/crawl-report.txt 2016-02-03 15:47:39.379 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/seeds-report.txt 2016-02-03 15:47:39.401 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/hosts-report.txt 2016-02-03 15:47:39.416 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/source-report.txt 2016-02-03 15:47:39.422 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/mimetype-report.txt 2016-02-03 15:47:39.426 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/responsecode-report.txt 2016-02-03 15:47:39.427 INFO thread-20 org.archive.modules.writer.WARCWriterProcessor.report() final stats: {response={numRecords=582, totalBytes=37483767, contentBytes=37277685, sizeOnDisk=37483767}, totals={numRecords=582, totalBytes=37483767, contentBytes=37277685, sizeOnDisk=37483767}, warcinfo={numRecords=0, totalBytes=0, contentBytes=0, sizeOnDisk=0}} 2016-02-03 15:47:39.428 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/processors-report.txt 2016-02-03 15:47:39.434 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/frontier-summary-report.txt 2016-02-03 15:47:39.434 INFO thread-20 org.archive.crawler.reporting.StatisticsTracker.writeReportFile() wrote report: /home/devel/TEST6/harvester_high/11_1454513930810/heritrix3/./jobs/11_1454513930810/20160203154006/reports/threads-report.txt 2016-02-03 15:47:39.434 INFO thread-20 org.archive.crawler.framework.CheckpointService.stop() Cleaned up Checkpoint TimerThread. 2016-02-03 15:47:39.440 INFO thread-20 org.archive.crawler.framework.CrawlJob.onApplicationEvent() FINISHED 20160203154006 2016-02-03 15:47:39.440 INFO thread-20 org.archive.crawler.frontier.AbstractFrontier.crawlEnded() Closing with 0 urls still in queue. 2016-02-03 15:51:06.987 INFO thread-79 org.archive.crawler.framework.CrawlJob.doTeardown() Job instance discarded |
Comments |
Comment by Søren Vejrup Carlsen (Inactive) [ 19/Feb/16 ] |
Fixed in https://github.com/netarchivesuite/netarchivesuite/commit/a1e409f015a37b20518e88f15e876a030377f07a |
Comment by Søren Vejrup Carlsen (Inactive) [ 03/Feb/16 ] |
It seems the NPE comes from harvesting some netarkivet.dk urls 2016-02-03T12:16:29.829Z 200 27054 http://netarkivet.dk/feed/ RE http://netarkivet.dk/ application/rss+xml #002 20160203121629393+269 sha1:VCPFV3QTS44Q6TSR7BXDTW2RPCZZOO77 http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:27379 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:16:30.472Z 200 740 http://netarkivet.dk/comments/feed/ RE http://netarkivet.dk/ application/rss+xml #001 20160203121630241+225 sha1:77L4PLRYGYSMMQJI3X6UMP6RZ3N555MM http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1060 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:16:37.427Z 200 1045 http://netarkivet.dk/wp-includes/wlwmanifest.xml RE http://netarkivet.dk/ application/xml #017 20160203121637360+59 sha1:Z4ZPZP7W5ZKAGNNYLC2PTSYE2ZS5V4A2 http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1260 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:16:46.597Z 200 728 http://netarkivet.dk/om-netarkivet/feed/ RLE http://netarkivet.dk/om-netarkivet/ application/rss+xml #046 20160203121646367+225 sha1:AN7HISCMBJTNNXJSWKS5CWG7FBG3ZXUL http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1098 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:17:38.793Z 200 28415 http://netarkivet.dk/arkiv/nyhed/feed/ RLE http://netarkivet.dk/arkiv/nyhed/ application/rss+xml #047 20160203121738496+264 sha1:NLLIUDYADA6G7BQJT7F4ZDYQ6OABPJU3 http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:28740 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:17:49.757Z 200 762 http://netarkivet.dk/status-december-2015/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #035 20160203121749527+226 sha1:ECI4XHQZJS4MWNT5OMLCE2UEPRR7J7SN http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1135 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:17:52.766Z 200 770 http://netarkivet.dk/hvem-bruger-netarkivet/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #004 20160203121752537+225 sha1:5T7XXJIEQQP6RL7ILE5PPZ45ACBYXYLN http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1143 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:17:55.802Z 200 824 http://netarkivet.dk/begivenhedshoestning-om-flygtningekrisen/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #027 20160203121755558+241 sha1:EDSBPCPSL5HGU2C4EYDM57AQEQNYVYFY http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1197 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:18:01.846Z 200 806 http://netarkivet.dk/10-aar-med-webarkivering-i-danmark/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #038 20160203121801615+222 sha1:FV72I6IO3ZVWHA6WMIJRJVJ2JVSA25OZ http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1179 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:18:05.877Z 200 806 http://netarkivet.dk/netarkivet-dokumenterer-valgkampen/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #019 20160203121805615+258 sha1:IDZWTZABX7Z3SAPATTB6FYFEQKMHGUA3 http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1179 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:18:09.855Z 200 830 http://netarkivet.dk/netarkivet-er-klar-til-folketingsvalg-2015/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #042 20160203121809626+226 sha1:7P6CTLJFAXTUWWPNW4HAH2VDDB2QJQYW http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1203 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:18:14.866Z 200 773 http://netarkivet.dk/webarkiverings-workshop/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #046 20160203121814640+222 sha1:MPS2IZH3LUSO2G3MRGHYTBNVT6MCC5LY http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1146 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:18:20.915Z 200 785 http://netarkivet.dk/danmarks-foerste-hjemmeside/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #022 20160203121820687+225 sha1:ZAGAQKXMHQTAFBEAAMECW3LZM5OQ2XAO http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1158 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:18:24.916Z 200 722 http://netarkivet.dk/i-like/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #045 20160203121824693+219 sha1:DDMVG7PVQ3WB2UGWNC6BVH3NOY5GG6C4 http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1095 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:18:28.927Z 200 907 http://netarkivet.dk/eurovision-song-contest-2014-hidtil-stoerste-begivenhedshoestning/feed/ REX http://netarkivet.dk/feed/ application/rss+xml #032 20160203121828700+224 sha1:YYIHIN2TCTWCVZKZOU7VTG7E2KEYOBIO http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1280 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:20:00.384Z 200 788 http://netarkivet.dk/nyt-vaerktoej-til-netarkivet/feed/ RLEX http://netarkivet.dk/arkiv/nyhed/feed/ application/rss+xml #042 20160203122000105+276 sha1:E63OJALCVK3IVLZRPZZYXMW7IHLVSD3Z http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:1161 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:20:55.428Z 200 74095 https://wordpress.org/news/feed/ REXRER http://wordpress.org/news/feed/ application/rss+xml #029 20160203122054334+946 sha1:YK35MRXPI6VEVYGFAKTQWO2FHWB726AY http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:74453 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) 2016-02-03T12:22:41.818Z 200 95333 https://s.w.org/wp-includes/fonts/dashicons.svg REXREE https://s.w.org/wp-includes/css/dashicons.css?20150710 image/svg+xml #038 20160203122241593+216 sha1:JZ2HFVR5SEHYE5V6JHRRIVE4EATPR2E2 http://www.netarkivet.dk/ err=java.lang.NullPointerException,content-size:95615 java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) java.lang.NullPointerException at dk.netarkivet.harvester.harvesting.extractor.ExtractorOAI.innerExtract(ExtractorOAI.java:111) at org.archive.modules.extractor.ContentExtractor.extract(ContentExtractor.java:37) at org.archive.modules.extractor.Extractor.innerProcess(Extractor.java:102) at org.archive.modules.Processor.innerProcessResult(Processor.java:175) at org.archive.modules.Processor.process(Processor.java:142) at org.archive.modules.ProcessorChain.process(ProcessorChain.java:131) at org.archive.crawler.framework.ToeThread.run(ToeThread.java:148) |
Comment by Søren Vejrup Carlsen (Inactive) [ 03/Feb/16 ] |
Declared as minor, as it is not considered a blocker for the |