我正在尝试在Eclipse中运行nutch 1.9,我的所有配置均根据本文(http://yewintko.wordpress.com/2014/02/02/setting-up-nutch-in-eclipse-indigo/)进行。但是我得到了这个错误:
CrawlDb update: starting at 2014-11-10 15:50:10
CrawlDb update: db: urls
CrawlDb update: segments: [3, crawl]
CrawlDb update: additions allowed: true
CrawlDb update: URL normalizing: false
CrawlDb update: URL filtering: false
CrawlDb update: 404 purging: false
CrawlDb update: Merging segment data into db.
CrawlDb update: java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
at org.apache.nutch.crawl.CrawlDb.update(CrawlDb.java:119)
at org.apache.nutch.crawl.CrawlDb.run(CrawlDb.java:219)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.CrawlDb.main(CrawlDb.java:179)
最佳答案
您是否尝试过按照Nutch WIKI的步骤进行操作?
关于eclipse - 在Eclipse中运行nutch1.9时出现错误CrawlDb更新:java.io.IOException:作业失败,我们在Stack Overflow上找到一个类似的问题:https://stackoverflow.com/questions/26839442/