nutch-dev mailing list archives

From Sherjeel Niazi <sherj...@softmatics.com>
Subject How to resume crawler after crash
Date Thu, 23 Apr 2009 15:02:42 GMT
Hi,

I am using Nutch 0.9.
I am crawling a series of URLs from a website, but after some time the crawler
crashes with the following error:

Exception in thread "main" java.io.IOException: Job failed!
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
    at org.apache.nutch.crawl.CrawlDb.update(CrawlDb.java:97)
    at org.apache.nutch.crawl.CrawlDb.update(CrawlDb.java:62)
    at org.apache.nutch.crawl.Crawl.main(Crawl.java:128)

How can I resume the crawl from where it stopped?


Sherjeel
