nutch-dev mailing list archives

From Christophe Noel <christophe.n...@gmail.com>
Subject Plugins problems
Date Thu, 03 Mar 2005 08:52:40 GMT
Hello,

I would like to know more about the parse-ext plugin. What can it do, for example?

Also, I get the following error when I crawl with the index-more plugin:

050302 183540 Updating /nutch-0.6/agoria.2mar/db
050302 183540 Updating for /nutch-0.6/agoria.2mar/segments/20050302183116
050302 183540 Processing document 0
050302 183541 Finishing update
050302 183542 Processing pagesByURL: Sorted 2931 instructions in 0.915 seconds.
050302 183542 Processing pagesByURL: Sorted 3203.27868852459 instructions/second
Exception in thread "main" java.io.IOException: already exists: /nutch-0.6/agoria.2mar/db/webdb.new/pagesByURL
        at net.nutch.io.MapFile$Writer.<init>(MapFile.java:67)
        at net.nutch.db.WebDBWriter$CloseProcessor.closeDown(WebDBWriter.java:536)
        at net.nutch.db.WebDBWriter.close(WebDBWriter.java:1531)
        at net.nutch.tools.UpdateDatabaseTool.close(UpdateDatabaseTool.java:301)
        at net.nutch.tools.UpdateDatabaseTool.main(UpdateDatabaseTool.java:351)
        at net.nutch.tools.CrawlTool.main(CrawlTool.java:128)

Thanks for your help.

Christophe.
