nutch-dev mailing list archives

From Davide Cavalaglio <davide.cavalag...@desktopsrl.com>
Subject Nutch on file system and web
Date Mon, 04 Oct 2010 10:49:08 GMT
Hi,
I have a question: is it possible to configure Nutch to crawl the
file system and the web at the same time?

I want to start the crawler with two seeds:
1) http://www.myWebSite.com/
2) file:///C:/MyFile/
Is this possible with a single crawler? Can a single configuration
(nutch-site.xml, crawl-urlfilter.txt) be used for two different
protocols?

Thanks,
Davide
