nutch-dev mailing list archives

From Sami Siren <ssi...@gmail.com>
Subject Re: how is crawl-urlfilter.txt taken care of?
Date Wed, 09 May 2007 17:58:56 GMT
Manoharam Reddy wrote:
> I find four url-filters
> 
> automaton-urlfilter.txt
> regex-urlfilter.txt
> suffix-urlfilter.txt
> crawl-urlfilter.txt
> 
> I can see plugins for the first three in the nutch-site.xml file, but not
> for the fourth. So how does Nutch take crawl-urlfilter.txt into account?

This question is more suitable for the user list.

crawl-urlfilter.txt is used by the crawl command by default (see crawl-tool.xml).
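
For illustration, the override in crawl-tool.xml probably looks something like the fragment below. This is a sketch from memory of the stock Nutch configuration of that era, not text quoted from this thread, so check the property name and value against the conf/ directory of your own installation. The idea is that the crawl command loads crawl-tool.xml on top of the normal configuration, so the regex URL filter plugin reads crawl-urlfilter.txt instead of regex-urlfilter.txt, and no separate plugin is needed for it.

```xml
<!-- Assumed excerpt of conf/crawl-tool.xml; verify against your Nutch version. -->
<property>
  <!-- File the regex URL filter plugin reads its rules from. -->
  <name>urlfilter.regex.file</name>
  <!-- For the one-step crawl command this points at crawl-urlfilter.txt. -->
  <value>crawl-urlfilter.txt</value>
</property>
```

If this matches your setup, the individual step-by-step tools (inject, generate, fetch and so on) would still use regex-urlfilter.txt, since they do not load crawl-tool.xml.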

--
 Sami Siren

