nutch-dev mailing list archives

From Jérôme Charron <jerome.char...@gmail.com>
Subject Re: regex-normalize.xml
Date Fri, 02 Sep 2005 10:23:06 GMT
> 
> I think I expressed it wrong. The question was whether it is a feature
> or a bug that regex-normalize.xml is used only after these changes.

regex-normalize.xml is used only once you specify that you want the 
RegexUrlNormalizer implementation, that is, only if you set 
urlnormalizer.class=org.apache.nutch.net.RegexUrlNormalizer.
But it must also work if you remove the urlnormalizer.class = 
org.apache.nutch.net.BasicUrlNormalizer entry in the nutch-default.
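
As a rough sketch, the setting would look something like the property 
below. The property name and class name are the ones given above; the 
<property>/<name>/<value> layout and the idea of putting it in a site 
override file rather than editing nutch-default itself are assumptions 
about your setup, so check them against your own configuration files.

```xml
<!-- Assumed to sit inside the root element of your site override file. -->
<property>
  <name>urlnormalizer.class</name>
  <!-- Selects the regex-based normalizer, which is what reads regex-normalize.xml. -->
  <value>org.apache.nutch.net.RegexUrlNormalizer</value>
</property>
```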

Regards

Jérôme


-- 
http://motrech.free.fr/
http://www.frutch.org/
