nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lewis John Mcgibbney <lewis.mcgibb...@gmail.com>
Subject Re: JIRA Nutch 968, File Protocol error 404 while fetching files that contains CJK character in the file name
Date Sat, 01 Sep 2012 10:28:40 GMT
Hi Ye,

On Fri, Aug 31, 2012 at 4:11 PM, Ye T Thet <yethura.thet@gmail.com> wrote:
>
> What is the guide-line for adding properties to the nutch-default.xml? I am
> thinking of using file.name.encoding.
>

Generally speaking the name attribute you suggest looks OK. However
for consistency maybe it should mimic the parser property for
encoding. Namely that the property should be
file.character.encoding.default?

Thanks

Lewis

Mime
View raw message