nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ye T Thet <yethura.t...@gmail.com>
Subject Re: JIRA Nutch 968, File Protocol error 404 while fetching files that contains CJK character in the file name
Date Sun, 02 Sep 2012 07:54:07 GMT
Hi Lewis,

Your suggestion sounds good. I supposed the patch I would be submitting
changes in two file then.

nutch-default.xml for default encoding setting
FileResponse.java for some code change

Please advise.

Thanks,

Ye


On Sat, Sep 1, 2012 at 6:28 PM, Lewis John Mcgibbney <
lewis.mcgibbney@gmail.com> wrote:

> Hi Ye,
>
> On Fri, Aug 31, 2012 at 4:11 PM, Ye T Thet <yethura.thet@gmail.com> wrote:
> >
> > What is the guide-line for adding properties to the nutch-default.xml? I
> am
> > thinking of using file.name.encoding.
> >
>
> Generally speaking the name attribute you suggest looks OK. However
> for consistency maybe it should mimic the parser property for
> encoding. Namely that the property should be
> file.character.encoding.default?
>
> Thanks
>
> Lewis
>

Mime
View raw message