nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Markus Jelsma (JIRA)" <>
Subject [jira] [Updated] (NUTCH-540) some problem about the Nutch cache
Date Fri, 01 Apr 2011 14:35:07 GMT


Markus Jelsma updated NUTCH-540:

Bulk close of legacy issues:

> some problem about the Nutch cache
> ----------------------------------
>                 Key: NUTCH-540
>                 URL:
>             Project: Nutch
>          Issue Type: Bug
>          Components: searcher
>    Affects Versions: 0.9.0
>         Environment: Red hat AS4 + Tomcat5.5 + Nutch0.9
>            Reporter: crossany
>         Attachments: 1.gif, 1186733525.jpg
> I'am a chinese.
> I just test to search chinese word in nutch. I install nutch0.9 in tomcat5 on linux.and
the Tomcat charset it's UTF-8 and I use nutch to Crawl the website it a chinese website the
web charset it's also UTF-8. when Use the nutch on tomcat for search chinese word , I find
the search result' Title and description was right to display. but when I click the cache,
the cache web was display a error charset code, I see the cache
> web' charset also utf-8. I find a website use Nutch I just
test to search chinese word . It's also error.
> I use Luke to see the segments It's can display chinese word, I think maybe it's a Bug.

This message is automatically generated by JIRA.
For more information on JIRA, see:

View raw message