nutch-dev mailing list archives

From "Markus Jelsma (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (NUTCH-540) some problem about the Nutch cache
Date Fri, 01 Apr 2011 14:35:07 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-540:
--------------------------------


Bulk close of legacy issues:
http://www.lucidimagination.com/search/document/2738eeb014805854/clean_up_open_legacy_issues_in_jira

> some problem about the Nutch cache
> ----------------------------------
>
>                 Key: NUTCH-540
>                 URL: https://issues.apache.org/jira/browse/NUTCH-540
>             Project: Nutch
>          Issue Type: Bug
>          Components: searcher
>    Affects Versions: 0.9.0
>         Environment: Red hat AS4 + Tomcat5.5 + Nutch0.9
>            Reporter: crossany
>         Attachments: 1.gif, 1186733525.jpg
>
>
> I am Chinese.
> I am testing searches for Chinese words in Nutch. I installed Nutch 0.9 in Tomcat 5 on Linux,
> and the Tomcat charset is UTF-8. I used Nutch to crawl a Chinese website whose charset is
> also UTF-8. When I search for a Chinese word through Nutch on Tomcat, the title and
> description of each search result display correctly. But when I click the cache link, the
> cached page is displayed with the wrong character encoding, even though the cached page's
> charset is also UTF-8. I found another website that uses Nutch, http://www.synoo.com:8080/zh/,
> and tested a search for a Chinese word there. It shows the same error.
> When I use Luke to inspect the segments, the Chinese words display correctly, so I think this
> may be a bug.
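
The symptom reported above (Chinese text intact in the segments, garbled in the cached view) is consistent with UTF-8 bytes being decoded with a different charset somewhere on the display path. The report does not confirm the cause. The following is only a minimal Java sketch of that general failure mode; the class name CharsetMismatchDemo, the helper decode, and the choice of ISO-8859-1 as the wrong charset are assumptions for illustration, not Nutch code.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of Nutch.
public class CharsetMismatchDemo {

    // Decode raw bytes using the given charset.
    static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        // Two Chinese characters, written as escapes so the source file
        // encoding does not affect the demo.
        String original = "\u4e2d\u6587";

        // What a UTF-8 page stores: three bytes per character here.
        byte[] utf8 = original.getBytes(StandardCharsets.UTF_8);

        // Decoding with the charset the bytes were written in round-trips.
        String right = decode(utf8, StandardCharsets.UTF_8);

        // Decoding with the wrong charset (assumed ISO-8859-1 for the demo)
        // turns every byte into its own character, which shows up as garbage.
        String wrong = decode(utf8, StandardCharsets.ISO_8859_1);

        System.out.println("round trip ok: " + right.equals(original));
        System.out.println("wrong decode matches: " + wrong.equals(original));
        System.out.println("wrong decode length: " + wrong.length());
    }
}
```

If a check like this reproduces the garbling seen in the cached page, the place to look would be wherever the cached content bytes are turned into a string or a response, and which charset is applied there.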

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
