lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Toru Matsuzawa (JIRA)" <>
Subject [jira] Created: (LUCENE-973) Token of "" returns in CJK
Date Tue, 07 Aug 2007 13:32:00 GMT
Token of  "" returns in CJK

                 Key: LUCENE-973
             Project: Lucene - Java
          Issue Type: Bug
          Components: Analysis
    Affects Versions: 2.3
            Reporter: Toru Matsuzawa

The "" string returns as Token in the boundary of two byte character and one byte character.

There is no problem in CJKAnalyzer. 
When CJKTokenizer is used with the unit, it becomes a problem. (Use it with 
Solr etc.)

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message