lucene-java-user mailing list archives

From Trejkaz <trej...@trypticon.org>
Subject Re: Is StandardAnalyzer good enough for multi languages...
Date Wed, 09 Jan 2013 05:12:43 GMT
On Wed, Jan 9, 2013 at 10:57 AM, Steve Rowe <sarowe@gmail.com> wrote:
> Trejkaz (and maybe Sai too): ICUTokenizer in Lucene's icu module may be of interest
> to you, along with the token filters in that same module. - Steve

ICUTokenizer sounds like it implements UAX #29, which is exactly the
standard with all the issues I was describing. Unless it does more
than that, I would recommend against using it as well.
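
For anyone who wants to see what boundary-rule word segmentation does to
their own text before deciding, here is a rough sketch. It uses the JDK's
java.text.BreakIterator, which applies similar word-boundary rules but is
not ICUTokenizer and may differ from it in detail. The class name
WordBreakSketch and the "keep segments containing a letter or digit"
filter are assumptions made purely for illustration.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper for illustration; not part of Lucene or ICU.
class WordBreakSketch {

    // Splits text at the JDK's word boundaries and keeps only the
    // segments that contain at least one letter or digit, so that
    // whitespace and punctuation segments are dropped.
    static List<String> tokens(String text) {
        BreakIterator it = BreakIterator.getWordInstance(Locale.ROOT);
        it.setText(text);
        List<String> out = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String segment = text.substring(start, end);
            if (segment.codePoints().anyMatch(Character::isLetterOrDigit)) {
                out.add(segment);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Pass your own sample text as the first argument.
        String sample = args.length > 0 ? args[0] : "Hello, world";
        for (String token : tokens(sample)) {
            System.out.println("[" + token + "]");
        }
    }
}
```

Running it over a few strings in each language you care about should show
quickly whether the boundaries are acceptable for your data.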

TX


