lucene-java-user mailing list archives

From Chris Hostetter <>
Subject Re: whats the correct way to do normalisation?
Date Wed, 08 Nov 2006 18:42:16 GMT

: I want "Überraschung" to be found by
: Überr*
: Ueberr*
: So the best I can do is to do the normalisation manually (not by an
: analyzer) before the indexing/searching process?

Or use an Analyzer at index time that puts both the original version of
the string (with the umlaut) and the transliterated ASCII version in the
same field, at the same position so they still work with phrases. At
query time, just search for the text the user types in as is. That should
work for straight term queries as well as for prefix/wildcard queries,
which don't get analyzed at query time.
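
A rough sketch of that idea in plain Java, without any Lucene classes, so
it only illustrates the token layout and is not a real Analyzer. The names
UmlautVariants, Tok, fold, analyze and matchesPrefix are made up for this
example, and the lowercasing and the ae/oe/ue/ss mapping are assumptions.
In Lucene the second variant would be emitted by a token filter as an
extra token with a position increment of 0.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only: mimics an index-time analyzer that
// emits the original term and its ASCII transliteration at one position.
final class UmlautVariants {

    /** A term plus its position increment (0 = same position as previous term). */
    static final class Tok {
        final String term;
        final int posInc;

        Tok(String term, int posInc) {
            this.term = term;
            this.posInc = posInc;
        }
    }

    /** Transliterates lowercase German umlauts and sharp s to ASCII. */
    static String fold(String s) {
        StringBuilder sb = new StringBuilder(s.length() + 4);
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '\u00e4': sb.append("ae"); break; // a-umlaut
                case '\u00f6': sb.append("oe"); break; // o-umlaut
                case '\u00fc': sb.append("ue"); break; // u-umlaut
                case '\u00df': sb.append("ss"); break; // sharp s
                default: sb.append(c);
            }
        }
        return sb.toString();
    }

    /** Splits on whitespace, lowercases, and adds the folded variant at the same position. */
    static List<Tok> analyze(String text) {
        List<Tok> out = new ArrayList<>();
        for (String word : text.trim().split("\\s+")) {
            if (word.isEmpty()) {
                continue;
            }
            String lower = word.toLowerCase(Locale.ROOT);
            out.add(new Tok(lower, 1));
            String folded = fold(lower);
            if (!folded.equals(lower)) {
                out.add(new Tok(folded, 0)); // stacked on the original term
            }
        }
        return out;
    }

    /** Simulates an unanalyzed (only lowercased) prefix query against the indexed terms. */
    static boolean matchesPrefix(List<Tok> indexed, String prefix) {
        String p = prefix.toLowerCase(Locale.ROOT);
        for (Tok t : indexed) {
            if (t.term.startsWith(p)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        List<Tok> toks = analyze("Die \u00dcberraschung");
        System.out.println(matchesPrefix(toks, "\u00dcberr")); // umlaut spelling
        System.out.println(matchesPrefix(toks, "Ueberr"));     // ASCII spelling
    }
}
```

Because both spellings are indexed, the query side needs no folding at
all: whichever form the user types is matched against a term that is
already in the index.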

