lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From marco turchi <>
Subject Language Specific Analyzer
Date Sat, 14 Nov 2015 16:38:54 GMT
Dear Users,
I need to develop my language specific analyzer that:
1) does not remove punctuations
2) lowercases and stems each term in the text.

I have tried some of the pre-implemented language analyzer (e.g. German and
Italian analyzers), but they remove punctuation.  I/m not sure, but
probably what I need is the whitespace analyzer instead of the standard

Is there a way to force each language specific analyzer to use the
whitespace analyzer or in general not to remove punctuations?

Thanks a lot!

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message