lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Marius Seiceanu <>
Subject German stamming algorithm problem
Date Fri, 03 Oct 2003 13:50:07 GMT

    I have an application which make searches in Lucene indexed 
documents. The documents content is in German language.
    I use Lucene 1.3rc1.

    If I search for "Universit├Ąt" i get some results, but if I search 
for "universit├Ąt" i get no results.

    In the CHANGES.TXT of 1.3rc1 
point 11 says that stamming is not case sensitive anymore.
11. Changed the German stemming algorithm to ignore case while 
stripping. The new algorithm is faster and produces more equal stems 
from nouns and verbs derived from the same word. (gschwarz)
    For  "Gesetz" and "gesetz" i get the same number of results!

Thank you,
       Marius Seiceanu.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message