lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Steven Rowe <>
Subject Re: new spanish analyzer
Date Tue, 10 Jan 2006 16:42:29 GMT
Hola José,

Did you know that Java Lucene already has a contributed Snowball-based 
stemmer/analyzer, very similar to yours?

It looks to me as though your Spanish stopword list is the only 
significant difference.  Would you agree that this is true?

Also, your stoplist loader (SpanishAnalyzer.loadStopWords()) is not 
respecting the '|' comment-to-end-of-line character in your stoplist 


José Ramón Pérez Agüera wrote:
> I have developed a spanish analyzer with spanish stemmer based in Porter algorithm. Its
under GNU license and free for use. I hope that will be useful for spanish lucene users. You
can download the stemmer here:
> if somebody have any sugerences, i will be happy to improve my implementation
> Sorry for my english :-)
> jose
> José Ramón Pérez Agüera
> Despacho 411 tlf. 913947599
> Dept. de Sistemas Informáticos y Programación
> Facultad de Informática
> Universidad Complutense de Madrid

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message