lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dominique Bejean <dominique.bej...@eolya.fr>
Subject Re: Stemming and accents
Date Sat, 11 Feb 2017 14:13:27 GMT
Thank you both for your answers.

I tried to find some French homophone words (tache / tâche, bouche /
bouché, ...) with different stems (with snowball, minimal and light
stemmers), but without success. So put the ASCIIFolding filter before the
stemmer is not a big issue (in French) for precision.

Dominique


Le ven. 10 févr. 2017 à 23:06, Ahmet Arslan <iorixxx@yahoo.com> a écrit :

> Hi,
>
> I have experimented before, and found that Snowball is sensitive to
> accents/diacritics.
> Please see for more details:
> http://www.sciencedirect.com/science/article/pii/S0306457315001053
>
> Ahmet
>
>
>
> On Friday, February 10, 2017 11:27 AM, Dominique Bejean <
> dominique.bejean@eolya.fr> wrote:
> Hi,
>
> Is the SnowballPorterFilter sensitive to the accents for French for
> instance ?
>
> If I use both SnowballPorterFilter and ASCIIFoldingFilter, do I have to
> configure ASCIIFoldingFilter after SnowballPorterFilter  ?
>
> Regards.
>
> Dominique
> --
> Dominique Béjean
> 06 08 46 12 43
>
-- 
Dominique Béjean
06 08 46 12 43

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message