lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shalin Shekhar Mangar <shalinman...@gmail.com>
Subject Re: ISOLatin1AccentFilter before or after Snowball?
Date Wed, 07 Oct 2009 08:44:52 GMT
On Tue, Oct 6, 2009 at 4:33 PM, Chantal Ackermann <
chantal.ackermann@btelligent.de> wrote:

> Hi all,
>
> from reading through previous posts on that subject, it seems like the
> accent filter has to come before the snowball filter.
>
> I'd just like to make sure this is so. If it is the case, I'm wondering
> whether snowball filters for i.e. French process accented language
> correctly, at all, or whether they remove accents anyway... Or whether
> accents should be removed whenever making use of snowball filters.
>
>
I'd think so but I'm not sure. Perhaps someone else can weigh in.


>
> And also: it really is meant to take UTF-8 as input, even though it is
> named ISOLatin1AccentFilter, isn't it?
>
>
See http://markmail.org/message/hi25u5iqusfu542b

-- 
Regards,
Shalin Shekhar Mangar.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message