lucene-solr-user mailing list archives

From Andrew Stromnov
Subject Re: Problem with Russian stemmer in Solr 1.2
Date Tue, 17 Jul 2007 10:34:01 GMT

Hi Daniel

How can I implement a custom Russian factory with various tokenizers and filters?

Can you provide some code examples?


Daniel Alheiros wrote:
> Hi Andrew,
> Yes, I saw that. As I'm not knowledgeable in Russian, I had to infer that it
> was adequate. But as you have much more to add to it, it would be interesting
> if you could contribute that.
> The problem is that the Russian analyzer and its filters are all final
> classes, which doesn't allow an elegant extension. But you can create an
> analyzer that reuses what is interesting for you (in this case, the stemmer)
> and customizes the other filters. I would suggest doing that by creating the
> Solr factories, so that you can point to your own files containing your
> stopwords. Any chance you could contribute this stopword list?
> One of my reasons for not using the RussianAnalyzer directly was that I need
> to use a WhitespaceTokenizer that removes HTML code, so I created my own
> factories.
> Regards,
> Daniel
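
For reference, the composition idea in the quoted message (reuse the stemmer, swap in your own tokenizer and stopword list) might look roughly like the sketch below. It assumes plain Java stand-ins and does not use the real Lucene or Solr classes: RussianChainSketch, stripHtml, and the stemmer parameter are hypothetical names. In a real setup, the stemming step would delegate to Lucene's Russian stemmer, and each step would be exposed through its own Solr factory declared in schema.xml.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.function.UnaryOperator;

// Hypothetical stand-in for an analysis chain; not a Lucene or Solr class.
// It composes independent steps instead of subclassing a final analyzer.
final class RussianChainSketch {
    private final Set<String> stopwords;          // loaded from your own stopword file
    private final UnaryOperator<String> stemmer;  // placeholder for the reused stemmer

    RussianChainSketch(Set<String> stopwords, UnaryOperator<String> stemmer) {
        this.stopwords = stopwords;
        this.stemmer = stemmer;
    }

    // Crude HTML removal (assumption): replaces anything that looks like a tag
    // with a space. A real HTML-stripping tokenizer is more careful than this.
    static String stripHtml(String text) {
        return text.replaceAll("<[^>]*>", " ");
    }

    // Strip HTML, split on whitespace, lowercase, drop stopwords, then stem.
    List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String raw : stripHtml(text).trim().split("\\s+")) {
            if (raw.isEmpty()) {
                continue; // input was empty or contained only markup
            }
            String lower = raw.toLowerCase(Locale.ROOT);
            if (stopwords.contains(lower)) {
                continue; // custom stopword list takes effect here
            }
            tokens.add(stemmer.apply(lower));
        }
        return tokens;
    }
}
```

Because the stemmer is passed in rather than inherited, the tokenizer and stopword handling can change freely without touching the final classes.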
