lucene-solr-user mailing list archives

From Andrew Stromnov <strom...@gmail.com>
Subject Re: Problem with Russian stemmer in Solr 1.2
Date Tue, 17 Jul 2007 10:34:01 GMT

Hi Daniel

How do I implement a custom Russian factory with various tokenizers and filters?

Can you provide some code examples?

Regards,
Andrew


Daniel Alheiros wrote:
> 
> Hi Andrew
> 
> Yes, I saw that. As I'm not knowledgeable in Russian, I had to infer it
> was adequate. But as you have much more to add to it, it would be
> interesting if you could contribute that.
> 
> The problem is that the Russian analyzer and its filters are all final
> classes, which does not allow an elegant extension. But you can create an
> analyzer that reuses what is interesting for you (in this case, the
> stemmer) and customizes the other filters. I would suggest doing that by
> creating the Solr factories, so you can point to the files containing
> your stopwords. Any chance you could contribute this stopwords list?
> 
> One of my reasons for not using the RussianAnalyzer directly was that I
> need to use a WhitespaceTokenizer that removes HTML code... So I created
> my own factories.
> 
> Regards,
> Daniel 
> 
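
To make the request concrete, here is a rough sketch of how I understand the
suggestion above would look in schema.xml. It is only an illustration, not
Daniel's actual setup. The field type name text_ru, the file name
stopwords_ru.txt and the class com.example.solr.RussianStemFilterFactory are
all hypothetical. The last one stands for a factory that would have to be
written by hand, presumably as a small class that wraps Lucene's
RussianStemFilter. The solr.* factories are the stock ones as far as I know,
but please check that they exist in your Solr version.

```xml
<!-- Sketch only: a field type assembled from individual factories
     instead of the final RussianAnalyzer class. -->
<fieldType name="text_ru" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Whitespace tokenizer that strips HTML markup before tokenizing -->
    <tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory"/>
    <!-- Lowercase first; assumed here because the stemmer works on lowercase input -->
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- Custom stopword list; stopwords_ru.txt is a hypothetical file in the conf directory -->
    <filter class="solr.StopFilterFactory" words="stopwords_ru.txt" ignoreCase="true"/>
    <!-- Hypothetical hand-written factory wrapping Lucene's RussianStemFilter -->
    <filter class="com.example.solr.RussianStemFilterFactory"/>
  </analyzer>
</fieldType>
```

With a layout like this, only the stemmer needs custom Java code, and the
stopword list can be changed without recompiling anything.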

-- 
View this message in context: http://www.nabble.com/Problem-with-Russian-stemmer-in-Solr-1.2-tf4049948.html#a11646823
Sent from the Solr - User mailing list archive at Nabble.com.

