From: Otis Gospodnetic
Date: Wed, 11 Jul 2007 08:59:14 -0700 (PDT)
Subject: Re: Stemmer bug?
To: solr-user@lucene.apache.org
Message-ID: <902488.30731.qm@web50311.mail.re2.yahoo.com>

Without looking at the SnowballPorterFilterFactory sources: have you tried a different language="XXXX" with content in that other language?

Otis
--
Lucene Consulting -- http://lucene-consulting.com/

----- Original Message ----
From: Andrew Stromnov
To: solr-user@lucene.apache.org
Sent: Wednesday, July 11, 2007 12:12:53 AM
Subject: Re: Stemmer bug?

Hi

RussianAnalyzer produces Russian stemmed forms, but SnowballPorterFilterFactory with language="Russian" leaves _all_ Russian content unchanged.

hossman wrote:
>
> : Subject: Stemmer bug?
>
> Can you elaborate on what exactly you view as a bug?
>
> If the issue is just that one of the examples stems something in a way
> that you think makes sense, but the other one does not, that really isn't a
> bug so much as it is a comment on the effectiveness of the Snowball
> Stemmer for Russian vs. the RussianStemmer class used by the
> RussianAnalyzer. If you like the stemming that comes out of the
> RussianAnalyzer, you can use the RussianStemFilter yourself by creating a
> simple FilterFactory around it (there are lots of examples in the Solr
> code base).
>
> Also keep in mind that the Snowball Stemmer is not designed to produce
> "real" words when it stems ... it's an algorithmic stemmer designed to
> produce artificial stems for common cases ... so if you think it's a bug
> because it produces terms that aren't real words -- it's not, that's just
> the way it works -- what matters is that it produces the same artificial
> stem for related words.
>
> -Hoss
>

--
View this message in context: http://www.nabble.com/Problem-with-Russian-stemmer-in-Solr-1.2-tf4049948.html#a11530601
Sent from the Solr - User mailing list archive at Nabble.com.
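
To illustrate Hoss's point that an algorithmic stemmer only has to map related words to the same artificial stem, here is a minimal sketch in Python. It is not the Snowball algorithm and not Solr code: the suffix list and the names toy_stem and SUFFIXES are made up for this example, and English words are used only to keep it readable.

```python
# Toy suffix-stripping stemmer. NOT the Snowball algorithm: the rule list
# below is a hypothetical, hand-picked set of English endings.
SUFFIXES = ("ations", "ation", "ing", "ed", "es", "e", "s")  # longer endings first


def toy_stem(word: str) -> str:
    """Strip the first matching suffix, keeping at least three characters."""
    lowered = word.lower()
    for suffix in SUFFIXES:
        if lowered.endswith(suffix) and len(lowered) - len(suffix) >= 3:
            return lowered[: -len(suffix)]
    return lowered


related = ["compute", "computes", "computed", "computing", "computation"]
stems = {toy_stem(word) for word in related}

# Every related form collapses to one stem, and that stem is not a real word.
print(stems)
```

All five forms collapse to the single stem "comput", which is not a dictionary word. That is enough for search, because the same stemming is applied to both the indexed text and the query, so the two sides still match.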