From solr-user-return-35733-apmail-lucene-solr-user-archive=lucene.apache.org@lucene.apache.org Tue May 04 10:13:43 2010 Return-Path: Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: (qmail 64514 invoked from network); 4 May 2010 10:13:43 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 4 May 2010 10:13:43 -0000 Received: (qmail 28606 invoked by uid 500); 4 May 2010 10:13:40 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 28512 invoked by uid 500); 4 May 2010 10:13:40 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 28504 invoked by uid 99); 4 May 2010 10:13:39 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 May 2010 10:13:39 +0000 X-ASF-Spam-Status: No, hits=2.8 required=10.0 tests=AWL,FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of dctech1000@gmail.com designates 209.85.223.189 as permitted sender) Received: from [209.85.223.189] (HELO mail-iw0-f189.google.com) (209.85.223.189) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 May 2010 10:13:34 +0000 Received: by iwn27 with SMTP id 27so4448555iwn.5 for ; Tue, 04 May 2010 03:13:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=XCx1zSoowO2AhL6G9OvdvphE9TfGrVI+k+TeTdSJvK0=; b=WxelidKmpzXTYIsk84qL69zcZq6LheYXTmIHy5muO5WEJYa2hT7RsYxVQKDJPOE9O3 BNtVwPDcyS0cHvi0Apm0HDBd606exGODawT/tokUJ6k2Q5kx/h1mkZu31eH4fiMUst5A sbiXKo6pLc1nFZQGucohfWiQJe6fTpSJpvUt0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=ddx2bk4dlc+8Q4RTvbse7tth0FdEYv32aHA6SiUeoyp4zkfjDc30bJoc9b7xreb4va CVdipgsfAVhvmASB5mSPr7s2thCET657IEkZDN4raFF/U4FIBZA9mJPqZ0+fb2U8iDfg oQQcY8rCZvuiVq/N5ONwJFbjM2L0wBuKj4r0o= MIME-Version: 1.0 Received: by 10.231.154.132 with SMTP id o4mr629997ibw.68.1272967993739; Tue, 04 May 2010 03:13:13 -0700 (PDT) Received: by 10.231.183.146 with HTTP; Tue, 4 May 2010 03:13:13 -0700 (PDT) In-Reply-To: <4BDFD0C7.5070901@zalando.de> References: <4BDFD0C7.5070901@zalando.de> Date: Tue, 4 May 2010 06:13:13 -0400 Message-ID: Subject: Re: Score cutoff From: dc tech To: solr-user@lucene.apache.org Content-Type: text/plain; charset=ISO-8859-1 Michael, The cutoff filter would be very useful for us as well. We want to use it for more like this feature where only the top n similar docs tend to be reallt similar. On 5/4/10, Michael Kuhlmann wrote: > Am 03.05.2010 23:32, schrieb Satish Kumar: >> Hi, >> >> Can someone give clues on how to implement this feature? This is a very >> important requirement for us, so any help is greatly appreciated. >> > > Hi, > > I just implemented exactly this feature. You need to patch Solr to make > this work. > > We at Zalando are planning to set up a technology blog where we'll offer > such tools, but at the moment this is not done. I can make a patch out > of my work and send it to you today. > > Greetings, > Michael > >> On Tue, Apr 27, 2010 at 5:54 PM, Satish Kumar < >> satish.kumar.just.doit@gmail.com> wrote: >> >>> Hi, >>> >>> For some of our queries, the top xx (five or so) results are of very high >>> quality and results after xx are very poor. The difference in score for >>> the >>> high quality and poor quality results is high. For example, 3.5 for high >>> quality and 0.8 for poor quality. We want to exclude results with score >>> value that is less than 60% or so of the first result. Is there a filter >>> that does this? If not, can someone please give some hints on how to >>> implement this (we want to do this as part of solr relevance ranking so >>> that >>> the facet counts, etc will be correct). >>> >>> >>> Thanks, >>> Satish >>> >> > > -- Sent from my mobile device