lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Walter Underwood <wunderw...@netflix.com>
Subject Re: Keyword Density
Date Mon, 01 Jun 2009 19:20:22 GMT
That is the normal relevance scoring formula in Solr and Lucene.
It is a bit fancier than that, but you don't have to do anything
special to get that behavior.

Solr also uses the inverse document frequency (rarity) of each
word for weighting.

Look up "tf.idf" for more info.

wunder

On 6/1/09 11:46 AM, "Alex Shevchenko" <caezaris@gmail.com> wrote:

> Something like that. Just not '> N times' but '<numbers of foo
> appears>/<total number of words> > <some value>'
> 
> On Mon, Jun 1, 2009 at 21:00, Otis Gospodnetic
> <otis_gospodnetic@yahoo.com>wrote:
> 
>> 
>> Hi Alex,
>> 
>> Could you please provide an example of this?  Are you looking to do
>> something like "find all docs that match name:foo and where foo appears > N
>> times (in the name field) in the matching document"?
>> 
>>  Otis
>> --
>> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>> 
>> 
>> 
>> ----- Original Message ----
>>> From: Alex Shevchenko <caezaris@gmail.com>
>>> To: solr-user@lucene.apache.org
>>> Sent: Monday, June 1, 2009 1:32:49 PM
>>> Subject: Re: Keyword Density
>>> 
>>> HI All,
>>> 
>>> Is there a way to perform filtering based on keyword density?
>>> 
>>> Thanks
>>> 
>>> --
>>> Alex Shevchenko



Mime
View raw message