lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ron Mayer <>
Subject Re: Prioritizing adjectives in solr search
Date Wed, 13 Oct 2010 03:23:29 GMT
Erick Erickson wrote:
> You can do some interesting things with payloads. You could index a
> particular value as the payload that identified the "kind" of word it was,
> where "kind" is something you define. Then at query time, you could
> boost depending on what part kind of word you identified it as in both
> the query and at indexing time.
> But I can't even imagine how one would go about supporting this in a
> general search engine. This kind of thing seems far too domain
> specific.....

Well, the "pf2" and "pf3" parameters in edismax come pretty close.

For example, for the search query "red baseball cap black leather jacket",
a "pf2" with no "phrase slop", combined with a "pf2" with a "phrase slop of 3"
will do a pretty good job at finding "red caps" and "black jackets"
and "baseball caps" and "leather jackets" before it'll find
"red baseball jackets" and "leather caps".

All it depended on is the convention that in english someone'll probably
put adjectives before nouns in both the query and the document's text.

The one annoyance is that I think the phrase slop doesn't care much
about the order of words......

> On Sun, Oct 10, 2010 at 8:50 PM, Ron Mayer <> wrote:
>> Walter Underwood wrote:
>>> I think this is a bad idea. The tf.idf algorithm will already put a
>> higher weight on "hammers" than on "blue", because "hammers" will be more
>> rare than "blue". Plus, you are making huge assumptions about the queries.
>> In a search for "Canon camera", "Canon" is an adjective, but it is the
>> important part of the query.
>>> Have you looked at your query logs and which queries are successful and
>> which are not?
>>> Don't make radical changes like this unless you can justify them from the
>> logs.
>> The one radical change I'd like in the area of adjectives in noun clauses
>> is if
>> more weight were put when the adjectives apply to the appropriate noun.
>> For example, a search for:
>>   'red baseball cap black leather jacket'
>> should find a doc with "the guy wore a red cap, blue jeans, and a leather
>> jacket"
>> before one that says "the guy wore a black cap, leather pants, and a red
>> jacket".
>> The closest I've come at doing this was to use a variety of "phrase slop"
>> boosts simultaneously - so that "red [any_few_words] cap" "baseball cap"
>> "leather jacket", "black [any_few_words] jacket" all add boosts to the
>> score.
>>> wunder
>>> On Oct 4, 2010, at 8:38 PM, Otis Gospodnetic wrote:
>>>> Hi,
>>>> If you want "blue" to be used in search, then you should not treat it as
>> a
>>>> stopword.
>>>> Re payloads:
>>>> and (even
>> better, look at
>>>> hit #1)
>>>> Otis
>>>> ----
>>>> Sematext :: :: Solr - Lucene - Nutch
>>>> Lucene ecosystem search ::
>>>> ----- Original Message ----
>>>>> From: Hasnain <>
>>>>> To:
>>>>> Sent: Mon, October 4, 2010 9:50:46 AM
>>>>> Subject: Re: Prioritizing advectives in solr search
>>>>> Hi Otis,
>>>>>         Thank you for replying,  unfortunately Im unable to fully grasp
>> what
>>>>> you are trying to say, can you  please elaborate what is payload with
>>>>> adjective terms?
>>>>> also Im using  stopwords.txt to stop adjectives, adverbs and verbs, now
>> when
>>>>> I search for  "Blue hammers", solr searches for "blue hammers" and
>> "hammers"
>>>>> but not  "blue", but the problem here is user can also search for just
>>>>> "Blue", then it  wont search for anything...
>>>>> any suggestions on this??
>>>>> --
>>>>> View  this message in context:
>>>>> Sent  from the Solr - User mailing list archive at

View raw message