lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ryan McKinley <ryan...@gmail.com>
Subject negative boosting / analysis?
Date Tue, 01 Jul 2008 15:41:58 GMT
Hi-

I'm working on a case where we have review text that may include words  
that describe what the item is *not*.

Given the text "the kitten is not clean", searching for "clean" should  
not include (at least at the top) the kitten.

The approach I am considering is to copy the text to a negation field  
and do simple heuristic analysis in a TokenFilter.  This analysis  
would only keep tokens for words that follow "not", then we could add  
a negative boost for this field:
   title^2 content^1 negation^0.1

Does this seem like a reasonable approach?  Any other ideas /  
suggestions / pointers?

thanks
ryan

Mime
View raw message