lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Rennie" <jren...@gmail.com>
Subject Re: diversity in results
Date Mon, 04 Aug 2008 22:17:28 GMT
Does the MLT handler simply select a few high tfidf terms from the doc and
use them as a query?  Sounds like a useful tool.  Do you know anything about
relevant performance issues?  I noticed that the Solr MoreLikeThis wiki page
recommends turning on TermVectors for corresponding fields.  Can lucene not
easily return term counts for a document with the standard indexing b/c it's
term-based (i.e. "inverted").  Does TermVectors=true cause solr/lucene to
store an additional doc-based index?

Thanks,

Jason

On Mon, Aug 4, 2008 at 5:06 PM, Brian Whitman <brian.whitman@variogr.am>wrote:

> not out of the box, but I would use the mlt handler on the first result and
> remove all the ones that appear in both the MLT and query response.
>
> B
>
>
-- 
Jason Rennie
Head of Machine Learning Technologies, StyleFeeder
http://www.stylefeeder.com/
Samantha's blog & pictures: http://samanthalyrarennie.blogspot.com/

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message