lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Rennie" <>
Subject Re: diversity in results
Date Mon, 04 Aug 2008 22:17:28 GMT
Does the MLT handler simply select a few high tfidf terms from the doc and
use them as a query?  Sounds like a useful tool.  Do you know anything about
relevant performance issues?  I noticed that the Solr MoreLikeThis wiki page
recommends turning on TermVectors for corresponding fields.  Can lucene not
easily return term counts for a document with the standard indexing b/c it's
term-based (i.e. "inverted").  Does TermVectors=true cause solr/lucene to
store an additional doc-based index?



On Mon, Aug 4, 2008 at 5:06 PM, Brian Whitman <>wrote:

> not out of the box, but I would use the mlt handler on the first result and
> remove all the ones that appear in both the MLT and query response.
> B
Jason Rennie
Head of Machine Learning Technologies, StyleFeeder
Samantha's blog & pictures:

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message