lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Simone Gianni (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-4414) MoreLikeThis on a shard finds no interesting terms if the document queried is not in that shard
Date Tue, 29 Apr 2014 12:51:25 GMT

    [ https://issues.apache.org/jira/browse/SOLR-4414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13984261#comment-13984261
] 

Simone Gianni commented on SOLR-4414:
-------------------------------------

Managed to work around this bug using the TermVectorComponent (which is properly sharded)
to fetch term vectors and adapting the query generation code in Lucene's MoreLikeThis component
to create the MLT query client side. It's two calls (one for the term vectors and then one
to perform the MLT query), but it works and is totally sharded.

Just a hint on how to eventually work it around. 

> MoreLikeThis on a shard finds no interesting terms if the document queried is not in
that shard
> -----------------------------------------------------------------------------------------------
>
>                 Key: SOLR-4414
>                 URL: https://issues.apache.org/jira/browse/SOLR-4414
>             Project: Solr
>          Issue Type: Bug
>          Components: MoreLikeThis, SolrCloud
>    Affects Versions: 4.1
>            Reporter: Colin Bartolome
>
> Running a MoreLikeThis query in a cloud works only when the document being queried exists
in whatever shard serves the request. If the document is not present in the shard, no "interesting
terms" are found and, consequently, no matches are found.
> h5. Steps to reproduce
> * Edit example/solr/collection1/conf/solrconfig.xml and add this line, with the rest
of the request handlers:
> {code:xml}
> <requestHandler name="/mlt" class="solr.MoreLikeThisHandler" />
> {code}
> * Follow the [simplest SolrCloud example|http://wiki.apache.org/solr/SolrCloud#Example_A:_Simple_two_shard_cluster]
to get two shards running.
> * Hit this URL: [http://localhost:8983/solr/collection1/mlt?mlt.fl=includes&q=id:3007WFP&mlt.match.include=false&mlt.interestingTerms=list&mlt.mindf=1&mlt.mintf=1]
> * Compare that output to that of this URL: [http://localhost:7574/solr/collection1/mlt?mlt.fl=includes&q=id:3007WFP&mlt.match.include=false&mlt.interestingTerms=list&mlt.mindf=1&mlt.mintf=1]
> The former URL will return a result and list some interesting terms. The latter URL will
return no results and list no interesting terms. It will also show this odd XML element:
> {code:xml}
> <null name="response"/>
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message