lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: Solr streaming questions
Date Wed, 01 Nov 2017 16:16:21 GMT
Perhaps if you bothered to explain your use-case we could suggest alternatives.

Streaming is built to handle very large result sets in a
divide-and-conquer manner,
thus the ability to specify worker nodes each of which handles a
sub-set of the results.

Partitioning the output streams requires a way to bucket the results
from multiple sources
to workers such that all the documents that fall into buckets can be
routed to the
same worker. There may be many sources (think shards) and many replicas.

Score is unsuitable for such bucketing. You're simply trying to use
streaming for
a use-case it was not designed for.

You have two choices here.
> use streaming as it was intended,
> use cursorMark for processing in batches.

Best,
Erick

On Wed, Nov 1, 2017 at 8:33 AM, Webster Homer <webster.homer@sial.com> wrote:
> I know that /select supports score. However, I don't want to have to page
> the results, I want to use stream to stream the results of a search, but I
> cannot sort by the relevancy of the result. This seems like a MAJOR deficit
> for the streaming API
>
> /select wants to do paging which in my case I don't want.
>
> This all seems fairly arbitrary to me and a questionable limitation for
> /export, especially since /export has a search facility
>
> On Tue, Oct 31, 2017 at 7:46 PM, Joel Bernstein <joelsolr@gmail.com> wrote:
>
>> It is not possible to use score with the /export handler. The /export
>> handler currently only supports sorting by fields.
>>
>> You can sort by score using the default /select handler.
>>
>> Joel Bernstein
>> http://joelsolr.blogspot.com/
>>
>> On Tue, Oct 31, 2017 at 1:50 PM, Webster Homer <webster.homer@sial.com>
>> wrote:
>>
>> > I have a potential use case for solr searching via streaming expressions.
>> > I am currently using solr 6.2.0, but we will soon be upgrading to the
>> 7.1.0
>> > version.
>> >
>> > I started testing out searching using streaming expressions.
>> > 1. If I use an alias instead of a collection name it fails. I see that
>> > there is a Jira, SOLR-7377. Is this fixed in 7.1.0?
>> >
>> > 2. If I try to sort the results by score, it gives me an undefined field
>> > error. So it seems that streaming searches must not return values ordered
>> > by relevancy?
>> > This is a stopper for us if it has not been addressed.
>> >
>> > This is my query:
>> > search(test-catalog-product-170724,defType="edismax",q="
>> > 7732-18-5",qf="searchmv_cas_number",mm="2<-12%",fl="id_record_spec,
>> > id_s, score",sort="score desc",qt="/export")
>> >
>> > This is the error:
>> > "EXCEPTION": "java.util.concurrent.ExecutionException:
>> > java.io.IOException:
>> > -->
>> > http://141.247.245.207:8983/solr/test-catalog-product-
>> > 170724_shard2_replica1/:org.apache.solr.common.SolrException:
>> > undefined field: \"score\"",
>> >
>> > I could not find a Jira for this issue. Is it not possible to retrieve
>> the
>> > results ordered relevancy (score desc)?
>> >
>> > Seems kind of limiting
>> >
>> > --
>> >
>> >
>> > This message and any attachment are confidential and may be privileged or
>> > otherwise protected from disclosure. If you are not the intended
>> recipient,
>> > you must not copy this message or attachment or disclose the contents to
>> > any other person. If you have received this transmission in error, please
>> > notify the sender immediately and delete the message and any attachment
>> > from your system. Merck KGaA, Darmstadt, Germany and any of its
>> > subsidiaries do not accept liability for any omissions or errors in this
>> > message which may arise as a result of E-Mail-transmission or for damages
>> > resulting from any unauthorized changes of the content of this message
>> and
>> > any attachment thereto. Merck KGaA, Darmstadt, Germany and any of its
>> > subsidiaries do not guarantee that this message is free of viruses and
>> does
>> > not accept liability for any damages caused by any virus transmitted
>> > therewith.
>> >
>> > Click http://www.emdgroup.com/disclaimer to access the German, French,
>> > Spanish and Portuguese versions of this disclaimer.
>> >
>>
>
> --
>
>
> This message and any attachment are confidential and may be privileged or
> otherwise protected from disclosure. If you are not the intended recipient,
> you must not copy this message or attachment or disclose the contents to
> any other person. If you have received this transmission in error, please
> notify the sender immediately and delete the message and any attachment
> from your system. Merck KGaA, Darmstadt, Germany and any of its
> subsidiaries do not accept liability for any omissions or errors in this
> message which may arise as a result of E-Mail-transmission or for damages
> resulting from any unauthorized changes of the content of this message and
> any attachment thereto. Merck KGaA, Darmstadt, Germany and any of its
> subsidiaries do not guarantee that this message is free of viruses and does
> not accept liability for any damages caused by any virus transmitted
> therewith.
>
> Click http://www.emdgroup.com/disclaimer to access the German, French,
> Spanish and Portuguese versions of this disclaimer.

Mime
View raw message