lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Webster Homer <webster.ho...@sial.com>
Subject Re: Solr streaming questions
Date Thu, 02 Nov 2017 17:57:01 GMT
This is a new project, and it's requirements are not yet completely
defined. The system we are looking at building is an automated B2B system
where a customer's system calls in with queries and we return products,
skus, pricing and availability to the caller.

As it turns out relevancy will not be an issue for this system as the
queries are all pretty simple, and users won't see or care about the
relevancy of the hits returned. We have a current system, but it fails to
scale, not from the search, but from calls to the pricing systems. We are
currently in  the early stages of designing this new system. It seems
likely that we can use streaming. I'm sure we could make /select work as
well. The fact that streaming supports joins between collections is
potentially useful.

On Wed, Nov 1, 2017 at 11:16 AM, Erick Erickson <erickerickson@gmail.com>
wrote:

> Perhaps if you bothered to explain your use-case we could suggest
> alternatives.
>
> Streaming is built to handle very large result sets in a
> divide-and-conquer manner,
> thus the ability to specify worker nodes each of which handles a
> sub-set of the results.
>
> Partitioning the output streams requires a way to bucket the results
> from multiple sources
> to workers such that all the documents that fall into buckets can be
> routed to the
> same worker. There may be many sources (think shards) and many replicas.
>
> Score is unsuitable for such bucketing. You're simply trying to use
> streaming for
> a use-case it was not designed for.
>
> You have two choices here.
> > use streaming as it was intended,
> > use cursorMark for processing in batches.
>
> Best,
> Erick
>
> On Wed, Nov 1, 2017 at 8:33 AM, Webster Homer <webster.homer@sial.com>
> wrote:
> > I know that /select supports score. However, I don't want to have to page
> > the results, I want to use stream to stream the results of a search, but
> I
> > cannot sort by the relevancy of the result. This seems like a MAJOR
> deficit
> > for the streaming API
> >
> > /select wants to do paging which in my case I don't want.
> >
> > This all seems fairly arbitrary to me and a questionable limitation for
> > /export, especially since /export has a search facility
> >
> > On Tue, Oct 31, 2017 at 7:46 PM, Joel Bernstein <joelsolr@gmail.com>
> wrote:
> >
> >> It is not possible to use score with the /export handler. The /export
> >> handler currently only supports sorting by fields.
> >>
> >> You can sort by score using the default /select handler.
> >>
> >> Joel Bernstein
> >> http://joelsolr.blogspot.com/
> >>
> >> On Tue, Oct 31, 2017 at 1:50 PM, Webster Homer <webster.homer@sial.com>
> >> wrote:
> >>
> >> > I have a potential use case for solr searching via streaming
> expressions.
> >> > I am currently using solr 6.2.0, but we will soon be upgrading to the
> >> 7.1.0
> >> > version.
> >> >
> >> > I started testing out searching using streaming expressions.
> >> > 1. If I use an alias instead of a collection name it fails. I see that
> >> > there is a Jira, SOLR-7377. Is this fixed in 7.1.0?
> >> >
> >> > 2. If I try to sort the results by score, it gives me an undefined
> field
> >> > error. So it seems that streaming searches must not return values
> ordered
> >> > by relevancy?
> >> > This is a stopper for us if it has not been addressed.
> >> >
> >> > This is my query:
> >> > search(test-catalog-product-170724,defType="edismax",q="
> >> > 7732-18-5",qf="searchmv_cas_number",mm="2<-12%",fl="id_record_spec,
> >> > id_s, score",sort="score desc",qt="/export")
> >> >
> >> > This is the error:
> >> > "EXCEPTION": "java.util.concurrent.ExecutionException:
> >> > java.io.IOException:
> >> > -->
> >> > http://141.247.245.207:8983/solr/test-catalog-product-
> >> > 170724_shard2_replica1/:org.apache.solr.common.SolrException:
> >> > undefined field: \"score\"",
> >> >
> >> > I could not find a Jira for this issue. Is it not possible to retrieve
> >> the
> >> > results ordered relevancy (score desc)?
> >> >
> >> > Seems kind of limiting
> >> >
> >> > --
> >> >
> >> >
> >> > This message and any attachment are confidential and may be
> privileged or
> >> > otherwise protected from disclosure. If you are not the intended
> >> recipient,
> >> > you must not copy this message or attachment or disclose the contents
> to
> >> > any other person. If you have received this transmission in error,
> please
> >> > notify the sender immediately and delete the message and any
> attachment
> >> > from your system. Merck KGaA, Darmstadt, Germany and any of its
> >> > subsidiaries do not accept liability for any omissions or errors in
> this
> >> > message which may arise as a result of E-Mail-transmission or for
> damages
> >> > resulting from any unauthorized changes of the content of this message
> >> and
> >> > any attachment thereto. Merck KGaA, Darmstadt, Germany and any of its
> >> > subsidiaries do not guarantee that this message is free of viruses and
> >> does
> >> > not accept liability for any damages caused by any virus transmitted
> >> > therewith.
> >> >
> >> > Click http://www.emdgroup.com/disclaimer to access the German,
> French,
> >> > Spanish and Portuguese versions of this disclaimer.
> >> >
> >>
> >
> > --
> >
> >
> > This message and any attachment are confidential and may be privileged or
> > otherwise protected from disclosure. If you are not the intended
> recipient,
> > you must not copy this message or attachment or disclose the contents to
> > any other person. If you have received this transmission in error, please
> > notify the sender immediately and delete the message and any attachment
> > from your system. Merck KGaA, Darmstadt, Germany and any of its
> > subsidiaries do not accept liability for any omissions or errors in this
> > message which may arise as a result of E-Mail-transmission or for damages
> > resulting from any unauthorized changes of the content of this message
> and
> > any attachment thereto. Merck KGaA, Darmstadt, Germany and any of its
> > subsidiaries do not guarantee that this message is free of viruses and
> does
> > not accept liability for any damages caused by any virus transmitted
> > therewith.
> >
> > Click http://www.emdgroup.com/disclaimer to access the German, French,
> > Spanish and Portuguese versions of this disclaimer.
>

-- 


This message and any attachment are confidential and may be privileged or 
otherwise protected from disclosure. If you are not the intended recipient, 
you must not copy this message or attachment or disclose the contents to 
any other person. If you have received this transmission in error, please 
notify the sender immediately and delete the message and any attachment 
from your system. Merck KGaA, Darmstadt, Germany and any of its 
subsidiaries do not accept liability for any omissions or errors in this 
message which may arise as a result of E-Mail-transmission or for damages 
resulting from any unauthorized changes of the content of this message and 
any attachment thereto. Merck KGaA, Darmstadt, Germany and any of its 
subsidiaries do not guarantee that this message is free of viruses and does 
not accept liability for any damages caused by any virus transmitted 
therewith.

Click http://www.emdgroup.com/disclaimer to access the German, French, 
Spanish and Portuguese versions of this disclaimer.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message