lucene-dev mailing list archives

From "Per Steffensen (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SOLR-6810) Faster searching limited but high rows across many shards all with many hits
Date Mon, 01 Dec 2014 15:08:13 GMT
Per Steffensen created SOLR-6810:
------------------------------------

             Summary: Faster searching limited but high rows across many shards all with many hits
                 Key: SOLR-6810
                 URL: https://issues.apache.org/jira/browse/SOLR-6810
             Project: Solr
          Issue Type: Improvement
          Components: search
            Reporter: Per Steffensen


Searching "limited but high rows across many shards all with many hits" is slow.
For example:
* Query from outside client: q=something&rows=1000
* This results in sub-requests to each shard along these lines:
** 1) q=something&rows=1000&fl=id,score
** 2) Request the full documents with ids in the global-top-1000 found among the top-1000
from each shard
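
The two sub-request phases above can be sketched roughly as follows. This is only an illustration of the flow described in this issue, not Solr's actual code; the function names and the in-memory hit lists are hypothetical stand-ins for real shard requests.

```python
import heapq
from typing import Dict, List, Tuple

Hit = Tuple[str, float]  # (document id, score), i.e. what fl=id,score returns


def shard_top_k(shard_hits: List[Hit], rows: int) -> List[Hit]:
    """Phase 1 on a single shard: its `rows` best hits by score."""
    return heapq.nlargest(rows, shard_hits, key=lambda hit: hit[1])


def global_top_k(per_shard: Dict[str, List[Hit]], rows: int) -> List[Tuple[str, str, float]]:
    """Merge the per-shard top lists into the global top `rows`."""
    candidates = [
        (shard, doc_id, score)
        for shard, hits in per_shard.items()
        for doc_id, score in hits
    ]
    return heapq.nlargest(rows, candidates, key=lambda c: c[2])


def ids_to_fetch(winners: List[Tuple[str, str, float]]) -> Dict[str, List[str]]:
    """Phase 2 input: for each shard, the ids whose full documents are needed."""
    by_shard: Dict[str, List[str]] = {}
    for shard, doc_id, _score in winners:
        by_shard.setdefault(shard, []).append(doc_id)
    return by_shard


# Toy data (hypothetical): two shards, rows=2 instead of rows=1000.
all_hits = {
    "shard1": [("a", 0.9), ("b", 0.1), ("c", 0.5)],
    "shard2": [("d", 0.8), ("e", 0.7)],
}
rows = 2
phase1 = {shard: shard_top_k(hits, rows) for shard, hits in all_hits.items()}
winners = global_top_k(phase1, rows)
fetch_plan = ids_to_fetch(winners)
```

Note that every shard has to produce and ship its own full top list in phase 1, even though most of those entries do not end up in the global top.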

What the subject means:
* "limited but high rows" means 1000 in the example above
* "many shards" means 200-1000 in our case
* "all with many hits" means that each of the shards has a significant number of hits for
the query
The problem grows with each of the three factors above.
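
As a rough illustration of how the three factors combine, the sketch below counts the (id, score) entries that phase 1 returns across all shards and how many of them are thrown away after the global merge. The function name and the hit counts per shard are made up for the example. It only does the counting; it makes no claim about where the time actually goes.

```python
from typing import List, Tuple


def phase1_load(hits_per_shard: List[int], rows: int) -> Tuple[int, int]:
    """Return (entries returned by all shards, entries discarded after the merge).

    Each shard returns at most `rows` entries, fewer if it has fewer hits.
    Only the global top `rows` survive the merge.
    """
    returned = sum(min(hits, rows) for hits in hits_per_shard)
    kept = min(rows, returned)
    return returned, returned - kept


# Hypothetical numbers: every shard has 5000 hits, rows=1000.
small_cluster = phase1_load([5000] * 200, 1000)   # 200 shards
large_cluster = phase1_load([5000] * 1000, 1000)  # 1000 shards
few_hits = phase1_load([10, 20], 1000)            # shards with few hits
```

With 1000 shards that all have at least 1000 hits, a rows=1000 query returns 1,000,000 entries in phase 1, of which 999,000 are discarded. When the shards have few hits, the same query returns and discards far less.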

Doing such a query on our system takes between 5 minutes and 1 hour, depending on a lot of things.
It ought to be much faster, so let's make it so.

For further details, see the mail thread "Slow searching limited but high rows across many shards
all with high hits" started 13/11-2014 on dev@lucene.apache.org



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


