lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ganesh Sethuraman <ganeshmail...@gmail.com>
Subject Response Time Diff between Collection with low deletes
Date Sat, 09 May 2020 22:22:21 GMT
Hi Solr Users,

We use SolrCloud 7.2.1 with 2 Solr nodes in AWS. The shard size for these
collections does not exceed more than 5G. They have approximately 16 shards
with 2 replicas.  We do deletes (ByQuery) as well large updates in some of
these Solr collections. We are seeing slower filter queries (95% > 10secs)
on these collections in production, same collections, and same queries in
our lower environment with similar setup and configuration we seeing much
better performance (<100ms).  These are NRT indexes, with daily batch
updates only.

We see a difference however in the lower environment; that we don't see
updates or deletes, we see in Segment Info for each of the Solr code there
are ZERO delete percentages.  Could this be the reason for the faster query
response time in our lower environment? in our production environment, we
are seeing about 30-32% of deletes in each core shard/replica pair.

Does this segment delete % has any correlation with query response time? We
do delete by Query in a loop. Also updates.
If it is so, do you suggest to try to do Optimize or expungeDelete at the
end every day?
Do we need to expunge delete after each delete ByQuery or do it once at the
end?

Regards,
Ganesh

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message