lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-8586) Implement hash over all documents to check for shard synchronization
Date Fri, 22 Jan 2016 17:20:39 GMT

    [ https://issues.apache.org/jira/browse/SOLR-8586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15112725#comment-15112725
] 

Yonik Seeley commented on SOLR-8586:
------------------------------------

A bloom filter would allow one to estimate (with a known error) if a specific version is contained
within the index.  But it's not clear how we would use that info.  All we need here is to
know if two indexes are in sync or not.

I was thinking of something as simple as
{code}
h = 0
for version in versions:
  h += hash(version)
{code}


> Implement hash over all documents to check for shard synchronization
> --------------------------------------------------------------------
>
>                 Key: SOLR-8586
>                 URL: https://issues.apache.org/jira/browse/SOLR-8586
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Yonik Seeley
>
> An order-independent hash across all of the versions in the index should suffice.  The
hash itself is pretty easy, but we need to figure out when/where to do this check (for example,
I think PeerSync is currently used in multiple contexts and this check would perhaps not be
appropriate for all PeerSync calls?)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message