lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alexandre Rafalovitch <arafa...@gmail.com>
Subject Re: mark solr documents as duplicates on hashing the combination of some fields
Date Wed, 22 Oct 2014 20:11:03 GMT
This is the "dark art" knowledge. I've updated the Reference Guide
comment with the request to have this text included, but it would also
be nice to have it as part of the Javadoc for the Factory or the URP
itself. Maybe WIKI as well. I can see not getting this part causing
somebody a lot of headache.

Regards,
   Alex.

On 22 October 2014 14:17, Chris Hostetter <hossman_lucene@fucit.org> wrote:
> the atomic updates are processed as part of the
> DistributedUpdateProcessor (so they execute on the leader and work with
> optimistic concurrency) but that means if you have the
> SignatureUpdateProcessorFactory configured before the
> DistributedUpdateProcessorFactory it could compute a signature based on
> the raw doc you send (with the updatecommands) instead of the "real" doc
> with the updates applied.
>
> for a situation where you want the signatureField to *be* the uniqueKey,
> then you kind of have to put SignatureUpdateProcessorFactory before
> DistributedUpdateProcessorFactory

Mime
View raw message