lucene-solr-user mailing list archives

From Ali Nazemian <alinazem...@gmail.com>
Subject Re: mark solr documents as duplicates on hashing the combination of some fields
Date Wed, 22 Oct 2014 11:27:08 GMT
The problem arises when I partially update some fields of a document: the
signature becomes useless, even if the updated fields are not among those
used for the signatureField!
Regards.

On Wed, Oct 22, 2014 at 12:44 AM, Chris Hostetter <hossman_lucene@fucit.org>
wrote:

>
> you can still use the SignatureUpdateProcessorFactory for your use case,
> just don't configure the signatureField to be the same as your uniqueKey
> field.
>
> configure some other field name (e.g. "signature") instead.
>
>
> : Date: Tue, 14 Oct 2014 12:08:26 +0330
> : From: Ali Nazemian <alinazemian@gmail.com>
> : Reply-To: solr-user@lucene.apache.org
> : To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org>
> : Subject: mark solr documents as duplicates on hashing the combination of
> :     some fields
> :
> : Dear all,
> : Hi,
> : I was wondering how I can mark some documents as duplicates (just marking
> : them for future use, not deleting them) based on a hash of the combination
> : of some fields. Suppose I have two fields named "url" and "title". I want
> : to create a hash based on url+title and store it in another field named
> : "signature". If I do that using Solr dedup, it results in the duplicate
> : documents being deleted, so it is not applicable to my situation. Thank
> : you very much.
> : Best regards.
> :
> : --
> : A.Nazemian
> :
>
> -Hoss
> http://www.lucidworks.com/
>



-- 
A.Nazemian
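
For readers following the thread, the idea in the original question (hash url+title into a separate "signature" field and only mark duplicates, never delete them) can be sketched outside Solr as below. This is a minimal illustration, not Solr's implementation: the helper names compute_signature and mark_duplicates, the "is_duplicate" and "views" fields, the sample documents, and the choice of MD5 are all assumptions made for the example. The last few lines show one possible reading of the partial-update complaint, assuming the signature were recomputed from only the fields present in the update.

```python
import hashlib

# Fields named in the original question; everything else here is hypothetical.
SIGNATURE_FIELDS = ("url", "title")


def compute_signature(doc, fields=SIGNATURE_FIELDS):
    """Return an MD5 hex digest over the chosen fields, in a fixed order."""
    digest = hashlib.md5()
    for name in fields:
        value = doc.get(name, "")  # a missing field hashes as an empty string
        digest.update(value.encode("utf-8"))
        digest.update(b"\x00")  # separator so ("ab", "c") differs from ("a", "bc")
    return digest.hexdigest()


def mark_duplicates(docs):
    """Set doc["signature"] and doc["is_duplicate"] in place; nothing is deleted."""
    seen = set()
    for doc in docs:
        sig = compute_signature(doc)
        doc["signature"] = sig
        doc["is_duplicate"] = sig in seen  # the first occurrence is not a duplicate
        seen.add(sig)
    return docs


docs = [
    {"id": "1", "url": "http://example.com/a", "title": "A"},
    {"id": "2", "url": "http://example.com/a", "title": "A"},
    {"id": "3", "url": "http://example.com/b", "title": "B"},
]
mark_duplicates(docs)

# Assumed failure mode: a partial update carries only the changed field, so a
# signature computed from that payload alone no longer reflects url+title.
partial_update = {"id": "1", "views": 10}
full_sig = docs[0]["signature"]
partial_sig = compute_signature(partial_update)
```

Because the signature lives in its own field rather than in the unique key, all three documents are kept and the second is only flagged. Under the stated assumption, partial_sig differs from full_sig even though "views" is not one of the signature fields.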
