lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ali Nazemian <alinazem...@gmail.com>
Subject Re: mark solr documents as duplicates on hashing the combination of some fields
Date Wed, 22 Oct 2014 14:03:59 GMT
I meant signature will be broken. For example suppose the destination of
hash function for signature fields are "sig". After each partial update it
becomes: "0000000000"!

On Wed, Oct 22, 2014 at 2:59 PM, Alexandre Rafalovitch <arafalov@gmail.com>
wrote:

> What do you mean by 'useless' specifically on the business level?
>
> Regards,
>      Alex
> On 22/10/2014 7:27 am, "Ali Nazemian" <alinazemian@gmail.com> wrote:
>
> > The problem is when I partially update some fields of document. The
> > signature becomes useless! Even if the updated fields are not included in
> > the signatureField!
> > Regards.
> >
> > On Wed, Oct 22, 2014 at 12:44 AM, Chris Hostetter <
> > hossman_lucene@fucit.org>
> > wrote:
> >
> > >
> > > you can still use the SignatureUpdateProcessorFactory for your usecase,
> > > just don't configure teh signatureField to be the same as your
> uniqueKey
> > > field.
> > >
> > > configure some othe fieldname (ie "signature") instead.
> > >
> > >
> > > : Date: Tue, 14 Oct 2014 12:08:26 +0330
> > > : From: Ali Nazemian <alinazemian@gmail.com>
> > > : Reply-To: solr-user@lucene.apache.org
> > > : To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org>
> > > : Subject: mark solr documents as duplicates on hashing the combination
> > of
> > > some
> > > :     fields
> > > :
> > > : Dear all,
> > > : Hi,
> > > : I was wondering how can I mark some documents as duplicate (just
> > marking
> > > : for future usage not deleting) based on the hash combination of some
> > > : fields? Suppose I have 2 fields name "url" and "title" I want to
> create
> > > : hash based on url+title and send it to another field name
> "signature".
> > > If I
> > > : do that using solr dedup, it will be resulted to deleting duplicate
> > > : documents! So it is not applicable for my situation. Thank you very
> > much.
> > > : Best regards.
> > > :
> > > : --
> > > : A.Nazemian
> > > :
> > >
> > > -Hoss
> > > http://www.lucidworks.com/
> > >
> >
> >
> >
> > --
> > A.Nazemian
> >
>



-- 
A.Nazemian

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message