lucene-solr-user mailing list archives

From Alexandre Rafalovitch <arafa...@gmail.com>
Subject Re: solr dedup on specific fields
Date Tue, 01 Jul 2014 13:47:25 GMT
Well, it's implemented in SignatureUpdateProcessorFactory. Worst case,
you can clone that code and add your preserve-field functionality.
Could even be a nice contribution.
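
To make the "preserve-field" idea concrete, here is a minimal sketch of the merge step only. It is not Solr code and not how SignatureUpdateProcessorFactory works internally: plain Maps stand in for the old and new documents, and the class name, method name, and the "read" field list are all hypothetical. It assumes the existing document has already been looked up by its unique key.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, not part of Solr: plain Maps stand in for documents.
class PreserveFieldMerge {

    // Builds the document to index: every field of the incoming document,
    // plus each preserved field copied over from the existing document
    // (only when the existing document actually has that field).
    static Map<String, Object> merge(Map<String, Object> existing,
                                     Map<String, Object> incoming,
                                     Set<String> preservedFields) {
        Map<String, Object> result = new HashMap<>(incoming);
        if (existing == null) {
            return result; // no duplicate found, index the new document as-is
        }
        for (String field : preservedFields) {
            if (existing.containsKey(field)) {
                result.put(field, existing.get(field));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, Object> oldDoc = new HashMap<>();
        oldDoc.put("url", "http://example.com/a");
        oldDoc.put("content", "old text");
        oldDoc.put("read", true);

        Map<String, Object> newDoc = new HashMap<>();
        newDoc.put("url", "http://example.com/a");
        newDoc.put("content", "new text");

        // "content" comes from newDoc, "read" is carried over from oldDoc.
        System.out.println(merge(oldDoc, newDoc, Set.of("read")));
    }
}
```

In a cloned update processor, a step like this would presumably run before the document is passed down the chain; fetching the existing document from the index is the part this sketch leaves out.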

Regards,
   Alex.

Personal website: http://www.outerthoughts.com/
Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency


On Tue, Jul 1, 2014 at 6:50 PM, Ali Nazemian <alinazemian@gmail.com> wrote:
> Any suggestion would be appreciated.
> Regards.
>
>
> On Mon, Jun 30, 2014 at 2:49 PM, Ali Nazemian <alinazemian@gmail.com> wrote:
>
>> Hi,
>> I am using Solr 4.8 to index web pages crawled by Nutch. I know that
>> Solr's deduplication works on the uniqueKey field, so I set that to the
>> URL field. Everything works, except that when a duplicate is detected I
>> do not want Solr to discard all fields of the old document; I want some
>> fields to remain unchanged. For example, assume a document has a field
>> called "read" with the Boolean value "true". I want the new document to
>> overwrite every field except this one. Is that possible? How?
>> Regards.
>>
>> --
>> A.Nazemian
>>
>
>
>
> --
> A.Nazemian
