lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Fran├žois Schiettecatte <fschietteca...@gmail.com>
Subject Re: Indexing only on change
Date Sat, 24 Nov 2012 20:40:37 GMT
I would create a hash of the document content and store that in SOLR along with any document
info you wish to store. When a document is presented for indexing, hash that and compare to
the hash of the stored document, index if they are different and skip if they are not.

Fran├žois
 

On Nov 24, 2012, at 3:30 PM, Pratyul Kapoor <pratyulk@gmail.com> wrote:

> Hi,
> 
> I just discovered that solr while editing a particular field of a document,
> removes the entire document and recreates.
> 
> I have a list of 1000s of documents to be indexed. But I am aware that only
> some of those documents would be changed and rest all would already be
> there. Is there any way, I can check whether the incoming and already
> existing document is same, and there is no need of indexing it again.
> 
> Pratyul


Mime
View raw message