lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Shalin Shekhar Mangar <shalinman...@gmail.com>
Subject Re: Duplicate docs when merging indices?
Date Mon, 23 Aug 2010 04:55:20 GMT
On Sat, Aug 21, 2010 at 5:56 PM, Andrew Clegg <andrew.clegg@gmail.com>wrote:

>
> Hi,
>
> First off, sorry about previous accidental post, had a sausage-fingered
> moment.
>
> Anyway...
>
> If I merge two indices with CoreAdmin, as detailed here...
>
> http://wiki.apache.org/solr/MergingSolrIndexes
>
> What happens to duplicate documents between the two? i.e. those that have
> the same unique key.
>
> What decides which copy takes precedence? Will documents get indexed
> multiple times, or will the second one just get skipped?
>
> Also, does the behaviour vary between CoreAdmin and IndexMergeTool? This
> thread from a couple of years ago:
>
> http://web.archiveorange.com/archive/v/AAfXfQIiBU7vyQBt6qdk
>
> suggests that IndexMergeTool can result in dupes, unless I'm
> misinterpreting.
>
>
Yes, it will result in duplicate docs. CoreAdmin and IndexMergeTool both use
the IndexWriter#addIndexes method so the behavior will be same.

-- 
Regards,
Shalin Shekhar Mangar.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message