lucene-solr-user mailing list archives

From Gonzalo Rodriguez <GRodrig...@searchtechnologies.com>
Subject RE: Find duplicates
Date Tue, 02 Dec 2014 16:04:06 GMT
Have you tried using result grouping for your query? There are some very good examples in the
wiki:

https://wiki.apache.org/solr/FieldCollapsing
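
As a rough sketch of what that could look like from a client: the snippet below builds a grouped query with Solr's standard result-grouping parameters (group, group.field, group.limit) and then keeps only the groups that contain more than one document. The core URL, the unique key name "id", and the helper names are assumptions for illustration. It also assumes Name is a single-valued, untokenized field, and that the response uses the default grouped JSON layout.

```python
from urllib.parse import urlencode

# Hypothetical core URL; adjust to your own installation.
SOLR_SELECT_URL = "http://localhost:8983/solr/collection1/select"


def build_grouping_params(field, rows=100, group_limit=10):
    """Build request parameters that group all documents by `field`."""
    return {
        "q": "*:*",                 # match every document
        "group": "true",            # turn on result grouping
        "group.field": field,       # one group per distinct field value
        "group.limit": group_limit, # documents returned per group
        "rows": rows,               # number of groups returned
        "wt": "json",
    }


def duplicate_groups(response, field, id_field="id"):
    """Return {value: [doc ids]} for groups holding more than one document.

    `response` is the parsed JSON of a grouped Solr response.
    `id_field` is an assumed unique key name.
    """
    groups = response["grouped"][field]["groups"]
    return {
        g["groupValue"]: [d[id_field] for d in g["doclist"]["docs"]]
        for g in groups
        if g["doclist"]["numFound"] > 1
    }


query_url = SOLR_SELECT_URL + "?" + urlencode(build_grouping_params("Name"))
```

Note that the filtering on group size happens on the client here; grouping by itself returns every distinct value, including those that occur only once.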


Gonzalo

-----Original Message-----
From: Peter Kirk [mailto:pk@alpha-solutions.dk] 
Sent: Tuesday, December 02, 2014 9:58 AM
To: solr-user@lucene.apache.org
Subject: Find duplicates

Hi

Is it possible to formulate a Solr query that finds all documents sharing the same value
in a particular field?
Note that I don't know the value in advance; I just want to find all documents with duplicate values.

For example, I have 5 documents:

Doc1: field Name = Peter
Doc2: field Name = Jack
Doc3: field Name = Peter
Doc4: field Name = Paul
Doc5: field Name = Jack


If I executed the query, it would return Doc1 and Doc3 (both have Name = Peter), and Doc2
and Doc5 (both have Name = Jack). Doc4 (Paul) would not be returned.
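
To pin down the intended result, here is a small sketch in plain Python (not Solr) that applies the same rule to the five example documents: bucket the documents by the field value and keep only the buckets with more than one member. The function name find_duplicates is just an illustrative choice.

```python
from collections import defaultdict

# The five example documents as (document id, Name value) pairs.
docs = [
    ("Doc1", "Peter"),
    ("Doc2", "Jack"),
    ("Doc3", "Peter"),
    ("Doc4", "Paul"),
    ("Doc5", "Jack"),
]


def find_duplicates(docs):
    """Map each value shared by two or more documents to those document ids."""
    by_value = defaultdict(list)
    for doc_id, value in docs:
        by_value[value].append(doc_id)
    # Drop values that occur in only one document.
    return {value: ids for value, ids in by_value.items() if len(ids) > 1}


duplicates = find_duplicates(docs)
```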



Thanks,
Peter