lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: Tips for getting unique results?
Date Thu, 07 Apr 2011 05:22:33 GMT
Hi,

I think you are saying dupes are the main problem?  If so, 
http://wiki.apache.org/solr/Deduplication ?

Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



----- Original Message ----
> From: Peter Spam <pspam@mac.com>
> To: solr-user@lucene.apache.org
> Sent: Thu, April 7, 2011 1:13:44 AM
> Subject: Tips for getting unique results?
> 
> Hi,
> 
> I have documents with a field that has "1A2B3C" alphanumeric  characters.  I 
>can query for * and sort results based on this field,  however I'd like to 
>"uniq" these results (remove duplicates) so that I can get  the 5 largest unique 
>values.  I can't use the StatsComponent because my  values have letters in them 
>too.
> 
> Faceting (and ignoring the counts) gets  me half of the way there, but I can 
>only sort ascending.  If I could also  sort facet results descending, I'd be 
>done.  I'd rather not return all  documents and just parse the last few results 
>to work around this.
> 
> Any  ideas?
> 
> 
> -Pete
> 

Mime
View raw message