lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Pavel Minchenkov <char...@gmail.com>
Subject Re: Duplicates
Date Fri, 23 Jul 2010 08:15:53 GMT
Thanks.

Does it work with Solr 1.4 (Solr 4.0 mentioned in article)?
What about performance? I need only to delete duplicates (I don't need cout
of duplicates or select certain duplicate).

2010/7/23 Peter Karich <peathal@yahoo.de>

> Another possibility could be the well known 'field collapse' ;-)
>
> http://wiki.apache.org/solr/FieldCollapsing
>
> Regards,
> Peter.
>
> > Thanks.
> >
> > If I set uniqueKey on the field, then I can save duplicates?
> > I need to remove duplicates only from search results. The ability to save
> > duplicates are should be.
> >
> > 2010/7/23 Erick Erickson <erickerickson@gmail.com>
> >
> >
> >> If the field is a single token, just define the uniqueKey on it in your
> >> schema.
> >>
> >> Otherwise, this may be of interest:
> >> http://wiki.apache.org/solr/Deduplication
> >>
> >> Haven't used it myself though...
> >>
> >> best
> >> Erick
> >>
> >> On Thu, Jul 22, 2010 at 6:14 PM, Pavel Minchenkov <chardex@gmail.com>
> >> wrote:
> >>
> >>
> >>> Hi,
> >>>
> >>> Is it possible to remove duplicates in search results by a given field?
> >>>
> >>> Thanks.
> >>>
> >>> --
> >>> Pavel Minchenkov
>
>


-- 
Pavel Minchenkov

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message