lucene-java-user mailing list archives

From Đạt Cao Mạnh <caomanhdat...@gmail.com>
Subject Re: Duplicate filtering
Date Wed, 21 Sep 2016 03:40:56 GMT
Solr already supports de-duplication when adding new documents. You can
refer to the documentation at
https://cwiki.apache.org/confluence/display/solr/De-Duplication
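
If you want to stay with plain Lucene, the "first document stays" rule can
be applied before documents ever reach the index. Below is a minimal
sketch of that idea in plain Java, not Lucene or Solr API: documents are
modeled as simple field maps, and the class name FirstWinsDedup, the
method keepFirst, and the "id" key field are all hypothetical names chosen
for illustration. It assumes the set of unique keys fits in memory. Note
that Lucene's IndexWriter.updateDocument(Term, ...) does the opposite (the
last document wins), so keeping the first one needs a check like this
before adding.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class FirstWinsDedup {

    /**
     * Returns the documents in input order, dropping every document whose
     * value for keyField has already been seen (the first occurrence stays).
     */
    public static List<Map<String, String>> keepFirst(
            List<Map<String, String>> docs, String keyField) {
        Set<String> seen = new HashSet<>();
        List<Map<String, String>> unique = new ArrayList<>();
        for (Map<String, String> doc : docs) {
            String key = doc.get(keyField);
            // Assumption: documents without the key field are never treated
            // as duplicates and are always kept.
            if (key == null || seen.add(key)) {
                unique.add(doc);
            }
        }
        return unique;
    }

    public static void main(String[] args) {
        List<Map<String, String>> docs = List.of(
                Map.of("id", "a", "title", "first a"),
                Map.of("id", "b", "title", "first b"),
                Map.of("id", "a", "title", "second a"));

        // Prints:
        // a -> first a
        // b -> first b
        for (Map<String, String> doc : keepFirst(docs, "id")) {
            System.out.println(doc.get("id") + " -> " + doc.get("title"));
        }
    }
}
```

The surviving documents would then be passed to the indexer as usual. For
an index that already exists across runs, the in-memory set would have to
be replaced by a lookup against the index on the unique field.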

On Tue, Sep 20, 2016 at 12:18 PM Vjeran Marcinko <
vjeran.marcinko@email.t-com.hr> wrote:

> Hello,
>
> I'm pretty much a Lucene newbie, so I'm looking for some short guidelines
> on how to implement duplicate document filtering based on a field that
> defines uniqueness, where the first document stays and later duplicates
> are filtered out.
>
> I know a third-party contrib library for this existed before, but it has
> been abandoned/deprecated for the newer versions of Lucene.
>
> Regards,
> Vjeran
