lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tommaso Teofili <tommaso.teof...@gmail.com>
Subject Re: Document Similarity Algorithm at Solr/Lucene
Date Tue, 23 Jul 2013 09:38:06 GMT
Hi,

I you may leverage and / or improve MLT component [1].

HTH,
Tommaso

[1] : http://wiki.apache.org/solr/MoreLikeThis


2013/7/23 Furkan KAMACI <furkankamaci@gmail.com>

> Hi;
>
> Sometimes a huge part of a document may exist in another document. As like
> in student plagiarism or quotation of a blog post at another blog post.
> Does Solr/Lucene or its libraries (UIMA, OpenNLP, etc.) has any class to
> detect it?
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message