mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From JAGANADH G <jagana...@gmail.com>
Subject Re: Document Comparison with Mahout
Date Thu, 08 Jul 2010 06:21:45 GMT
On Wed, Jul 7, 2010 at 11:49 PM, Grant Ingersoll <gsingers@apache.org>wrote:

> How do you want to determine copy?  Strictly or loosely?  Solr and Nutch
> have some deduplication capabilities, including fuzzy matching.  They
> probably could be brought into Mahout, too.
>
> -Grant
>
>
>
Dear Grant
I am trying to make a strict match.
I will try Solar and Nutch.
Thanks and Regards
-- 
**********************************
JAGANADH G
http://jaganadhg.freeflux.net/blog

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message