mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Kasi Subrahmanyam <kasisubbu...@gmail.com>
Subject Clustering of text data on external categories
Date Fri, 11 Oct 2013 12:04:41 GMT
Hi,

I have a problem that i would like to implement in mahout clustering.

I have input text documents with data like below.

Document1: This is the first document of selling information.
Document2: This is the second document of gathering information.

I also have another look up file with data like below
selling:CatA
gathering:CatB.
information:CatC

NOw i would like to cluster the documents with output being genrated as
Document1:CatA,CatC
Document2:CatB,CatC

Please let me know how to achieve this.

Thanks,
Subbu

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message