mahout-user mailing list archives

From Rajesh Nikam <rajeshni...@gmail.com>
Subject Re: Applying clustering technique
Date Wed, 12 Jun 2013 10:21:25 GMT
Hi,

I am not aware of a tool that takes CSV files directly. However, you could
convert the CSV to an ARFF file by adding a header in the required format, and
then use

"mahout arff.vector" to convert the ARFF file to a sequence file, which can be
used as input to canopy/kmeans.
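
The CSV-to-ARFF step could be sketched roughly as below. This is only a minimal
illustration, not a tested pipeline: the function name csv_to_arff, the relation
name, and the column names in the usage note are hypothetical, and it assumes
every column is numeric (ARFF also supports nominal and string attributes,
which this sketch does not handle).

```python
import csv
import io


def csv_to_arff(csv_text, relation, has_header=True):
    """Return ARFF text for CSV text whose columns are all numeric.

    Assumption: every value parses as a number; non-numeric columns would
    need a different @ATTRIBUTE type, which is not covered here.
    """
    rows = [row for row in csv.reader(io.StringIO(csv_text)) if row]
    if not rows:
        raise ValueError("empty CSV input")

    if has_header:
        names, data = [n.strip() for n in rows[0]], rows[1:]
    else:
        # No header row: generate placeholder attribute names.
        names = ["attr%d" % i for i in range(len(rows[0]))]
        data = rows

    # ARFF header: relation name, then one attribute declaration per column.
    lines = ["@RELATION " + relation, ""]
    for name in names:
        lines.append("@ATTRIBUTE %s NUMERIC" % name)
    lines += ["", "@DATA"]

    # ARFF data section: the original comma-separated rows.
    for row in data:
        if len(row) != len(names):
            raise ValueError("row has %d fields, expected %d" % (len(row), len(names)))
        cells = [cell.strip() for cell in row]
        for cell in cells:
            float(cell)  # raises ValueError if the value is not numeric
        lines.append(",".join(cells))

    return "\n".join(lines) + "\n"
```

For example, csv_to_arff("userId,movieId,rating\n1,1193,5\n", "ratings") yields
an ARFF document with three NUMERIC attributes and one data row. The result can
be written to a .arff file and passed to "mahout arff.vector"; the exact
options are best checked in that command's own help output.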

Thanks,
Rajesh




On Wed, Jun 12, 2013 at 3:41 PM, Neetha <netasusan@gmail.com> wrote:

> Hi,
>
>
> I am using the MovieLens 1M dataset.
>
> I need to run k-means clustering using Mahout and Hadoop. As I understand it,
> the first step is to convert the data into a sequence file, then into
> vector format, and then apply the clustering algorithm. My question is: do I
> need to convert the MovieLens rating.csv file into a sequence file? If so,
> what are the commands for applying the clustering technique using Mahout and
> Hadoop?
>
> Thanking you,
> Neetha Suan Thampi
>
