mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Carlos Seminario <>
Subject Vectorize the movielens 100K dataset for Mahout k-means clustering
Date Mon, 01 Jul 2013 00:29:57 GMT
Hi: I want to vectorize the movielens 100K dataset as a
RandomAccessSparseVector and use it to run Mahout k-means clustering. Has
anyone done this before? If not, any ideas on a how this can be done? (BTW,
movielens dataset contains ~100K records/lines with this format: userid,
itemid, rating, unix time.)

Thanks .. Carlos

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message