mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrew Musselman <>
Subject Preserve contents of keys after running k-means
Date Fri, 05 Jul 2013 19:05:10 GMT
Hi list

We are trying to do some k-means clustering and are wondering if there's an
easy way to preserve the contents of the keys for the input records.


12345: (0,3,79,80)
98765: (1,4,98,90)

where the vectors being clustered are the tuples and the keys are some id.

When we run clusterdump with pointsDir specified we have the vectors but
not the keys.  We're looking at NamedVector as a path to this solution, as
well as looking at a mapping file between ordered integers and the ids in

Thanks for any advice.


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message