mahout-user mailing list archives

From Chris Harrington <ch...@heystaks.com>
Subject Vector distance within a cluster
Date Mon, 25 Feb 2013 12:27:16 GMT
Hi all,

I want to find all the vectors within a cluster and then compute the distance between each
vector and every other vector in that cluster. I hope this will give me a good idea of how
similar the vectors within a cluster are to each other, and help identify outlier vectors.
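
To make the idea concrete, here is a minimal sketch in plain Python rather than Mahout code. It
assumes the vectors of one cluster are already loaded as tuples of floats, uses Euclidean
distance, and flags a vector as an outlier when its mean distance to the other vectors exceeds
a multiple of the cluster's median. The function names and the factor of 2.0 are illustrative
assumptions, not anything Mahout provides.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean_distance_to_others(vectors, distance=euclidean):
    """For each vector, the mean distance to every other vector in the cluster."""
    n = len(vectors)
    if n < 2:
        return [0.0] * n
    totals = [0.0] * n
    # Each unordered pair is computed once and credited to both members.
    for i, j in combinations(range(n), 2):
        d = distance(vectors[i], vectors[j])
        totals[i] += d
        totals[j] += d
    return [t / (n - 1) for t in totals]


def find_outliers(vectors, factor=2.0):
    """Indices of vectors whose mean distance exceeds factor * (upper) median.

    The threshold rule is an arbitrary assumption chosen for illustration.
    """
    means = mean_distance_to_others(vectors)
    if not means:
        return []
    median = sorted(means)[len(means) // 2]
    return [i for i, m in enumerate(means) if m > factor * median]


# Hypothetical cluster: three nearby points and one far away.
cluster = [(1.0, 1.0), (1.2, 0.9), (0.9, 1.1), (8.0, 8.0)]
print(find_outliers(cluster))  # prints [3]
```

Note that the pairwise step is quadratic in the number of vectors per cluster, so for large
clusters it may be necessary to sample or to compare against the centroid instead.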

So there are 2 things I want to ask.

1. Is this a sensible approach to evaluating cluster quality?

2. Is clusteredPoints/parts-m-00000 the correct file to get this information from?

Thanks,
Chris


