mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From paritosh ranjan <paritoshranj...@gmail.com>
Subject Re: RepresentativePointsDriver numIterations
Date Thu, 01 Nov 2012 15:14:03 GMT
Each iteration will add a single point to the evolving list of
representative points for each cluster.
So, I think it depends on the number of vectors per cluster and also the
intra cluster distance.

On Thu, Nov 1, 2012 at 8:13 PM, Rahul Mishra <mishra.rahulk@gmail.com>wrote:

> Hello Friends,
>
> Whats the heuristic for providing what number of iterations for
> RepresentativePointsDriver?
>
> I have run kmeans and fuzzy-kmeans algorithm on a dataset of size 500MB.
> Now, how do I obtain cluster quality?
>
> Does the following look Okay? :
> RepresentativePointsDriver.run(conf, new Path(clustersIn), new
> Path(clusteredPointsIn), new Path(outputDir), new
> EuclideanDistanceMeasure(), numIterations, runSequential);
> double interDis = clusterEval.interClusterDensity();
> double intraDis = clusterEval.intraClusterDensity();
> System.out.println("cluster evaluator: The inter distance: "+interDis);
> System.out.println("cluster evaluator: The intra distance: "+intraDis);
>
>
>
> --
> Regards,
> Rahul K Mishra,
> https://sites.google.com/site/reachrahulkmishra/
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message