mahout-user mailing list archives

From Rahul Mishra <mishra.rah...@gmail.com>
Subject Re: RepresentativePointsDriver numIterations
Date Thu, 01 Nov 2012 15:42:48 GMT
Thanks for the prompt reply, Paritosh.
Could you please explain a bit further? How exactly does the number of
iterations depend on the vectors per cluster and the intra-cluster distance?

Thanks & Regards,
Rahul


On Thu, Nov 1, 2012 at 8:44 PM, paritosh ranjan <paritoshranjan5@gmail.com> wrote:

> Each iteration adds a single point to the evolving list of
> representative points for each cluster.
> So I think the right number of iterations depends on the number of
> vectors per cluster and also on the intra-cluster distance.
>
> On Thu, Nov 1, 2012 at 8:13 PM, Rahul Mishra <mishra.rahulk@gmail.com> wrote:
>
> > Hello Friends,
> >
> > What's the heuristic for choosing the number of iterations for
> > RepresentativePointsDriver?
> >
> > I have run the k-means and fuzzy k-means algorithms on a 500 MB dataset.
> > Now, how do I measure cluster quality?
> >
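> > Does the following look okay?
> >
> > RepresentativePointsDriver.run(conf, new Path(clustersIn),
> >     new Path(clusteredPointsIn), new Path(outputDir),
> >     new EuclideanDistanceMeasure(), numIterations, runSequential);
> > // clusterEval was not declared in the snippet as posted; it is assumed
> > // here to be built from the same conf and clusters path.
> > ClusterEvaluator clusterEval = new ClusterEvaluator(conf, new Path(clustersIn));
> > double interDensity = clusterEval.interClusterDensity();
> > double intraDensity = clusterEval.intraClusterDensity();
> > System.out.println("cluster evaluator: inter-cluster density: " + interDensity);
> > System.out.println("cluster evaluator: intra-cluster density: " + intraDensity);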
> >
> >
> >
> > --
> > Regards,
> > Rahul K Mishra,
> > https://sites.google.com/site/reachrahulkmishra/
> >
>
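
To make the point about iterations concrete, here is a minimal standalone sketch of the idea described above: start from the cluster center and, on each iteration, add one more point from the cluster to its list of representatives. This is not Mahout's code. The class name RepresentativePointsSketch, the select method, and the selection rule (pick the point with the largest summed distance to the current representatives) are assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class RepresentativePointsSketch {

    // Plain Euclidean distance between two vectors of equal length.
    public static double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    // Starts with the cluster center as the only representative, then runs
    // numIterations passes. Each pass adds exactly one point: the remaining
    // point whose summed distance to the current representatives is largest
    // (an assumed selection rule, chosen for illustration).
    public static List<double[]> select(List<double[]> clusterPoints,
                                        double[] center, int numIterations) {
        List<double[]> reps = new ArrayList<>();
        reps.add(center);
        List<double[]> remaining = new ArrayList<>(clusterPoints);
        for (int iter = 0; iter < numIterations && !remaining.isEmpty(); iter++) {
            int bestIndex = -1;
            double bestTotal = -1.0;
            for (int i = 0; i < remaining.size(); i++) {
                double total = 0.0;
                for (double[] rep : reps) {
                    total += distance(rep, remaining.get(i));
                }
                if (total > bestTotal) {
                    bestTotal = total;
                    bestIndex = i;
                }
            }
            // Move the chosen point from the candidates to the representatives.
            reps.add(remaining.remove(bestIndex));
        }
        return reps;
    }

    public static void main(String[] args) {
        // A toy cluster of four 2-D points around the origin.
        List<double[]> points = Arrays.asList(
            new double[]{0, 0}, new double[]{1, 0},
            new double[]{0, 3}, new double[]{-4, 0});
        double[] center = {0, 0};
        for (int n = 0; n <= 5; n++) {
            System.out.println(n + " iterations -> "
                + select(points, center, n).size() + " representative points");
        }
    }
}
```

Under these assumptions a cluster ends up with at most numIterations + 1 representatives, and never more than it has points, so iterations beyond the size of a cluster add nothing for that cluster. Widely spread clusters may need more representatives to be described well than tight ones, which is one way to read the dependence on intra-cluster distance.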



-- 
Regards,
Rahul K Mishra,
https://sites.google.com/site/reachrahulkmishra/
