mahout-user mailing list archives

From Adil Aijaz <a...@yahoo-inc.com>
Subject randomseedgenerator
Date Wed, 01 Jul 2009 18:07:29 GMT
I was looking at the RandomSeedGenerator and, correct me if I am wrong, 
it is not really random; rather, it runs a series of Bernoulli trials 
in which points near the beginning of your data always have a higher 
chance of being selected than those near the end.

Maybe that's not a problem, since given sufficient iterations k-means 
should converge toward a solution. But I thought I'd point it out in 
case there is an issue here.
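
To illustrate the kind of bias I mean, here is a minimal sketch. It is not 
the actual RandomSeedGenerator code; the class name, the acceptance 
probability of 0.5, and the "stop once k seeds are chosen" rule are all 
assumptions made up for the example. It compares such a sequential 
Bernoulli scheme against reservoir sampling, which picks each point with 
equal probability k/n.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Hypothetical illustration only; not taken from the Mahout source.
public class SeedSelectionSketch {

    // Assumed biased scheme: walk the data in order, accept each point with
    // probability p, and stop as soon as k seeds have been chosen.
    public static List<Integer> sequentialBernoulli(int n, int k, double p, Random rng) {
        List<Integer> chosen = new ArrayList<>();
        for (int i = 0; i < n && chosen.size() < k; i++) {
            if (rng.nextDouble() < p) {
                chosen.add(i);
            }
        }
        return chosen;
    }

    // Reservoir sampling (Algorithm R): one pass over the data, and every
    // index ends up in the sample with probability k/n.
    public static List<Integer> reservoir(int n, int k, Random rng) {
        List<Integer> sample = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            if (sample.size() < k) {
                sample.add(i);              // fill the reservoir first
            } else {
                int j = rng.nextInt(i + 1); // uniform in [0, i]
                if (j < k) {
                    sample.set(j, i);       // replace a random slot
                }
            }
        }
        return sample;
    }

    // Fraction of selected seeds whose index lies in the first half of the
    // data, accumulated over many independent trials.
    public static double firstHalfFraction(boolean useReservoir, int n, int k,
                                           int trials, long seed) {
        Random rng = new Random(seed);
        long firstHalf = 0;
        long total = 0;
        for (int t = 0; t < trials; t++) {
            List<Integer> seeds = useReservoir
                    ? reservoir(n, k, rng)
                    : sequentialBernoulli(n, k, 0.5, rng);
            for (int idx : seeds) {
                if (idx < n / 2) {
                    firstHalf++;
                }
                total++;
            }
        }
        return total == 0 ? 0.0 : (double) firstHalf / total;
    }

    public static void main(String[] args) {
        System.out.println("sequential Bernoulli, first-half fraction: "
                + firstHalfFraction(false, 1000, 10, 2000, 42L));
        System.out.println("reservoir sampling, first-half fraction:   "
                + firstHalfFraction(true, 1000, 10, 2000, 42L));
    }
}
```

With 1000 points and 10 seeds, the sequential scheme should put essentially 
all of its seeds in the first half of the data, while reservoir sampling 
should put about half of them there.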

Adil
