spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Joseph Bradley <jos...@databricks.com>
Subject Re: Restricting the number of iterations in Mllib Kmeans
Date Mon, 01 Jun 2015 19:02:05 GMT
Hi Suman & Meethu,
Apologies---I was wrong about KMeans supporting an initial set of
centroids!  JIRA created: https://issues.apache.org/jira/browse/SPARK-8018
If you're interested in submitting a PR, please do!
Thanks,
Joseph

On Mon, Jun 1, 2015 at 2:25 AM, MEETHU MATHEW <meethu2006@yahoo.co.in>
wrote:

> Hi Joseph,
> I was unable to find any function in Kmeans.scala where the initial
> centroids could be specified by the user. Kindly help.
>
> Thanks & Regards,
> Meethu M
>
>
>
>   On Tuesday, 19 May 2015 6:54 AM, Joseph Bradley <joseph@databricks.com>
> wrote:
>
>
> Hi Suman,
>
> For maxIterations, are you using the DenseKMeans.scala example code?  (I'm
> guessing yes since you mention the command line.)  If so, then you should
> be able to specify maxIterations via an extra parameter like
> "--numIterations 50" (note the example uses "numIterations" in the current
> master instead of "maxIterations," which is sort of a bug in the example).
> If that does not cap the max iterations, then please report it as a bug.
>
> To specify the initial centroids, you will need to modify the DenseKMeans
> example code.  Please see the KMeans API docs for details.
>
> Good luck,
> Joseph
>
> On Mon, May 18, 2015 at 3:22 AM, MEETHU MATHEW <meethu2006@yahoo.co.in>
> wrote:
>
> Hi,
> I think you cant supply an initial set of centroids to kmeans
>
> Thanks & Regards,
> Meethu M
>
>
>
>   On Friday, 15 May 2015 12:37 AM, Suman Somasundar <
> suman.somasundar@oracle.com> wrote:
>
>
> Hi,,
>
> I want to run a definite number of iterations in Kmeans.  There is a
> command line argument to set maxIterations, but even if I set it to a
> number, Kmeans runs until the centroids converge.
> Is there a specific way to specify it in command line?
>
> Also, I wanted to know if we can supply the initial set of centroids to
> the program instead of it choosing the centroids in random?
>
> Thanks,
> Suman.
>
>
>
>
>
>

Mime
View raw message