spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Erlandson <...@redhat.com>
Subject Re: Streaming K-medoids
Date Mon, 01 Jun 2015 16:35:17 GMT

I haven't given any thought to streaming it, but in case it's useful I do have a k-medoids
implementation for Spark:
http://silex.freevariable.com/latest/api/#com.redhat.et.silex.cluster.KMedoids

Also a blog post about multi-threading it:
http://erikerlandson.github.io/blog/2015/05/06/parallel-k-medoids-using-scala-parseq/



----- Original Message -----
> Hello everyone,
> 
> I have an idea and I would like to get a validation from community about
> it.
> 
> In Mahout there is an implementation of Streaming K-means. I'm
> interested in your opinion would it make sense to make a similar
> implementation of Streaming K-medoids?
> 
> K-medoids has even bigger problems than K-means because it's not
> scalable, but can be useful in some cases (e.g. It allows more
> sophisticated distance measures).
> 
> What is your opinion about such an approach? Does anyone see problems
> with it?
> 
> Best regards,
> Marko
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
> For additional commands, e-mail: user-help@spark.apache.org
> 
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org


Mime
View raw message