spark-dev mailing list archives

From Debasish Das <>
Subject Spark Matrix Factorization
Date Thu, 02 Jan 2014 23:16:33 GMT

I do not see any DSGD (distributed stochastic gradient descent)
implementation of matrix factorization in Spark.

There are two ALS implementations.

org.apache.spark.examples.SparkALS does not run on large matrices and seems
more like demo code.

org.apache.spark.mllib.recommendation.ALS looks like a more robust version,
and I am experimenting with it.

References here are Jellyfish, Twitter's implementation of Jellyfish called
Scalafish, a Google paper called Sparkler, and a similar idea put forward in
an IBM paper by Gemulla et al. ("Large-Scale Matrix Factorization with
Distributed Stochastic Gradient Descent").
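
To make the DSGD idea concrete, here is a minimal single-process Python
sketch of the stratified update schedule as I understand it from the Gemulla
et al. paper. It is an illustration only, not Spark code: the names
dsgd_factorize and rmse, the hyperparameters, and the block assignment rule
are assumptions made for this example, and the blocks of each stratum are
processed one after another here instead of on separate workers.

```python
import random


def dsgd_factorize(ratings, n_rows, n_cols, rank=2, d=2, epochs=200,
                   lr=0.02, reg=0.01, seed=0):
    """Sequentially simulate a DSGD-style stratified SGD factorization.

    ratings: list of (i, j, value) triples for the observed entries.
    d: number of row blocks and column blocks (a stand-in for the
       number of workers in a distributed setting).
    """
    rng = random.Random(seed)
    W = [[rng.uniform(0.0, 0.5) for _ in range(rank)] for _ in range(n_rows)]
    H = [[rng.uniform(0.0, 0.5) for _ in range(rank)] for _ in range(n_cols)]

    # Assign every observed entry to a block (row_block, col_block).
    blocks = {}
    for i, j, v in ratings:
        key = (i * d // n_rows, j * d // n_cols)
        blocks.setdefault(key, []).append((i, j, v))

    for _ in range(epochs):
        # One epoch consists of d sub-epochs. Sub-epoch s uses the stratum
        # {(b, (b + s) % d) for b in 0..d-1}. The blocks of a stratum share
        # no rows and no columns, so a distributed version could update
        # them in parallel without conflicting writes to W or H.
        for s in range(d):
            for b in range(d):
                entries = blocks.get((b, (b + s) % d), [])
                rng.shuffle(entries)
                for i, j, v in entries:
                    pred = sum(W[i][k] * H[j][k] for k in range(rank))
                    err = v - pred
                    for k in range(rank):
                        w, h = W[i][k], H[j][k]
                        # Plain SGD step on squared error with L2 penalty.
                        W[i][k] += lr * (err * h - reg * w)
                        H[j][k] += lr * (err * w - reg * h)
    return W, H


def rmse(ratings, W, H):
    """Root mean squared error over the observed entries."""
    se = sum((v - sum(a * b for a, b in zip(W[i], H[j]))) ** 2
             for i, j, v in ratings)
    return (se / len(ratings)) ** 0.5
```

In a Spark version, each block of a stratum would presumably live in its own
partition, with the corresponding slices of W and H shipped between
sub-epochs; that part is not shown here.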

Are there any plans to add DSGD to Spark, or is there an existing JIRA?

