spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Deepak Gopalakrishnan <>
Subject Running ALS on comparitively large RDD
Date Fri, 11 Mar 2016 03:52:35 GMT
Hello All,

I've been running Spark's ALS on a dataset of users and rated items. I
first encode my users to integers by using an auto increment function (
just like zipWithIndex), I do the same for my items. I then create an RDD
of the ratings and feed it to ALS.

My issue is that the ALS algorithm never completes. Attached is a
screenshot of the stages window.

Any help will be greatly appreciated

*Deepak Gopalakrishnan*
*Skype* : deepakgk87

View raw message