spark-user mailing list archives

From Xiangrui Meng <men...@gmail.com>
Subject Re: How to use BigInteger for userId and productId in collaborative Filtering?
Date Sat, 10 Jan 2015 05:50:05 GMT
Do you have more than 2 billion users/products? If not, you can pair
each user/product id with an integer (check RDD.zipWithUniqueId), use
them in ALS, and then join the original bigInt IDs back after
training. -Xiangrui
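
A minimal sketch of the remapping described above, written in plain Python so it runs without a cluster. It is an illustration, not code from this thread: the sample ratings and the helper names (`build_index`, `restore`) are hypothetical, and the ALS training step itself is left out. In Spark, the index would be built with RDD.zipWithUniqueId, which returns Long ids that are unique but may have gaps, so they would still need to fit in an Int before being passed to MLlib's Rating.

```python
# Plain-Python sketch of the id-remapping idea (no Spark required).
INT_MAX = 2**31 - 1  # largest value of a 32-bit signed Int


def build_index(big_ids):
    """Assign each distinct big id a dense integer id starting at 0."""
    index = {}
    for big_id in big_ids:
        if big_id not in index:
            index[big_id] = len(index)
    # The largest assigned id is len(index) - 1; it must fit in an Int.
    if len(index) - 1 > INT_MAX:
        raise ValueError("more distinct ids than fit in a 32-bit Int")
    return index


# Hypothetical ratings: (userId, productId, rating) with very large ids.
ratings = [
    (10**20 + 7, 9 * 10**18 + 1, 5.0),
    (10**20 + 7, 9 * 10**18 + 2, 3.0),
    (10**20 + 9, 9 * 10**18 + 1, 4.0),
]

user_index = build_index(u for u, _, _ in ratings)
product_index = build_index(p for _, p, _ in ratings)

# Ratings rewritten with small integer ids; this is what would be fed to ALS.
int_ratings = [(user_index[u], product_index[p], r) for u, p, r in ratings]

# Reverse lookups, playing the role of the join back to the original ids.
user_lookup = {i: u for u, i in user_index.items()}
product_lookup = {i: p for p, i in product_index.items()}


def restore(int_user, int_product, score):
    """Translate a result keyed by integer ids back to the original big ids."""
    return (user_lookup[int_user], product_lookup[int_product], score)
```

On a real dataset the two lookup tables would stay distributed and be joined against the model output rather than held in local dictionaries.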

On Fri, Jan 9, 2015 at 5:12 PM, nishanthps <nishanthps@gmail.com> wrote:
> Hi,
>
> The userIds and productIds in my data are bigInts. What is the best way to
> run collaborative filtering on this data? Should I modify MLlib's
> implementation to support more types, or is there an easier way?
>
> Thanks!
> Nishanth
