spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sean Owen <so...@cloudera.com>
Subject Re: implicit ALS dataSet
Date Thu, 19 Jun 2014 14:13:47 GMT
On Thu, Jun 19, 2014 at 3:03 PM, redocpot <julien19890118@gmail.com> wrote:
> We did some sanity check. For example, each user has his own item list which
> is sorted by preference, then we just pick the top 10 items for each user.
> As a result, we found that there were only 169 different items among the
> (1060080 x 10) items picked, most of them are repeated. That means, given 2
> users, the items recommended might be the same. Nothing is personalized.

This sounds like severe underfitting -- lambda is too high or the
number of features is too small.

Mime
View raw message