mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sean Owen <sro...@gmail.com>
Subject Re: Mahout performance issues
Date Fri, 02 Dec 2011 18:03:13 GMT
Yes, but those users will bring no more candidate items to consider, and
the apparent bottleneck is not touching those users, but later computing
all those similarities. That's my argument.

On Fri, Dec 2, 2011 at 5:56 PM, Ted Dunning <ted.dunning@gmail.com> wrote:
>
> Actually, if these users single item is a fantastically popular item, then
> all of those users will be roped into the computation (with no effect).
>
> Sean's argument would be correct if the users were each interacting with
> some item that is way out in the low frequency tail.  By Murphy, this won't
> be the case.
>
> Better to dump the uninformative items using a kill list.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message