mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <>
Subject Re: Popularity of recommender items
Date Sun, 09 Feb 2014 03:19:49 GMT
I have different opinions about each piece.

I think that cross recommendation is as core as RowSimilarityJob and should
be a parallel implementation or integrated.  Parallel is probably easier.
 It is even plausible to have a version of RowSimilarityJob that doesn't
support all the different distance measures but does support multiple cross
and direct processing using LLR or related cooccurrence based measures.  It
would be very cool if a single pass over the data could do many kinds of co
or cross occurrence operations.

For dithering, it really is post processing.  That said, it is also the
single largest improvement that anybody typically gets when testing
different options so it is a bit goofy to not have good support for some
kinds of dithering.

For Thompson sampled recommenders, I am not sure where to start hacking on
our current code.

On Sat, Feb 8, 2014 at 4:53 PM, Pat Ferrel <> wrote:

> That was by no means to criticize effort level, which has been impressive
> especially during the release.
> It was more a question about the best place to add these things and
> whether they are important. Whether people see these things as custom post
> processing or core.
> On Feb 8, 2014, at 12:13 PM, Ted Dunning <> wrote:
> …
> The reason that we aren't adding this like cross-rec and other things is
> that "we" have full-time jobs, mostly.  Suneel is full-time on Mahout, but
> the rest are not.  You seem more active than most.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message