mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sean Owen <>
Subject Re: Replacing the Netflix data set
Date Fri, 07 May 2010 15:38:26 GMT
Cool, yeah I'm looking for something even larger, since this is small
enough that processing it easily fits on one computer. The chapter in
question is about distributing via Hadoop.

My current next-best option, if it can be used, is the LiveJournal
network data here:

On Fri, May 7, 2010 at 4:29 PM, Pedro Oliveira <> wrote:
> This dataset seems to have a few million <user, artist, plays> triples from

View raw message