mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sean Owen <>
Subject Re: Can all the algorithms in Mahout be run locally without a Hadoop cluster.
Date Sat, 25 Jun 2011 09:21:41 GMT
I think EMR is well worth using. I just think you do want to throw more, and
smaller, machines at the task than you imagine. I used the 'small' instance
but you might get away with a fleet of micro instances even. And do most
certainly request spot instances for your workers (but pay full rate for
your master to ensure it's not killed). It stays reasonably economical this
way, even if I wouldn't call this "dirt cheap".

On Sat, Jun 25, 2011 at 9:06 AM, Chris Schilling <> wrote:

> Hey Sean,
> Just curious about your AWS comment.  I am only in very early testing
> phases with AWS EMR.  So, would you say that you generally recommend
> manually setting an EC2 cluster to run Mahout over EMR?  I guess the
> question is: for those of us without the resources to setup an in-house
> hadoop cluster, what is the best setup we can hope to acheive?

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message