Thanks for the clarification. I'll fix the slide. I've done a lot of Scalding/Cascading programming where the two concepts are synonymous, but clearly I was imposing my prejudices here ;)

dean


On Thu, May 1, 2014 at 8:18 AM, Daniel Darabos <daniel.darabos@lynxanalytics.com> wrote:
Cool intro, thanks! One question. On slide 23 it says "Standalone ("local" mode)". That sounds a bit confusing without hearing the talk.

Standalone mode is not local. It just does not depend on a cluster software. I think it's the best mode for EC2/GCE, because they provide a distributed filesystem anyway (S3/GCS). Why configure Hadoop if you don't have to.


On Thu, May 1, 2014 at 12:25 AM, Dean Wampler <deanwampler@gmail.com> wrote:
I meant to post this last week, but this is a talk I gave at the Philly ETE conf. last week:


Also here:


dean

--
Dean Wampler, Ph.D.




--
Dean Wampler, Ph.D.
Typesafe
@deanwampler
http://typesafe.com
http://polyglotprogramming.com