mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sebastian Schelter <>
Subject Announcement: 'Parallel Processing beyond MapReduce' workshop after Berlin Buzzwords
Date Wed, 02 May 2012 05:57:50 GMT

I'd like to announce a 2-day workshop called 'Parallel Processing beyond
MapReduce', which will take place subsequent to this years Berlin
Buzzwords conference:

This workshop will discuss novel paradigms for parallel processing
beyond the traditional MapReduce paradigm offered by Apache Hadoop.

The workshop will introduce two new systems:

Apache Giraph aims at processing large graphs, runs on standard Hadoop
infrastructure and is a loose port of Google's Pregel system. Giraph
follows the bulk-synchronous parallel model relative to graphs where
vertices can send messages to other vertices during a given superstep.

Stratosphere ( is a system that is developed
in a joint research project by Technische Universit├Ąt Berlin, Humboldt
Universit├Ąt zu Berlin and the Hasso-Plattner-Institut in Potsdam. It is
a database inspired, large-scale data processor based on concepts of
robust and adaptive execution. Stratosphere offers the PACT programming
model that extends the MapReduce programming model with additional
second order functions. As execution platform it uses the Nephele
system, a massively parallel data flow engine which is also researched
and developed in the project.

Attendees will hear about the new possibilities of Hadoop's NextGen
MapReduce architecture (YARN) and get a detailed introduction to the
Apache Giraph and Stratosphere systems. After that there will be plenty
of time for questions, discussions and diving into source code.

For more infos checkout:


View raw message