beam-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject [1/3] incubator-beam-site git commit: Added a half birthday blog post, as discussed on dev@.
Date Wed, 03 Aug 2016 20:35:07 GMT
Repository: incubator-beam-site
Updated Branches:
  refs/heads/asf-site c630ee0ed -> 342fe7e42

Added a half birthday blog post, as discussed on dev@.


Branch: refs/heads/asf-site
Commit: 4244fe2b9638b973d99ab73bad9eace7f79cb070
Parents: c630ee0
Author: Frances Perry <>
Authored: Tue Aug 2 22:36:07 2016 -0700
Committer: Dan Halperin <>
Committed: Wed Aug 3 11:59:04 2016 -0700

 _posts/ | 43 ++++++++++++++++++++++++++++++++++++
 1 file changed, 43 insertions(+)
diff --git a/_posts/ b/_posts/
new file mode 100644
index 0000000..c4ed246
--- /dev/null
+++ b/_posts/
@@ -0,0 +1,43 @@
+layout: post
+title:  "Apache Beam: Six Months in Incubation"
+date:   2016-08-04 00:00:01 -0700
+excerpt_separator: <!--more-->
+categories: blog
+  - fjp
+It’s been just over six months since Apache Beam was formally accepted into incubation
with the [Apache Software Foundation]( As a community, we’ve been
hard at work getting Beam off the ground.
+Looking just at raw numbers for those first six months, that’s:
+* 48,238 lines of preexisting code donated by Cloudera, dataArtisans, and Google.
+* 761 pull requests from 45 contributors.
+* 498 Jira issues opened and 245 resolved.
+* 1 incubating release (and another 1 in progress). 
+* 4,200 hours of automated tests. 
+* 161 subscribers / 606 messages on user@.
+* 217 subscribers / 1205 messages on dev@.
+* 277 stars and 174 forks on GitHub.
+And behind those numbers, there’s been a ton of technical progress, including:
+* Refactoring of the entire codebase, examples, and tests to be truly runner-independent.
+* New functionality in the Apache Flink runner for timestamps/windows in batch and bounded
sources and side inputs in streaming mode.
+* Work in progress to upgrade the Apache Spark runner to use Spark 2.0.
+* Several new runners from the wider Apache community -- Apache Gearpump has its own feature
branch, Apache Apex has a PR, and conversations are starting on Apache Storm and others.
+* New SDKs/DSLs for exposing the Beam model -- the Python SDK from Google is in on a feature
branch, and there are plans to add the Scio DSL from Spotify.
+* Support for additional data sources and sinks -- Apache Kafka and JMS are in, there are
PRs for Amazon Kinesis, Apache Cassandra, and MongoDB, and more connectors are being planned.
+But perhaps most importantly, we’re committed to building an involved, welcoming community.
So far, we’ve:
+* Started building a vibrant developer community, with detailed design discussions on features
like DoFn reuse semantics, serialization technology, and an API for accessing state.
+* Started building a user community with an active mailing list and improvements to the website
and documentation.
+* Had multiple talks on Beam at venues including ApacheCon, Hadoop Summit, Kafka Summit,
JBCN Barcelona, and Strata.
+* Presented at multiple existing meetups and are starting to organize some of our own.
+While it’s nice to reflect back on all we’ve done, we’re working full _stream_ ahead
towards a stable release and graduation from incubator. And we’d love your help -- join
the [mailing lists](, check out the [contribution
guide](, and grab a [starter
from Jira!

View raw message