flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Slim Baltagi <sbalt...@gmail.com>
Subject Re: Benchmark results between Flink and Spark
Date Mon, 06 Jul 2015 18:41:07 GMT
Hi 

Vasia, thanks for sharing.
1. I would like to add a couple resources about *BigBench*, the Big Data
benchmark suite that you are referring to: 
 https://github.com/intel-hadoop/Big-Data-Benchmark-for-Big-Bench 
and also 
http://blog.cloudera.com/blog/2014/11/bigbench-toward-an-industry-standard-benchmark-for-big-data-analytics/

2. *BigDataBench* is also an open source Big Data Benchmarking suite from
both industry and academia.  As a subset of BigDataBench, BigDataBench-DCA 
is China’s first industry-standard big data benchmark suite:
http://prof.ict.ac.cn/BigDataBench/industry-standard-benchmarks/
It comes with *real-world data sets* and *many workloads*: TeraSort,
WordCount, PageRank, K-means, NaiveBayes, Aggregation and Read/Write/Scan
and also a *tool* that uses Hadoop, HBase and Mahout.
This might be inspiring to build a Big Data Benchmarking suite for Flink!

Regards,

Slim Baltagi
Apache Flink Knowledge Base ( Now with over 300 categorized web resources!)
http://sparkbigdata.com/component/tags/tag/27-flink



--
View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Benchmark-results-between-Flink-and-Spark-tp1940p1963.html
Sent from the Apache Flink User Mailing List archive. mailing list archive at Nabble.com.

Mime
View raw message