flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chenliang (Liang, DataSight)" <chenliang...@huawei.com>
Subject Discussion: two different computation approaches for flink and spark(Operator-centric, intermediate data-centric)
Date Thu, 06 Aug 2015 19:26:30 GMT

Flink and Spark take different approaches to do computation, each of which has its pros and
Who can elaborate the pros and cons for "Operator-centric, intermediate data-centric"? Any
help would be very appreciated.

First of all, share my understanding:
Operator-centric: Could have more performance for steaming scenarios, don't need to wait data
together for computation.
Intermediate data-centirc: Easy to do integration across different system to share intermediate
data, especially for data mining and machine learning, need to reuse intermediate data for
multiple iteration computation,the approach would be more efficient.

Huawei Technologies Co., Ltd.

View raw message