Hi

Flink and Spark take different approaches to do computation, each of which has 
its pros and cons.
Who can elaborate the pros and cons for "Operator-centric, intermediate 
data-centric"? Any help would be very appreciated.

First of all, share my understanding:
Operator-centric: Could have more performance for steaming scenarios, don't 
need to wait data together for computation.
Intermediate data-centirc: Easy to do integration across different system to 
share intermediate data, especially for data mining and machine learning, need 
to reuse intermediate data for multiple iteration computation,the approach 
would be more efficient.
[cid:image003.png@01D0D0BE.CE49A180]

Regards
Liang
Huawei Technologies Co., Ltd.

Reply via email to