Hi Flink and Spark take different approaches to do computation, each of which has its pros and cons. Who can elaborate the pros and cons for "Operator-centric, intermediate data-centric"? Any help would be very appreciated.
First of all, share my understanding: Operator-centric: Could have more performance for steaming scenarios, don't need to wait data together for computation. Intermediate data-centirc: Easy to do integration across different system to share intermediate data, especially for data mining and machine learning, need to reuse intermediate data for multiple iteration computation,the approach would be more efficient. [cid:image003.png@01D0D0BE.CE49A180] Regards Liang Huawei Technologies Co., Ltd.