Re: [DISCUSS] FLIP-48: Pluggable Intermediate Result Storage

2019-09-30 Thread Stephan Ewen
I understand your argument about performance, though I am unsure it will be that noticeable and warrants the added complexity, Besides performance, I believe the other two arguments in the previous mails are quite important to consider: - Keeping complexity out of the core is a good goal. It is

Re: [DISCUSS] FLIP-48: Pluggable Intermediate Result Storage

2019-09-23 Thread Becket Qin
Hi Stephan, In terms of the performance concern, please see my understanding below. ## Breaking the pipeline v.s. adding a sink. If two operators are initially chained, they will belong to the same stage in the DAG and the same task, therefore the main processing path will just have one task with

Re: [DISCUSS] FLIP-48: Pluggable Intermediate Result Storage

2019-09-22 Thread Stephan Ewen
## About the improvements you mentioned in (1) - I am not sure that this helps to improve performance by avoiding to break the pipeline. Attaching an additional sink, would in virtually any case add even more overhead than the pipeline breaking. What is your reasoning why it would be fas

Re: [DISCUSS] FLIP-48: Pluggable Intermediate Result Storage

2019-09-18 Thread Becket Qin
Hi Stephan, Sorry for the belated reply. You are right that the functionality proposed in this FLIP can be implemented out of the Flink core as an ecosystem project. The main motivation of this FLIP is two folds: 1. Improve the performance of intermediate result sharing in the same session. Usin

Re: [DISCUSS] FLIP-48: Pluggable Intermediate Result Storage

2019-08-15 Thread Stephan Ewen
Sorry for the late response. So many FLIPs these days. I am a bit unsure about the motivation here, and that this need to be a part of Flink. It sounds like this can be perfectly built around Flink as a minimal library on top of it, without any change in the core APIs or runtime. The proposal to

[DISCUSS] FLIP-48: Pluggable Intermediate Result Storage

2019-07-24 Thread Xuannan Su
Hi folks, I would like to start the FLIP discussion thread about the pluggable intermediate result storage. This is phase 2 of FLIP-36: Support Interactive Programming in Flink Skip to end of metadata. While the FLIP-36 provides a default implementation of the intermediate result storage using