Hi Weijie,

Regarding FLIP-409, I have the following questions:

Q1. Other DataStream types are converted into Non-Keyed DataStreams through a "shuffle" operation. Does this "shuffle" cover the various repartition operations (rebalance/rescale/shuffle) from DataStream V1?

Q2. Why does TwoOutputStreamProcessFunction, when applied to a KeyedStream, only support the output combinations (Keyed + Keyed) and (Non-Keyed + Non-Keyed)?

I hope to receive answers when you have time, thank you!

Best,
Wencong Liu

At 2023-12-26 14:43:47, "weijie guo" <guoweijieres...@gmail.com> wrote:
>Hi devs,
>
>I'd like to start a discussion about FLIP-409: DataStream V2 Building
>Blocks: DataStream, Partitioning and ProcessFunction [1].
>
>As the first sub-FLIP for DataStream API V2, we'd like to discuss and
>try to answer some of the most fundamental questions in stream
>processing:
>
> 1. What kinds of data streams do we have?
> 2. How to partition data over the streams?
> 3. How to define a processing on the data stream?
>
>The answers to these questions involve three core concepts: DataStream,
>Partitioning and ProcessFunction. In this FLIP, we will discuss the
>definitions and related API primitives of these concepts in detail.
>
>You can find more details in FLIP-409 [1]. This sub-FLIP is at the
>heart of the entire DataStream API V2, and its relationship with other
>sub-FLIPs can be found in the umbrella FLIP [2].
>
>Looking forward to hearing from you, thanks!
>
>Best regards,
>
>Weijie
>
>[1]
>https://cwiki.apache.org/confluence/display/FLINK/FLIP-409%3A+DataStream+V2+Building+Blocks%3A+DataStream%2C+Partitioning+and+ProcessFunction
>
>[2]
>https://cwiki.apache.org/confluence/display/FLINK/FLIP-408%3A+%5BUmbrella%5D+Introduce+DataStream+API+V2