Hi Weijie,

Regarding FLIP-409, I have the following questions:


Q1. Other DataStream types are converted into 
Non-Keyed DataStreams by using a "shuffle" operation 
to convert Input into output. Does this "shuffle" include the 
various repartition operations (rebalance/rescale/shuffle) 
from DataStream V1?


Q2. Why is the design for TwoOutputStreamProcessFunction, 
when dealing with a KeyedStream, only outputting combinations 
of (Keyed + Keyed) and (Non-Keyed + Non-Keyed)?


I hope to receive answers when you have time, thank you!


Best,
Wencong Liu














At 2023-12-26 14:43:47, "weijie guo" <guoweijieres...@gmail.com> wrote:
>Hi devs,
>
>
>I'd like to start a discussion about FLIP-409: DataStream V2 Building
>Blocks: DataStream, Partitioning and ProcessFunction [1].
>
>
>As the first sub-FLIP for DataStream API V2, we'd like to discuss and
>try to answer some of the most fundamental questions in stream
>processing:
>
>   1. What kinds of data streams do we have?
>   2. How to partition data over the streams?
>   3. How to define a processing on the data stream?
>
>The answer to these questions involve three core concepts: DataStream,
>Partitioning and ProcessFunction. In this FLIP, we will discuss the
>definitions and related API primitives of these concepts in detail.
>
>
>You can find more details in FLIP-409 [1]. This sub-FLIP is at the
>heart of the entire DataStream API V2, and its relationship with other
>sub-FLIPs can be found in the umbrella FLIP [2].
>
>
>Looking forward to hearing from you, thanks!
>
>
>Best regards,
>
>Weijie
>
>
>
>[1]
>https://cwiki.apache.org/confluence/display/FLINK/FLIP-409%3A+DataStream+V2+Building+Blocks%3A+DataStream%2C+Partitioning+and+ProcessFunction
>
>[2]
>https://cwiki.apache.org/confluence/display/FLINK/FLIP-408%3A+%5BUmbrella%5D+Introduce+DataStream+API+V2

Reply via email to