On 27.04.20 09:34, David Morávek wrote:
When we include `flatMap` in between rebalances -> `.rebalance().flatMap(...).rebalance()`, we need to reshuffle again, because dataset distribution may have changed (eg. you can possibli emit unbouded stream from a single element). Unfortunatelly `flatMap` output is still incorrectly marked as `FORCED_REBALANCED` and the second reshuffle gets ignored.
This indeed seems incorrect. Did you look into the Flink code to see why the output of the flatMap is `FORCED_REBALANCED`?
Aljoscha