I don't think that the mapping is that sophisticated.
I'd assume it is a bit simpler and just keeps one local pipeline (the one
with the same subtask index) which will run in the same slot (unless
explicitly configured differently).

TBH, I would not rely on this behavior. rescale() is rather an artifact of
the first version of the DataStream API.

Best, Fabian

2018-02-20 11:00 GMT+01:00 m@xi <makisnt...@gmail.com>:

> Hey Fabian!
>
> Thanks for the comprehensive replies. Now I understand those concepts
> properly.
>
> Regarding .rescale() , it does not receive any arguments. Thus, I assume
> that the way it does the shuffling from operator A to operator B instances
> is a black box for the programmer and probably has to do with the number of
> slots in each taskmanager. It strives to favour local data exchange (aka
> *intra-exchange* : between slot of the same taskmanager) instead of
> *inter-exchange* of data between different taskmanagers (that burdens the
> network).
>
> Am I correct?
>
> Thanks in advance.
>
> Best,
> Max
>
>
>
> --
> Sent from: http://apache-flink-user-mailing-list-archive.2336050.
> n4.nabble.com/
>

Reply via email to