I don't think that the mapping is that sophisticated. I'd assume it is a bit simpler and just keeps one local pipeline (the one with the same subtask index) which will run in the same slot (unless explicitly configured differently).
TBH, I would not rely on this behavior. rescale() is rather an artifact of the first version of the DataStream API. Best, Fabian 2018-02-20 11:00 GMT+01:00 m@xi <makisnt...@gmail.com>: > Hey Fabian! > > Thanks for the comprehensive replies. Now I understand those concepts > properly. > > Regarding .rescale() , it does not receive any arguments. Thus, I assume > that the way it does the shuffling from operator A to operator B instances > is a black box for the programmer and probably has to do with the number of > slots in each taskmanager. It strives to favour local data exchange (aka > *intra-exchange* : between slot of the same taskmanager) instead of > *inter-exchange* of data between different taskmanagers (that burdens the > network). > > Am I correct? > > Thanks in advance. > > Best, > Max > > > > -- > Sent from: http://apache-flink-user-mailing-list-archive.2336050. > n4.nabble.com/ >