Hi Averell,

Actually, performing a key partition inside the source function is the same as DataStream[Source].keyBy(custom partitioner), because keyBy is not a real operator. It is a virtual node in the DAG and does not correspond to a physical operator.
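To make the "same hash function" idea concrete, here is a rough, Flink-free sketch in plain Java. Everything in it is hypothetical: `PartitionSketch`, `subtaskFor`, and the simple `hashCode`-modulo routing are stand-ins I made up for illustration, not Flink's internal key-group assignment. It only shows that if the source and the following keyed exchange agree on the routing function, every record is already sitting on the subtask index the exchange would pick. It does not show what the Flink runtime actually does with the exchange.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PartitionSketch {

    // Hypothetical routing function (an assumption for this sketch):
    // NOT the hashing Flink uses internally for keyed streams.
    static int subtaskFor(String key, int parallelism) {
        return Math.floorMod(key.hashCode(), parallelism);
    }

    // "Source-side" partitioning: place each record in the bucket
    // chosen by the routing function.
    static List<List<String>> partitionInSource(List<String> keys, int parallelism) {
        List<List<String>> buckets = new ArrayList<>();
        for (int i = 0; i < parallelism; i++) {
            buckets.add(new ArrayList<>());
        }
        for (String key : keys) {
            buckets.get(subtaskFor(key, parallelism)).add(key);
        }
        return buckets;
    }

    // Count records that a downstream keyed exchange using the SAME
    // routing function would assign to a different index than the
    // bucket they are already in.
    static int recordsThatWouldMove(List<List<String>> buckets, int parallelism) {
        int moved = 0;
        for (int i = 0; i < buckets.size(); i++) {
            for (String key : buckets.get(i)) {
                if (subtaskFor(key, parallelism) != i) {
                    moved++;
                }
            }
        }
        return moved;
    }

    public static void main(String[] args) {
        List<String> keys = Arrays.asList("a", "b", "c", "a", "d", "b");
        int parallelism = 4;
        List<List<String>> buckets = partitionInSource(keys, parallelism);
        System.out.println("records that would change index: "
                + recordsThatWouldMove(buckets, parallelism));
    }
}
```

The count is zero only because both sides use the identical function and the same parallelism. If the source used any other function, or a different parallelism, the indices would no longer line up.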
Thanks, vino.

2018-07-31 10:52 GMT+08:00 Averell <lvhu...@gmail.com>:

> Hi Vino,
>
> I'm a little bit confused.
> If I can do the partitioning from within the source function, using the
> same hash function on the key to identify the partition, would that be
> sufficient to avoid shuffling in the next keyBy call?
>
> Thanks.
> Averell