Re: Set Parallelism and keyBy

2017-01-02 Thread Jamie Grier
Domink, This should work just as you expect. Maybe the output of the print is just misleading you. The print() operation will still have a parallelism of two but the flatMap() with have a parallelism of 16 and all data elements with the same key will get routed to the same host. Any sequence of

Set Parallelism and keyBy

2016-12-26 Thread Dominik Bruhn
Hey, I have a flink job which has a default parallelism set to 2. I want to key the stream and then apply some flatMap on the keyed stream. The flatMap operation is quiet costly, so I want to have a much higher parallelism here (lets say 16). Additionally, it is important that the flatMap oper