Re: Left join with unbalanced dataset

2016-01-31 Thread Stephan Ewen
Hi! YARN killing the application seems strange. The memory use that YARN sees should not change even when one node gets a lot or data. Can you share what version of Flink (plus commit hash) you are using and whether you use off-heap memory or not? Thanks, Stephan On Sun, Jan 31, 2016 at 10:47

Re: Left join with unbalanced dataset

2016-01-31 Thread Till Rohrmann
Hi Arnaud, the unmatched elements of A will only end up on the same worker node if they all share the same key. Otherwise, they will be evenly spread out across your cluster. However, I would also recommend you to use Flink's leftOuterJoin. Cheers, Till On Sun, Jan 31, 2016 at 5:27 AM, Chiwan Pa