Re: Integrate Flink with S3 on EMR cluster

2016-04-10 Thread Stephan Ewen
You can always explicitly request a broadcast join, via "joinWithTiny", "joinWithHuge", or by supplying a JoinHint. Greetings, Stephan On Sat, Apr 9, 2016 at 1:56 AM, Timur Fayruzov wrote: > Thank you Robert. One of my test cases is broadcast join, so I need to > make statistics work. The only

Re: Does Kafka connector leverage Kafka message keys?

2016-04-10 Thread Stephan Ewen
Hi! You are right with your observations. Right now, you would have to create a "Tuple2" in the KeyedDeserializationSchema. That is what also a KeyedStream holds internally. A KeyedStream in Flink is more than just a stream that has a Key and a Value - it is also partitioned by the key, and Flink