Re: Kafka topic with an empty parition : parallelism issue and Timestamp monotony violated

2016-08-16 Thread Robert Metzger
Hi Yassine, In Flink 1.2 we've added a new feature to the Kafka consumer, allowing you to extract timestamps and emitting watermarks per partition. The consumers now have the following method: public FlinkKafkaConsumerBase assignTimestampsAndWatermarks(AssignerWithPunctuatedWatermarks assigner)

Re: Kafka topic with an empty parition : parallelism issue and Timestamp monotony violated

2016-08-15 Thread Yassine Marzougui
I think I also figured out the reason of the behavior I described when one Kafka partition is empty. According to the JavaDocs

Re: Kafka topic with an empty parition : parallelism issue and Timestamp monotony violated

2016-08-15 Thread Yassine Marzougui
I think I figured out the explanation of the first part. Looks like the stream gets distributed and merged between the source and the map operator because their parallelisms are different, and therefore the messages resulting from the map operator become out of order. The "Timestamp monotony violat

Kafka topic with an empty parition : parallelism issue and Timestamp monotony violated

2016-08-15 Thread Yassine Marzougui
Hi all, I have a Kafka topic with two partitions, messages within each partition are ordered in ascending timestamps. The following code works correctly (I'm running this on my local machine, the default parallelism is the number of cores=8): stream = env.addSource(myFlinkKafkaConsumer09) stream