Re: Watermarks with repartition

2016-02-26 Thread Aljoscha Krettek
Yes, that would be perfect. Thanks! -- Aljoscha > On 26 Feb 2016, at 20:53, Zach Cox wrote: > > Sure, want me to open a jira issue and then PR a new page into > https://github.com/apache/flink/tree/master/docs/internals, following these > instructions? http://flink.apache.org/contribute-docume

Re: Watermarks with repartition

2016-02-26 Thread Zach Cox
Sure, want me to open a jira issue and then PR a new page into https://github.com/apache/flink/tree/master/docs/internals, following these instructions? http://flink.apache.org/contribute-documentation.html -Zach On Fri, Feb 26, 2016 at 1:13 PM Aljoscha Krettek wrote: > Cool, that’s a nice wri

Re: Watermarks with repartition

2016-02-26 Thread Aljoscha Krettek
Cool, that’s a nice write up. Would you maybe be interested in integrating this as some sort of internal documentation in Flink? So that prospective contributors can get to know this stuff. Cheers, Aljoscha > On 26 Feb 2016, at 18:32, Zach Cox wrote: > > Thanks for the confirmation Aljoscha! I

Re: Watermarks with repartition

2016-02-26 Thread Zach Cox
Thanks for the confirmation Aljoscha! I wrote up results from my little experiment: https://github.com/zcox/flink-repartition-watermark-example -Zach On Fri, Feb 26, 2016 at 2:58 AM Aljoscha Krettek wrote: > Hi, > yes, your description is spot on! > > Cheers, > Aljoscha > > On 26 Feb 2016, at

Re: Watermarks with repartition

2016-02-26 Thread Aljoscha Krettek
Hi, yes, your description is spot on! Cheers, Aljoscha > On 26 Feb 2016, at 00:19, Zach Cox wrote: > > I think I found the information I was looking for: > > RecordWriter broadcasts each emitted watermark to all outgoing channels [1]. > > StreamInputProcessor tracks the max watermark received

Re: Watermarks with repartition

2016-02-25 Thread Zach Cox
I think I found the information I was looking for: RecordWriter broadcasts each emitted watermark to all outgoing channels [1]. StreamInputProcessor tracks the max watermark received on each incoming channel separately, and computes the task's watermark as the min of all incoming watermarks [2].

Watermarks with repartition

2016-02-25 Thread Zach Cox
Hi - how are watermarks passed along parallel tasks where there is a repartition? For example, say I have a simple streaming job computing hourly counts per key, something like this: val environment = StreamExecutionEnvironment.getExecutionEnvironment environment.setParallelism(2) environment.setS