Re: End of Window Marker

2019-09-02 Thread Eduardo Winpenny Tejedor
gt; @Eduardo: Thanks for sharing your approach! Not sure if I understood it >> correctly, but I think that the approach does not guarantee that all results >> of a window are emitted before the end-of-window marker is written. >> Since the sink operator and the single-task-opera

Re: End of Window Marker

2019-09-02 Thread Padarn Wilson
sure if I understood it > correctly, but I think that the approach does not guarantee that all > results of a window are emitted before the end-of-window marker is written. > Since the sink operator and the single-task-operator are separate > operators, the output records might get stuck (or be b

Re: End of Window Marker

2019-09-02 Thread Fabian Hueske
results of a window are emitted before the end-of-window marker is written. Since the sink operator and the single-task-operator are separate operators, the output records might get stuck (or be bufffered) in one of the sink tasks and the single-task would still emit an end-of-window marker record

Re: End of Window Marker

2019-08-29 Thread Eduardo Winpenny Tejedor
Hi, I'll chip in with an approach I'm trying at the moment that seems to work, and I say seems because I'm only running this on a personal project. Personally, I don't have anything against end-of-message markers per partition, Padarn you seem to not prefer this option as it overloads the meaning

Re: End of Window Marker

2019-08-27 Thread Padarn Wilson
Hi again Fabian, Thanks for pointing this out to me. In my case there is no need for keyed writing - but I do wonder if having each kafka task write only to a single partition would significantly affect performance. Actually now that I think about it, the approach to just wait for the first recor

Re: End of Window Marker

2019-08-27 Thread Fabian Hueske
Hi Padarn, Yes, this is quite tricky. The "problem" with watermarks is that you need to consider how you write to Kafka. If your Kafka sink writes to keyed Kafka stream (each Kafka partition is written by multiple producers), you need to broadcast the watermarks to each partition, i.e., each parti

Re: End of Window Marker

2019-08-16 Thread Padarn Wilson
Hi Fabian, thanks for your input Exactly. Actually my first instinct was to see if it was possible to publish the watermarks somehow - my initial idea was to insert regular watermark messages into each partition of the stream, but exposing this seemed quite troublesome. > In that case, you could

Re: End of Window Marker

2019-08-16 Thread Fabian Hueske
Hi Padarn, What you describe is essentially publishing Flink's watermarks to an outside system. Flink processes time windows, by waiting for a watermark that's past the window end time. When it receives such a WM it processes and emits all ended windows and forwards the watermark. When a sink rece

End of Window Marker

2019-08-12 Thread Padarn Wilson
Hello Users, I have a question that is perhaps not best solved within Flink: It has to do with notifying a downstream application that a Flink window has completed. The (simplified) scenario is this: - We have a Flink job that consumes from Kafka, does some preprocessing, and then has a sliding w