Hi,

Wanted to follow up as I did have similar case.

So this means it is ok for Beam to use Sliding window of 1 day with 1 sec period (with using different trigger than after watermark to avoid outputting data from every window) and there is no additional performance penalty (duplicating input messages for storage or cpu for resolving windows)? Interesting from both Flink and Dataflow perspective (both Python and Java).

I ended up implementing the logic with Beam state and timers (which is quite performant and readable) but also interested in other possibilities.

Best

Wiśniowski Piort

On 12.06.2024 21:50, Ruben Vargas wrote:
I imagined it but wasn't sure!

Thanks for the clarification!

On Wed, Jun 12, 2024 at 1:42 PM Robert Bradshaw via user
<[email protected]> wrote:
Beam implements Windowing itself (via state and timers) rather than
deferring to Flink's implementation.

On Wed, Jun 12, 2024 at 11:55 AM Ruben Vargas <[email protected]> wrote:
Hello guys

May be a silly question,

But in the Flink runner, the window implementation uses the Flink
windowing? Does that mean the runner will have performance issues like
Flink itself? see this:
https://issues.apache.org/jira/browse/FLINK-7001

I'm asking because I see the issue, it mentions different concepts
that Beam already handles at the API level. So my suspicion is that
the Beam model handles windowing a little differently from the pure
Flink app. But I'm not sure..


Regards.

Reply via email to