Hi,

I have the following situation:
- I have click stream data from a website that contains a session id.
- I then use something like EventTimeSessionWindows to group these events
per session into a WindowedStream

So I essentially end up with a stream of "finished sessions".
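To make clear what I mean by a "finished session", here is a minimal plain-Java sketch of gap-based grouping for the events of one session id. It is only an illustration and not Flink code: the names SessionGapSketch, Session and sessionize are made up, and the exact boundary handling of EventTimeSessionWindows may differ from this simplification.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; this does not use the Flink API.
public class SessionGapSketch {

    /** One finished session: its event timestamps (ms) and the moment its gap expires. */
    public static final class Session {
        public final List<Long> eventTimes = new ArrayList<>();
        public long endTime; // timestamp of the last event + gap
    }

    /**
     * Splits the time-sorted event timestamps of ONE session id into sessions.
     * A new session starts when an event arrives after the previous gap expired.
     */
    public static List<Session> sessionize(List<Long> sortedTimes, long gapMs) {
        List<Session> result = new ArrayList<>();
        Session current = null;
        for (long t : sortedTimes) {
            if (current == null || t > current.endTime) {
                current = new Session();
                result.add(current);
            }
            current.eventTimes.add(t);
            current.endTime = t + gapMs; // every event pushes the timeout forward
        }
        return result;
    }
}
```

The endTime of each session is the value I want to use to decide which file the session goes into.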

So far I am able to do this fine.

I then want to write these "finished sessions" to (Parquet) files, where
each file holds the sessions that ENDED (i.e. whose gap timeout expired)
in a similar timeframe.

So these files should be created every 5 minutes (or so) and contain all
events of the sessions that ended/timed out in those 5 minutes.
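The bucketing I have in mind could be sketched roughly as follows. This is plain Java, not Flink API: the class SessionEndBucket, its methods, the fixed 5-minute size and the directory pattern are all assumptions for illustration. The input is the time the session ended (last event plus gap), in epoch milliseconds.

```java
import java.time.Instant;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;

// Hypothetical illustration only; names and path layout are assumptions.
public class SessionEndBucket {

    /** Bucket size: 5 minutes, in milliseconds. */
    public static final long BUCKET_MS = 5 * 60 * 1000L;

    // Example layout "yyyy-MM-dd/HH-mm" in UTC; any other pattern would do.
    private static final DateTimeFormatter FORMAT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd/HH-mm").withZone(ZoneOffset.UTC);

    /** Rounds the session end time down to the start of its 5-minute bucket. */
    public static long bucketStart(long sessionEndMs) {
        return Math.floorDiv(sessionEndMs, BUCKET_MS) * BUCKET_MS;
    }

    /** Relative directory for all sessions that ended in the same bucket. */
    public static String bucketPath(long sessionEndMs) {
        return FORMAT.format(Instant.ofEpochMilli(bucketStart(sessionEndMs)));
    }
}
```

All sessions for which bucketPath returns the same value should end up in the same file (or set of files).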

What I ran into is that a WindowedStream doesn't accept a sink, so simply
creating a BucketingSink and writing the data to it doesn't work.

Can anyone please give me some pointers on how to do this correctly?

Thanks.

-- 
Best regards / Met vriendelijke groeten,

Niels Basjes
