Looks like you are missing a window *function* that processes the window. From [1]:

    stream
           .keyBy(...)               <-  keyed versus non-keyed windows
           .window(...)              <-  required: "assigner"
          [.trigger(...)]            <-  optional: "trigger" (else default trigger)
          [.evictor(...)]            <-  optional: "evictor" (else no evictor)
          [.allowedLateness()]       <-  optional, else zero
           .reduce/fold/apply()      <-  required: "function"

so for example:

    events.keyBy("carId")
          .window(EventTimeSessionWindows.withGap(Time.seconds(15)))
          .apply(...)
          .addSink(...)

Nico

[1] https://ci.apache.org/projects/flink/flink-docs-release-1.3/dev/windows.html

On Friday, 30 June 2017 13:29:17 CEST Niels Basjes wrote:
> Hi,
>
> I have the following situation:
> - I have click stream data from a website that contains a session id.
> - I then use something like EventTimeSessionWindows to group these events
> per session into a WindowedStream
>
> So I essentially end up with a stream of "finished sessions"
>
> So far I am able to do this fine.
>
> I then want to put these "finished sessions" in (parquet) files where I
> want to have files with the sessions that ENDED (or the timeout of the gap
> occurred) in a similar timeframe.
>
> So these files should be created every 5 minutes (or so) and contain all
> events of the sessions that ended/timedout in the specified 5 minutes.
>
> What I ran into is that the WindowsStream doesn't accept a sink so simply
> creating a BucketingSink and write the data doesn't work.
>
> Can anyone please give me some pointers on how to do this correctly?
>
> Thanks.
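
P.S. To make the intended result concrete, here is a minimal plain-Java sketch (no Flink dependency) of what the window function plus the sink would need to produce: one record per finished session, grouped by the 5-minute span in which the session timed out. It is only a simplified model of session-gap semantics, not Flink's implementation. The names SessionBuckets, Session, sessionize and bucketByEnd are hypothetical, and treating the session end as "last event time + gap" is an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class SessionBuckets {
    /** Hypothetical record of one finished session, as a window function might emit it. */
    static final class Session {
        final String sessionId;
        final long startMs;   // time of the first event
        final long endMs;     // assumed: time of the last event + gap (when the session times out)
        final int eventCount;

        Session(String sessionId, long startMs, long endMs, int eventCount) {
            this.sessionId = sessionId;
            this.startMs = startMs;
            this.endMs = endMs;
            this.eventCount = eventCount;
        }
    }

    /** Splits the time-sorted events of one session id wherever two events are more than gapMs apart. */
    static List<Session> sessionize(String sessionId, List<Long> sortedTimestampsMs, long gapMs) {
        List<Session> out = new ArrayList<>();
        if (sortedTimestampsMs.isEmpty()) {
            return out;
        }
        long start = sortedTimestampsMs.get(0);
        long last = start;
        int count = 1;
        for (int i = 1; i < sortedTimestampsMs.size(); i++) {
            long t = sortedTimestampsMs.get(i);
            if (t - last > gapMs) {
                // Gap exceeded: close the current session and start a new one at t.
                out.add(new Session(sessionId, start, last + gapMs, count));
                start = t;
                count = 0;
            }
            last = t;
            count++;
        }
        out.add(new Session(sessionId, start, last + gapMs, count));
        return out;
    }

    /** Groups finished sessions by the start of the bucketMs-wide interval containing their end time. */
    static Map<Long, List<Session>> bucketByEnd(List<Session> sessions, long bucketMs) {
        Map<Long, List<Session>> buckets = new TreeMap<>();
        for (Session s : sessions) {
            long bucketStart = Math.floorDiv(s.endMs, bucketMs) * bucketMs;
            buckets.computeIfAbsent(bucketStart, k -> new ArrayList<>()).add(s);
        }
        return buckets;
    }

    public static void main(String[] args) {
        // 15 second gap, 5 minute buckets, timestamps in milliseconds.
        List<Session> sessions =
                sessionize("abc", List.of(0L, 10_000L, 40_000L, 300_000L), 15_000L);
        bucketByEnd(sessions, 300_000L).forEach((bucketStart, list) ->
                System.out.println("bucket " + bucketStart + " -> " + list.size() + " session(s)"));
    }
}
```

In a Flink job the sessionize step would correspond to the session window plus the function passed to apply(), and the bucketing step to whatever the sink does with the session end time; how that maps onto BucketingSink is not covered here.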