Hello,
I have used Flink to stream data and do analytics on the stream, using time
windows...
Now, this is assuming the data is effectively coming in real time. However I
have a use case where the data is 'batched' upstream, and comes in bursts, but
has a timestamp.It obviously messes up the windowed stream assumption. (note it
is a problem with queuing in Kafka for example when there is any kind of
downtime downstream of Kafka: if data accumulates and then is consumed, it is
consumed at higher 'speed' than real clock time and statistics do not match
reality.)
So my question is:
Is it possible to use a window stream based on a timestamp key for time, as
opposed to clock time?
How would one do this with the current API?
ThanksEmmanuel