Re: Eventime window

Timo Walther Wed, 02 Aug 2017 07:30:44 -0700

The question is what defines your `10 seconds`. In event-time theincoming events determine when 10 seconds have passed. Your descriptionsounds like you want to have results after 10 secondswall-clock/processing-time. So either you use a processing-time windowor you implement a custom trigger that triggers both on event-time or ona timer that you have set after 10 s processing-time.


Timo



Am 02.08.17 um 16:20 schrieb Govindarajan Srinivasaraghavan:

Thanks Timo. The next message will arrive only after a minute or so.Is there a way to evict whatever is there in window buffer just after10 seconds irrespective of whether a new message arrives or not.
Thanks,
Govind
On Aug 2, 2017, at 6:56 AM, Timo Walther <twal...@apache.org<mailto:twal...@apache.org>> wrote:
Hi Govind,
if the window is not triggered, this usually indicates that yourtimestamp and watermark assignment is not correct. According to yourdescription, I don't think that you need a custom trigger/evictor.How often do events arrive from one device? There must be anotherevent from the same device that has a timestamp >10s in order totrigger the window evaluation.
Instead of using the Kafka timestamp, maybe you could also convertyour timestamps to UTC in the TimestampExtractor.
There are no official limitation. However, each window comes withsome overhead. So you should choose your memory/state backends andparallelism accordingly.
Hope that helps.

Timo


Am 02.08.17 um 06:54 schrieb Govindarajan Srinivasaraghavan:
Hi,
I have few questions regarding event time windowing. My scenario isdevices from various timezones will send messages with timestamp andI need to create a window per device for 10 seconds. The messageswill mostly arrive in order.
Here is my sample code to perform windowing and aggregating themessages after the window to further process it.
streamEnv.setStreamTimeCharacteristic(TimeCharacteristic.EventTime);
FlinkKafkaConsumer010 consumer = new FlinkKafkaConsumer010("STREAM1",
                    new DeserializationSchema(),
                    kafkaConsumerProperties);

DataStream<Message> msgStream = streamEnv
.addSource(consumer)
.assignTimestampsAndWatermarks(new TimestampExtractor(Time.of(100,TimeUnit.MILLISECONDS))); // TimestampExtractor implementsBoundedOutOfOrdernessTimestampExtractor
KeyedStream<Message, String> keyByStream = msgStream.keyBy(newCustomKeySelector());
WindowedStream<Message, String, TimeWindow> windowedStream =
keyByStream.window(TumblingEventTimeWindows.of(org.apache.flink.streaming.api.windowing.time.Time.seconds(10)));
SingleOutputStreamOperator<Message> aggregatedStream =windowedStream.apply(new AggregateEntries());
My questions are
- In the above code, data gets passed till the window function buteven after window time the data is not received in the applyfunction. Do I have to supply a custom evictor or trigger?
- Since the data is being received from multiple timezones and eachdevice will have some time difference, would it be ok to assign thetimestamp as that of received timestamp in the message at source(kafka). Will there be any issues with this?
- Are there any limitations on the number of time windows that canbe created at any given time? In my scenario if there are 1 milliondevices there will be 1 million tumbling windows.
Thanks,
Govind

Re: Eventime window

Reply via email to