Re: Timeseries aggregation with many IoT devices off of one Kafka topic.

2020-02-26 Thread Theo Diefenthal
Von: "Khachatryan Roman" An: "Avinash Tripathy" CC: "Theo Diefenthal" , "hemant singh" , "Marco Villalobos" , "user" Gesendet: Dienstag, 25. Februar 2020 19:08:16 Betreff: Re: Timeseries aggregation with many IoT devices off o

Re: Timeseries aggregation with many IoT devices off of one Kafka topic.

2020-02-25 Thread Khachatryan Roman
gt;> Theo >> >> >> >> >> Ursprüngliche Nachricht -------- >> Von: hemant singh >> Datum: Di., 25. Feb. 2020, 06:19 >> An: Marco Villalobos >> Cc: user@flink.apache.org >> Betreff: Re: Timeseries aggregation with many IoT devices

Re: Timeseries aggregation with many IoT devices off of one Kafka topic.

2020-02-25 Thread Avinash Tripathy
think that it is the best way to go for this usecase. > > Best regards > Theo > > > > > Ursprüngliche Nachricht > Von: hemant singh > Datum: Di., 25. Feb. 2020, 06:19 > An: Marco Villalobos > Cc: user@flink.apache.org > Betreff: Re: Timese

AW: Timeseries aggregation with many IoT devices off of one Kafka topic.

2020-02-24 Thread theo.diefent...@scoop-software.de
Hi, At last flink forward in Berlin I spoke with some persons about the same problem, where they had construction devices as IoT sensors which could even be offline for multiple days. They told me that the major problem for them was that the watermark in Flink is maintained per operator /subtask, e

Re: Timeseries aggregation with many IoT devices off of one Kafka topic.

2020-02-24 Thread hemant singh
Hello, I am also working on something similar. Below is the pipeline design I have, sharing may be it can be helpful. topic -> keyed stream on device-id -> window operation -> sink. You can PM me on further details. Thanks, Hemant On Tue, Feb 25, 2020 at 1:54 AM Marco Villalobos wrote: > I n

Timeseries aggregation with many IoT devices off of one Kafka topic.

2020-02-24 Thread Marco Villalobos
I need to collect timeseries data from thousands of IoT devices. Each device has name, value, and timestamp published to one Kafka topic. The event time timestamps are in order only relation with the individual device, but out of order with respect to other devices. Is there a way to aggregate