Kairosdb (OpenTSDB clone forCassandra) is a TSDB that does this.
MAy be you could have a look at it ?
It has a daemon process that collects and groups data points into blobs
before writing to cassandra.
--
best,
Alain
reading your description it seems like each data point you get is a self
contained item, is that right? or are you recalculating some data for the
entire block with each new message?
On Sun, Mar 19, 2017 at 9:27 PM, Nate McCall wrote:
> I think you would be better served by using a streaming sys
I think you would be better served by using a streaming system like Apache
Flink (http://flink.apache.org) and checkpointing occasionally to
Cassandra.
This is a significant increase in complexity, but you are describing a
real-time streaming use case with the need for watermarking time windows
an
If its a sliding 30 min window you will need to implement it and have an
in-memory timestamp list but out of order messages will always be a
headache. If you are ok with a fixed 30 min window (each 30 min eg 5:00,
5:30, 6:00,..) then just add a time bucket into the partition key and you
are done. O