Hello,

We store geolocation events (about one every 5 minutes) for each device
we track, and we currently have 2 TB of data. For each event I would like
to store the device_id, the timestamp of the event, the latitude and the
longitude. I thought about using device_id as the partition key and
timestamp as the clustering column. This works well because events are
naturally grouped by device (very useful for our Spark jobs). However, if
I want to retrieve all events of all devices for the last week, my
understanding is that Cassandra would have to read all of the data and
filter it, which does not seem sustainable in the long term.
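
To make the concern concrete, here is a rough sketch in plain Python. It
is not Cassandra code, only an in-memory stand-in for the layout described
above, and it assumes each device_id maps to one partition whose rows are
kept sorted by timestamp. The function names and the `partitions`
dictionary are made up for illustration.

```python
from bisect import bisect_left, insort
from collections import defaultdict

# Hypothetical stand-in for the table: one "partition" per device_id,
# rows inside a partition kept sorted by timestamp (the clustering column).
partitions = defaultdict(list)


def insert_event(device_id, ts, lat, lon):
    """Add one event to its device's partition, keeping timestamp order."""
    insort(partitions[device_id], (ts, lat, lon))


def events_for_device(device_id, since_ts):
    """Single-partition read: binary search to the first row with ts >= since_ts."""
    rows = partitions.get(device_id, [])
    start = bisect_left(rows, (since_ts,))
    return rows[start:]


def events_for_all_devices(since_ts):
    """Time-range read with no device_id: every partition has to be visited."""
    touched = 0
    result = []
    for device_id, rows in partitions.items():
        touched += 1
        start = bisect_left(rows, (since_ts,))
        result.extend((device_id, *row) for row in rows[start:])
    return result, touched
```

In this sketch, `events_for_device` only looks at one partition, while
`events_for_all_devices` visits every partition even when most of them
contain nothing recent. That is the access pattern I am worried about.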

How should I design my data model to support both access patterns?

Best Regards
