Hello, We are currently storing geolocation events (about 1 per 5 minutes) for each device we track. We currently have 2 TB of data. I would like to store the device_id, the timestamp of the event, latitude and longitude. I though about using the device_id as the partition key and timestamp as the clustering column. It is great as events are naturally grouped by device (very useful for our Spark jobs). However, if I would like to retrieve all events of all devices of the last week I understood that Cassandra will need to load all data and filter which does not seems to be clean on the long term.
How should I create my model? Best Regards