Re: Data modelling questions

2015-02-24 Thread Alexander Sicular
Seconded. Deterministic materialized keys at specific time granularities are definitely the way to go. If your frequency is high enough you could r/w data at second or ms resolution directly into memory and then roll those up into higher time resolutions on disk. The value, as noted, could be js
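A minimal sketch of what deterministic keys at several time granularities might look like, so that a reader or writer can compute a key from a timestamp alone. The key layout, the `FORMATS` table, and the function names are illustrative assumptions, not something specified in this thread or by Riak.

```python
from datetime import datetime, timezone

# Hypothetical key layout: "<series>:<granularity>:<truncated UTC timestamp>".
# Granularities are listed from finest to coarsest.
FORMATS = {
    "second": "%Y%m%dT%H%M%S",
    "minute": "%Y%m%dT%H%M",
    "hour": "%Y%m%dT%H",
    "day": "%Y%m%d",
}


def time_key(series, ts, granularity):
    """Build the key for `ts` at one granularity.

    `ts` is expected to be a timezone-aware datetime; it is normalised
    to UTC so that every client derives the same key.
    """
    ts = ts.astimezone(timezone.utc)
    return f"{series}:{granularity}:{ts.strftime(FORMATS[granularity])}"


def rollup_keys(series, ts):
    """Return the keys a data point at `ts` would roll up into."""
    return [time_key(series, ts, g) for g in FORMATS]


if __name__ == "__main__":
    ts = datetime(2015, 2, 24, 13, 5, 9, tzinfo=timezone.utc)
    for key in rollup_keys("msgs", ts):
        print(key)
```

Because each key is a pure function of the series name and the timestamp, any hour or day can be fetched directly by key, with no need to list keys first.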

Re: Data modelling questions

2015-02-24 Thread AM
On 2/23/15 1:33 PM, Jason Campbell wrote: Thanks for the info. The model looks reasonable, but something I would worry about is the availability of the key data. For example, the timestamps and msg-ids should be known without key-listing Riak (which is always a very slow operation). There i

Re: Data modelling questions

2015-02-23 Thread Jason Campbell
Thanks for the info. The model looks reasonable, but something I would worry about is the availability of the key data. For example, the timestamps and msg-ids should be known without key-listing Riak (which is always a very slow operation). There are several options for this: you can either m

Re: Data modelling questions

2015-02-23 Thread AM
On 2/22/15 6:16 PM, Jason Campbell wrote: Coming at this from another angle, if you already have a permanent data store, and you are only reporting on each hour at a time, can you run the reports based on the log itself? A lot of Riak’s advantage comes from the stability and availability of dat

Re: Data modelling questions

2015-02-22 Thread Jason Campbell
Coming at this from another angle, if you already have a permanent data store, and you are only reporting on each hour at a time, can you run the reports based on the log itself? A lot of Riak’s advantage comes from the stability and availability of data storage, but S3 is already doing that fo

Re: Data modelling questions

2015-02-22 Thread AM
Hi Jason, Christopher. This is supposed to be append-only, time-limited data. I only intend to save about 2 weeks' worth of data (which is yet another thing I need to figure out, i.e. how to vacate older data). Re: querying, for the most part the system will be building out hourly reports bas
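One possible way to vacate older data, sketched under the assumption that data is stored under deterministic hourly keys: compute the keys that have fallen out of the two-week window and delete them one by one. The `hour:` key format, the `last_purged` bookkeeping value, and the function names are hypothetical; the actual delete call depends on the client in use and is left out.

```python
from datetime import datetime, timedelta, timezone

RETENTION = timedelta(weeks=2)  # "about 2 weeks' worth of data"


def hour_key(ts):
    """Hypothetical hourly key, e.g. 'hour:20150224T10' (UTC)."""
    return ts.astimezone(timezone.utc).strftime("hour:%Y%m%dT%H")


def expired_hour_keys(now, last_purged):
    """Yield the hourly keys that are older than RETENTION.

    Covers the hours from `last_purged` (inclusive, truncated to the
    hour) up to `now - RETENTION` (exclusive, truncated to the hour).
    Both arguments are expected to be timezone-aware datetimes.
    """
    now = now.astimezone(timezone.utc)
    hour = last_purged.astimezone(timezone.utc).replace(
        minute=0, second=0, microsecond=0
    )
    cutoff = (now - RETENTION).replace(minute=0, second=0, microsecond=0)
    while hour < cutoff:
        yield hour_key(hour)
        hour += timedelta(hours=1)


if __name__ == "__main__":
    now = datetime(2015, 3, 10, 12, 30, tzinfo=timezone.utc)
    last_purged = datetime(2015, 2, 24, 10, 15, tzinfo=timezone.utc)
    for key in expired_hour_keys(now, last_purged):
        print("would delete", key)
```

The caller would record the cutoff hour after each run and pass it back as `last_purged` next time, so no key listing is needed to find what to delete.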

Re: Data modelling questions

2015-02-21 Thread Jason Campbell
I have the same questions as Christopher. Does this data need to change, or is it write-once? What information do you have when querying? - Will you already have timestamp and msg-id? - If not, you may want to consider aggregating everything into a single key. This is easier if the data isn’t

Re: Data modelling questions

2015-02-21 Thread Christopher Meiklejohn
> On Feb 20, 2015, at 5:35 PM, AM wrote:
>
> Hi All.
>
> I am currently looking at using Riak as a data store for time series data.
> Currently we get about 1.5T of data in JSON format that I intend to persist
> in Riak. I am having some difficulty figuring out how to model it such that I