Re: data schema for hourly runnning analytics

2013-09-26 Thread Aaron Morton
> CREATE TABLE requests ( > request_id UUID PRIMARY KEY, > partition_number INT, > payload ASCII > ); > > CREATE INDEX ON requests(partition_number); If reading all the request in an hour is something you do frequently than I strongly recommend modelling that with another table. e

data schema for hourly runnning analytics

2013-09-26 Thread Renat Gilfanov
Hello, We have a column family which stores incoming requests, and we would like to perform some analytics  on that data using Hadoop. The analytic results should be available pretty soon, not realtime, but within an hour or so. So we store the current hour number (calculated from timestamp) a