RE: CQL3 Data Model Question

2013-05-08 Thread Adriano Paggi
-----Original Message----- From: Hiller, Dean [mailto:dean.hil...@nrel.gov] Sent: Tuesday, May 7, 2013 05:52 p.m. To: user@cassandra.apache.org Subject: Re: CQL3 Data Model Question PlayOrm is not yet on CQL3, and Cassandra doesn't work well with 10,000+ CFs, as we went down that path and Cassandra can'

Re: CQL3 Data Model Question

2013-05-07 Thread Hiller, Dean
PlayOrm is not yet on CQL3, and Cassandra doesn't work well with 10,000+ CFs, as we went down that path and Cassandra can't cope, so we have one Cassandra CF with 60,000 virtual CFs thanks to PlayOrm, plus a few other CFs. But yes, we bucket into hour or month or whatever, depending on your rates an

Re: CQL3 Data Model Question

2013-05-07 Thread Keith Wright
So in that case I would create a different column family for each app id and then a "time bucket" key as the row key with perhaps an hour resolution? Something like this: create 123_table organic_events ( hour timestamp, event_id UUID, app_id INT, event_time TIMESTAMP, user_id INT,

Re: CQL3 Data Model Question

2013-05-07 Thread Hiller, Dean
We use PlayOrm to handle 60,000 different streams, all of which are time series, and we use PlayOrm's virtual column families so that they all live in one column family. We then partition by time as well. From what I can tell, we don't really have any hotspots. Dean From: Keith Wright mailt