On Jun 4, 2010, at 5:19 PM, Ben Browning wrote:

> How many subcolumns are in each supercolumn and how large are the
> values? Your example shows 8 subcolumns, but I didn't know if that was
> the actual number. I've been able to read columns out of Cassandra at
> an order of magnitude higher than what you're seeing here but there
> are too many variables to directly compare.

There are very few columns for each SC. About 8, but it varies a bit. The 
column names and values are pretty small. around 20-30 bytes for each column, I 
guess. So, we are talking small amounts of data here.

Yes, I know there are too many variables, but I have the feeling - as you also 
write - that the performance of this simple thing should be orders of magnitude 
better. 

So, how might I go about trying to find out why this takes so long time in my 
specific setup? Can I get timings of stuff inside cassandra itself?

> Keep in mind that the results from each thrift call has to fit into
> memory - you might be better off paging through the 23000 columns,
> reading a few thousand at a time.

Yes, I know. And I might end up doing this in the end. I do though have pretty 
hard upper limits of how many rows I will end up with for each key, but anyways 
it might be a good idea none the less. Thanks for the advice on that one.

Per

Reply via email to