A few quick questions to help me design a better schema..

asil klin Sun, 09 Jan 2011 04:05:05 -0800

1. ) If certain columns in a row get mutated too frequently or if new
columns are added to the row frequently then does the reads of old columns
that rarely get changed is also affected ? In other words, is the
performance of reads of almost infrequently changing columns in a row where
some columns are frequently updated/inserted, affected in any manner ?


2. ) Are all columns inside a super column family, supercolumns or can they
may be simple columns+supercolumns  as well ?

3. ) When row cache is enabled and certain  columns of a row are read then
will the entire row be put into the cache or just those read columns are put
into cache?

4. ) Does the larger no of column families has any impact on the
performance(I read about it somewhere)? Should information for a particular
row key be split in multiple column families according to the specific query
demands or should all data related to a particular row key be kept together
in a single column family ?

5. ) Are there any limitation of valueless column to consider. I read in a
ppt   "Only works with <= 2B columns in 0.7 valueless colum". I could
understand the meaning of this statement.

Thanks
Asil

A few quick questions to help me design a better schema..

Reply via email to