1. ) If certain columns in a row get mutated too frequently or if new columns are added to the row frequently then does the reads of old columns that rarely get changed is also affected ? In other words, is the performance of reads of almost infrequently changing columns in a row where some columns are frequently updated/inserted, affected in any manner ?
2. ) Are all columns inside a super column family, supercolumns or can they may be simple columns+supercolumns as well ? 3. ) When row cache is enabled and certain columns of a row are read then will the entire row be put into the cache or just those read columns are put into cache? 4. ) Does the larger no of column families has any impact on the performance(I read about it somewhere)? Should information for a particular row key be split in multiple column families according to the specific query demands or should all data related to a particular row key be kept together in a single column family ? 5. ) Are there any limitation of valueless column to consider. I read in a ppt "Only works with <= 2B columns in 0.7 valueless colum". I could understand the meaning of this statement. Thanks Asil