Re: Getting less write throughput due to more number of columns

2013-03-28 Thread Ted Yu
Prefix compression would lower the cost of storing a value in the rowkey. It was inspired by the long-rowkey, short-value schema design. The PREFIX and FAST_DIFF encodings are the ones most often used. Cheers On Thu, Mar 28, 2013 at 7:26 AM, Pankaj Gupta wrote: > Would prefix compression (https://issues.apache.org/j
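
To illustrate why prefix encoding helps with long rowkeys, here is a minimal Python sketch of the general idea: each key in a sorted run is stored as the number of leading bytes it shares with the previous key, plus the remaining suffix. This is not HBase's actual PREFIX or FAST_DIFF implementation; the function names and the sample key layout are hypothetical and chosen only for illustration.

```python
def shared_prefix_len(a, b):
    """Return how many leading bytes a and b have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def prefix_encode(keys):
    """Encode sorted keys as (shared_len, suffix) pairs relative to the previous key."""
    encoded = []
    prev = b""
    for key in keys:
        shared = shared_prefix_len(prev, key)
        encoded.append((shared, key[shared:]))
        prev = key
    return encoded


def prefix_decode(encoded):
    """Rebuild the original keys from (shared_len, suffix) pairs."""
    keys = []
    prev = b""
    for shared, suffix in encoded:
        key = prev[:shared] + suffix
        keys.append(key)
        prev = key
    return keys


# Hypothetical keys: one long rowkey, one family, 26 qualifiers.
keys = [b"user12345/cf:col%02d" % i for i in range(26)]
encoded = prefix_encode(keys)

raw_bytes = sum(len(k) for k in keys)
suffix_bytes = sum(len(suffix) for _, suffix in encoded)
print(raw_bytes, suffix_bytes)
```

The suffix total does not include the bytes needed to store each shared length, so the real saving is somewhat smaller than the printed difference suggests.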

Re: Getting less write throughput due to more number of columns

2013-03-28 Thread Pankaj Gupta
Would prefix compression (https://issues.apache.org/jira/browse/HBASE-4676) improve this? This is an important question in terms of schema design. Given the choice of storing a value in a column vs. the rowkey, I would often want to store a value in the rowkey if I foresee it being used for constr

RE: Getting less write throughput due to more number of columns

2013-03-25 Thread Anoop Sam John
Yes, a larger number of columns (qualifiers) can impact performance. Everywhere in HBase, storage is in terms of KVs. The key will be something like rowkey+cfname+columnname+TS... So when you have 26 cells in a put, many bytes are repeated in the key. (O
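
The repetition described above can be estimated with a rough back-of-the-envelope sketch in Python. It counts only the components named in the message (rowkey, column family name, qualifier, and an 8-byte timestamp) and ignores length prefixes and other bookkeeping bytes, so the numbers are illustrative rather than an exact on-disk size. The function name and the sample row, family, and qualifier values are hypothetical.

```python
TIMESTAMP_BYTES = 8  # HBase timestamps are 64-bit longs


def put_key_overhead(row, family, qualifiers):
    """Estimate key bytes for one put whose cells share a row and family.

    Returns (total_key_bytes, repeated_bytes), where repeated_bytes counts
    the row and family bytes written again for every cell after the first.
    """
    per_cell = [
        len(row) + len(family) + len(q) + TIMESTAMP_BYTES for q in qualifiers
    ]
    total = sum(per_cell)
    repeated = (len(row) + len(family)) * max(len(qualifiers) - 1, 0)
    return total, repeated


# Hypothetical put: a 15-byte rowkey, family "cf", 26 qualifiers col_a..col_z.
row = b"user-0000012345"
family = b"cf"
qualifiers = [b"col_" + bytes([ord("a") + i]) for i in range(26)]

total, repeated = put_key_overhead(row, family, qualifiers)
print(total, repeated)
```

With these sample values more than half of the key bytes are the rowkey and family name repeated across cells, which is the cost the message is pointing at.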