Re: Calculate number of nodes required based on data

2011-09-07 Thread Hefeng Yuan
Adi, just to make sure my calculation is correct, the configured ops threshold is ~2m, we have 6 nodes, does that mean each node's threshold is around 300k? I do see the when flushing happens, ops is about 300k, with several 500k. Seems like the ops threshold is throttling us. On Sep 7, 2011, a

Re: Calculate number of nodes required based on data

2011-09-07 Thread Adi
On Wed, Sep 7, 2011 at 2:09 PM, Hefeng Yuan wrote: > We didn't change MemtableThroughputInMB/min/maxCompactionThreshold, they're > 499/4/32. > As for why we're flushing at ~9m, I guess it has to do with this: > http://thelastpickle.com/2011/05/04/How-are-Memtables-measured/ > The only parameter I

Re: Calculate number of nodes required based on data

2011-09-07 Thread Hefeng Yuan
We didn't change MemtableThroughputInMB/min/maxCompactionThreshold, they're 499/4/32. As for why we're flushing at ~9m, I guess it has to do with this: http://thelastpickle.com/2011/05/04/How-are-Memtables-measured/ The only parameter I tried to play with is the compaction_throughput_mb_per_sec,

Re: Calculate number of nodes required based on data

2011-09-07 Thread Adi
On Wed, Sep 7, 2011 at 1:09 PM, Hefeng Yuan wrote: > Adi, > > The reason we're attempting to add more nodes is trying to solve the > long/simultaneous compactions, i.e. the performance issue, not the storage > issue yet. > We have RF 5 and CL QUORUM for read and write, we have currently 6 nodes,

Re: Calculate number of nodes required based on data

2011-09-07 Thread Hefeng Yuan
Adi, The reason we're attempting to add more nodes is trying to solve the long/simultaneous compactions, i.e. the performance issue, not the storage issue yet. We have RF 5 and CL QUORUM for read and write, we have currently 6 nodes, and when 4 nodes doing compaction at the same period, we're s

Re: Calculate number of nodes required based on data

2011-09-07 Thread Adi
On Tue, Sep 6, 2011 at 3:53 PM, Hefeng Yuan wrote: > Hi, > > Is there any suggested way of calculating number of nodes needed based on > data? > We currently have 6 nodes (each has 8G memory) with RF5 (because we want to > be able to survive loss of 2 nodes). > The flush of memtable happens arou