Hi Asad,
First, you need to understand the factors impacting cluster capacity. Some of 
the important factors to be considered while doing capacity planning of 
Cassandra are:
1.  Compaction strategy: It impacts disk space requirements and IO/CPU/memory 
overhead for compactions.
2. Replication Factor: Impacts disk space.
3. Business SLAs and Data Access patterns (read/write)
4. Type of storage: SSD will ensure that IO is rarely a problem. You may become 
CPU bound first.
Some trigger points for expanding your cluster:
1. Disk crunch. Unable to meet free disk requirements for various compaction 
strategies.
2. Overloaded nodes: tpstats/logs show  frequent dropped mutations. 
Consistently high CPU load.
3. Business SLAs not being met due to increase in reads/writes per second.
Please note that this is not an exhaustive list.
ThanksAnuj







Sent from Yahoo Mail on Android 
 
  On Thu, Jun 29, 2017 at 7:15 PM, ZAIDI, ASAD A<az1...@att.com> wrote:    
<!--#yiv0984108574 _filtered #yiv0984108574 {font-family:Helvetica;panose-1:2 
11 6 4 2 2 2 2 2 4;} _filtered #yiv0984108574 {font-family:"Cambria 
Math";panose-1:2 4 5 3 5 4 6 3 2 4;} _filtered #yiv0984108574 
{font-family:Calibri;panose-1:2 15 5 2 2 2 4 3 2 4;}#yiv0984108574 
#yiv0984108574 p.yiv0984108574MsoNormal, #yiv0984108574 
li.yiv0984108574MsoNormal, #yiv0984108574 div.yiv0984108574MsoNormal 
{margin:0in;margin-bottom:.0001pt;font-size:12.0pt;font-family:"Times New 
Roman", serif;}#yiv0984108574 a:link, #yiv0984108574 
span.yiv0984108574MsoHyperlink 
{color:blue;text-decoration:underline;}#yiv0984108574 a:visited, #yiv0984108574 
span.yiv0984108574MsoHyperlinkFollowed 
{color:purple;text-decoration:underline;}#yiv0984108574 
span.yiv0984108574EmailStyle17 {font-family:"Calibri", 
sans-serif;color:#1F497D;}#yiv0984108574 .yiv0984108574MsoChpDefault 
{font-family:"Calibri", sans-serif;} _filtered #yiv0984108574 {margin:1.0in 
1.0in 1.0in 1.0in;}#yiv0984108574 div.yiv0984108574WordSection1 {}-->
Hello Folks,
 
  
 
I’m on Cassandra 2.2.8 cluster with 14 nodes , each with around 2TB of data 
volume. I’m looking for a criteria /or data points that can help me decide when 
or  if I should add more nodes to the cluster and by how many nodes.
 
  
 
I’ll really appreciate if you guys can share your insights.
 
  
 
Thanks/Asad
 
  
 
  
 
  
 
  
   

Reply via email to