Hi Asad,

First, you need to understand the factors that affect cluster capacity. Some of the important factors to consider when capacity planning for Cassandra are:

1. Compaction strategy: it affects disk space requirements and the IO/CPU/memory overhead of compactions.
2. Replication factor: it affects disk space.
3. Business SLAs and data access patterns (read/write).
4. Type of storage: with SSDs, IO is rarely a problem; you may become CPU bound first.

Some trigger points for expanding your cluster:

1. Disk crunch: you are unable to meet the free disk requirements of your compaction strategy.
2. Overloaded nodes: nodetool tpstats or the logs show frequent dropped mutations, or CPU load is consistently high.
3. Business SLAs are not being met because of an increase in reads/writes per second.

Please note that this is not an exhaustive list.

Thanks,
Anuj
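
The disk-crunch trigger can be turned into a rough back-of-the-envelope check. The sketch below is only illustrative: the free-space fractions are common rules of thumb rather than values reported by Cassandra, the 3 TB disk size is a made-up figure, and the function names are hypothetical. It also assumes data spreads evenly across nodes after the cluster is expanded.

```python
import math

# Assumed free-disk headroom per compaction strategy (rules of thumb,
# not values taken from Cassandra). Adjust for your own workload.
REQUIRED_FREE_FRACTION = {
    "SizeTieredCompactionStrategy": 0.50,
    "LeveledCompactionStrategy": 0.10,
}


def needs_more_disk(data_tb, disk_tb, strategy):
    """Return True if a node's free space is below the assumed headroom."""
    free_fraction = (disk_tb - data_tb) / disk_tb
    return free_fraction < REQUIRED_FREE_FRACTION[strategy]


def nodes_to_add(node_count, data_tb_per_node, disk_tb, strategy):
    """Estimate how many nodes to add so each node keeps the assumed headroom.

    Assumes identical nodes and an even data distribution after rebalancing.
    """
    usable_tb = disk_tb * (1 - REQUIRED_FREE_FRACTION[strategy])
    total_tb = node_count * data_tb_per_node
    required_nodes = math.ceil(total_tb / usable_tb)
    return max(0, required_nodes - node_count)


# 14 nodes with about 2 TB each, on hypothetical 3 TB disks.
print(needs_more_disk(2.0, 3.0, "SizeTieredCompactionStrategy"))
print(nodes_to_add(14, 2.0, 3.0, "SizeTieredCompactionStrategy"))
```

With these assumed numbers, size-tiered compaction would call for 5 more nodes, while leveled compaction would call for none. Disk space is only one of the triggers; dropped mutations, CPU load, and SLA latency need to be checked separately.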
On Thu, Jun 29, 2017 at 7:15 PM, ZAIDI, ASAD A <az1...@att.com> wrote:

Hello Folks,

I'm on a Cassandra 2.2.8 cluster with 14 nodes, each with around 2 TB of data volume. I'm looking for criteria or data points that can help me decide when, or if, I should add more nodes to the cluster, and how many. I'd really appreciate it if you could share your insights.

Thanks/Asad