Hi all, In Opscenter I see one of the nodes is orange. It seems like it's working on compaction. I used nodetool compactionstats and whenever I did this the Completed nad percentage stays the same (even with hours in between). I currently don't see cpu load from cassandra on that node. So it seems stuck (somewhere mid 60%). Also some other nodes have compaction on the same columnfamily. I don't see any progress.
WARN [RMI TCP Connection(554)-192.168.0.68] 2015-11-09 17:18:13,677 ColumnFamilyStore.java (line 2101) Unable to cancel in-progress compactions for usage_record_ptd. Probably there is an unusually large row in progress somewhere. It is also possible that buggy code left some sstables compacting after it was done with them - How can I assure that nothing is happening? - Is it recommended to disable compaction from a certain data size? (I believe 25GB on each node). - Can I stop this compaction? nodetool stop compaction doesn't seem to work. - Is stopping the compaction dangerous? - Is killing the cassandra process dangerous while compacting(I did nodetool drain on one node)? This is output of nodetool compactionstats grepped for the keyspace that seems stuck. 4e48f940-86c6-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447062197972 52321301 16743606 {1:2, 4:248} 94acec50-86c8-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447063175061 48992375 13420862 {3:3, 4:245} 3210c9b0-8707-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447090067915 52763216 17732003 {1:2, 4:248} 24f96fe0-86ce-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447065564638 44909171 17029440 {1:2, 3:39, 4:209} 06d58370-86ef-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447079687463 53570365 17873962 {1:2, 3:2, 4:246} f7aa5fa0-86c7-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447062911642 47701016 13291915 {3:2, 4:246} 806a4380-86f7-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447083327416 52644411 17363023 {1:2, 2:1, 4:247} c845b900-86c5-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447061973136 48944530 16698191 {1:2, 3:6, 4:242} bb44a0b0-8718-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447097599547 48768463 13518523 {2:2, 3:5, 4:242} f2c17ea0-86c3-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447061185418 90367799 13904914 {5:4, 6:7, 7:52, 8:185} 1aae6590-86ce-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447065547369 53190698 17228121 {1:2, 4:248} d7ca8d00-86d5-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447068871120 52422499 16995963 {1:2, 3:3, 4:245} 6e890290-86df-11e5-96be-dd3c9e46ec74 mykeyspace mycolumnfamily 1447072989497 45218168 17174468 {1:2, 3:21, 4:227} I also see frequently lines like this in system.log: WARN [Native-Transport-Requests:11935] 2015-11-09 20:10:41,886 BatchStatement.java (line 223) Batch of prepared statements for [billing.usage_record_by_billing_period, billing.metric] is of size 53086, exceeding specified threshold of 5120 by 47966. Any other remarks? Thanks a lot in advance!