Can anyone speak to best practices for running manual compaction in production? Our assumption is that without it the SSTables will become too fragmented. Is this an accepted "fact"? Obviously it depends on the volume of writes, but I am looking for current production practices.
Since a full compaction takes a lot of resources and 4-5 hours at our current node size of 500 GB, running it weekly seems like a sensible option for us. Is this normal practice? Is it best to run it on all nodes at the same time, or staggered across nodes to reduce the total cluster slowdown? Given that a full compaction has a major effect on a node and its ability to function under heavy load, our assumption is that staggering it over the weekend (our low-usage time), for example, would be best. Any recommendations?

Thanks,
Wayne