On Wed, Feb 13, 2019 at 6:47 PM Jeff Jirsa <jji...@gmail.com> wrote:

> Depending on how bad data resurrection is, you should run it for any host
> that loses a range. In vnodes, that's usually all hosts.
>
> Cleanup with LCS is very cheap. Cleanup with STCS/TWCS is a bit more work.

Wait, doesn't cleanup just rewrite every SSTable one by one? Why would
compaction strategy matter? Do you mean that after cleanup STCS may pick
some resulting tables to re-compact them due to the min/max size
difference, which would not be the case with LCS?

> If you're just TTL'ing all data, it may not be worth the effort.

Indeed, but in our case the main reason to scale out is that the nodes are
running out of disk space, so we really want to get rid of the extra copies.

--
Alex