Isn't this considered a kind of *bug* in Cassandra because as we know *cleanup* is a lengthy and unreliable operation, so relying on the *cleanup* means higher chances of data resurrection? Do you think we should discard the unowned token-ranges as part of the regular compaction itself? What are the pitfalls of doing this as part of compaction itself?
Jaydeep On Thu, May 4, 2023 at 7:25 PM guo Maxwell <cclive1...@gmail.com> wrote: > compact ion will just merge duplicate data and remove delete data in this > node .if you add or remove one node for the cluster, I think clean up is > needed. if clean up failed, I think we should come to see the reason. > > Runtian Liu <curly...@gmail.com> 于2023年5月5日周五 06:37写道: > >> Hi all, >> >> Is cleanup the sole method to remove data that does not belong to a >> specific node? In a cluster, where nodes are added or decommissioned from >> time to time, failure to run cleanup may lead to data resurrection issues, >> as deleted data may remain on the node that lost ownership of certain >> partitions. Or is it true that normal compactions can also handle data >> removal for nodes that no longer have ownership of certain data? >> >> Thanks, >> Runtian >> > > > -- > you are the apple of my eye ! >