If I understand –pr correctly… 1. -pr forces only the current nodes' stables to be fixed (so I run on each node once) 2. Can I run node tool –pr repair on just 1/RF of my nodes if I do the correct nodes? 3. Without the –pr, it will fix all the stuff on the current node AND the nodes with replicas? 4. But is it true replicas may have other data that still needs repair as well so I would still need to go to those nodes and run ./nodetool –pr repair? 5. I ran nodetool –pr on EVERY node, but I still see rows that were deleted…well, I see their keys only so the columns appear to be deleted correctly. Why are the row keys still there though? (the cassandra-cli and cqlsh both have the same results). 6. The below is very confusing where it says "only first range returned by partitioner for a node is repaired". This makes it sound like only a range of keys is repaired on the current node and the other rows on the current node are not touched. How to correctly read that sentence??? Would it be more accurate to say "All the rows that this node is responsible for are repaired" vs. not using –pr means whole cluster is repaired or is that incorrect?
Begins an anti-entropy node repair operation. If the -pr option is specified, only the first range returned by the partitioner for a node is repaired. This allows you to repair each node in the cluster in succession without duplicating work. Without -pr, all replica ranges that the node is responsible for are repaired. Optionally takes a list of column family names.