If I understand –pr correctly…

 1.  -pr forces only the current nodes' stables to be fixed (so I run on each 
node once)
 2.  Can I run node tool –pr repair on just 1/RF of my nodes if I do the 
correct nodes?
 3.  Without the –pr, it will fix all the stuff on the current node AND the 
nodes with replicas?
 4.  But is it true replicas may have other data that still needs repair as 
well so I would still need to go to those nodes and run ./nodetool –pr repair?
 5.  I ran nodetool –pr on EVERY node, but I still see rows that were 
deleted…well, I see their keys only so the columns appear to be deleted 
correctly.  Why are the row keys still there though? (the cassandra-cli and 
cqlsh both have the same results).
 6.  The below is very confusing where it says "only first range returned by 
partitioner for a node is repaired".  This makes it sound like only a range of 
keys is repaired on the current node and the other rows on the current node are 
not touched.  How to correctly read that sentence???  Would it be more accurate 
to say "All the rows that this node is responsible for are repaired" vs. not 
using –pr means whole cluster is repaired or is that incorrect?

Begins an anti-entropy node repair operation. If the -pr option is specified, 
only the first range returned by the partitioner for a node is repaired. This 
allows you to repair each node in the cluster in succession without duplicating 
work.

Without -pr, all replica ranges that the node is responsible for are repaired.

Optionally takes a list of column family names.


Reply via email to