Repair does not fix inconsistency

Michal Michalski Wed, 03 Apr 2013 04:56:19 -0700

Hi,

TL;DR: I have inconsistend data (1 live row on node A & 1 tombstoned rowon node B) that do not get fixed by repair. What can be a problem?


Long version:

I have a CF containing Users' info, which I sometimes query by key, andsometimes by indexed columns like email. I'm using RF=2. I write withCL.ONE, but this CF is very rarely updated, so C* has a looot of timeto fix inconsistencies that may occur, so I'm fine with this (at leastin theory ;-) ).


To be clear:

- I've run a successfull cluster-wide repair on this CF before testing,so I do not expect any inconsistency- All indexes are built, I've rebuilt them manually before testing, so Iexpect them to work properly (I mention it because it seems to besomehow related to indexes, but I'm not sure - see below)


The problem is:

When I query (cqlsh) some rows by key (CL is default = ONE) I _always_get a correct result. However, when I query it by indexed column, itreturns nothing.


When tracing a query with CL.ALL in cqlsh, I get info that C* has:

Read 0 live cells and 1 tombstoned       // for first replica node
Read 1 live cells and 0 tombstoned       // for second replica node

When CL is ONE it's never asking second replica for data (possibly dueto DynamicSnitch scores or so), so it returns nothing.

Switching to CL >= TWO obviously fixes this problem for us, but it's notthe solution I'd like to use as I'd rather rely on fast read/writerequests with CL.ONE + frequent repairs, allowing some short-terminconsistency.

Any ideas why it may happen that data are still inconsistent afterrepair? Is there something I could have missed?

I'm mainly surprised that repair does not fix this inconsistency in ANYway - either by pulling missing data to first replica _OR_ tombstoningit on second replica. First one would be correct (delete was made a longtime ago and then the row reappeared), but both could make sense, asboth will make the data consistent. In this state it's definitelyinconsistent and I don't understand it :-)

M.

Repair does not fix inconsistency

Reply via email to