Propose new ConsistencyLevel.ALL_AVAIL for reads

AJ Thu, 16 Jun 2011 08:19:50 -0700

Good morning all.

Hypothetical Setup:
1 data center
RF = 3
Total nodes > 3


Problem:

Suppose I need maximum consistency for one critical operation; thus Ispecify CL = ALL for reads. However, this will fail if only 1 replicaendpoint is down. I don't see why this fail is necessary all of thetime since the data could have been updated since the node becameunavailable and it's data is old anyways. If only one node goes downand it has the key I need, then the app is not 100% available and itcould take some time making the node available again.


Proposal:

If all of the *available* replica nodes answer the read operation andthe latest value timestamp is clearly AFTER the time the down nodebecame unavailable, then this situation can meet the requirements for*near* 100% consistency since the value in the down node would beoutdated anyway. Clearly, the value was updated some time *after* thenode went down or unavailable. This way, you can have max availabilitywhen using read with CL.ALL... or something CL close in meaning to ALL.

I say "near" 100% consistency to leave room for some situation where theunavailable node was only unavailable to the coordinating node for somereason such as a network issue and thus still received an update by someother route after it "appeared" unavailable to the current coordinatingnode. In a situation like this, there is a chance the read will stillnot return the latest value. So, this will not be truly 100% consistentwhich CL.ALL guarantees. However, I think this logic could justify anew consistency level slightly lower than ALL, such as ALL_AVAIL.

What do you think? Is my logic correct? Is there a conflict with thearchitecture or base principles? This fits with the tunable consistencyprinciple for sure.


Thanks for listening

Propose new ConsistencyLevel.ALL_AVAIL for reads

Reply via email to