Re: Re: eliminate need to repair by using column TTL??

jonathan . colby Fri, 22 Jul 2011 03:11:43 -0700

good points Aaron. I realize now how expensive repair on reads are. I'mgoing to keep doing repairs regularly but still have a max TTL on allcolumns to make sure we don't have really old data we no longer needgetting buried in the cluster.


On , aaron morton <aa...@thelastpickle.com> wrote:

Read repair will only repair data that is read on the nodes that are upat that time, and does not guarantee that any changes it detects will bewritten back to the nodes. The diff mutations are async fire and forgetmessages which may go missing or be dropped or ignored by the recipientjust like any other message.

Also getting hit with a bunch of read repair operations is prettypainful. The normal read runs, the coordinator detects the digestmis-match, the read runs again from all nodes and they all have to returntheir full data (no digests this time), the coordinator detects thediffs, mutations are sent back to each node that needs them. All thishappens sync to the read request when the CL > ONE. Thats 2 reads withmore network IO and up to RF mutations .

The delete thing is important but repair also reduces the chance of readsgetting hit with RR and gives me confidence when it's necessary to nuke abad node.

Your plan may work but it feels risky to me. You may end up with worseread performance and unpleasent emotions if you ever have to nuke a node.Others may disagree.

Not ignoring the fact the repair can take a long time, fail, hurtperformance etc. There are plans to improve it though.

Cheers

-----------------

Aaron Morton

Freelance Cassandra Developer

@aaronmorton

http://www.thelastpickle.com

On 22 Jul 2011, at 19:55, jonathan.co...@gmail.com wrote:

> One of the main reasons for regularly running repair is to make suredeletes are propagated in the cluster, ie, data is not resurrected if anode never received the delete call.

> And repair-on-read takes care of repairing inconsistencies "on-the-fly".

> So if I were to set a universal TTL on all columns - so everythingwould only live for a certain age, would I be able to get away withouthaving to do regular repairs with nodetool?

> I realize this scenario would not be applicable for everyone, but ourdata model would allow us to do this.

> So could this be an alternative to running the (resource-intensive,long-running) repairs with nodetool?

> Thanks.

Re: Re: eliminate need to repair by using column TTL??

Reply via email to