Slowdowns during repair

Aurynn Shaw Wed, 15 Jun 2011 15:21:55 -0700

Hey all;

So, we have Cassandra running on a 5-server ring, with a RF of 3, andwe're regularly seeing major slowdowns in read & write performance whilerunning nodetool repair, as well as the occasional Cassandra crashduring the repair window - slowdowns past 10 seconds to perform a singlewrite.

The repair cycle runs nightly on a different server, so each server hasit run once a week.


We're running 0.7.0 currently, and we'll be upgrading to 0.7.6 shortly.

System load on the Cassandra servers is never more than 10% CPU andutterly minimal IO usage, so I wouldn't think we'd be seeing issuesquite like this.

What sort of knobs should I be looking at tuning to reduce the impactthat nodetool repair has on Cassandra? What questions should I be askingas to why Cassandra slows down to the level that it does, and what Ishould be optimizing?

Additionally, what should I be looking for in the logs when this ishappening? There's a lot in the logs, but I'm not sure what to look for.

Cassadra is, in this instance, backing a system that supports around amillion requests a day, so not terribly heavy traffic.


Thanks,

Aurynn

Slowdowns during repair

Reply via email to