Re: Repair taking a long, long time

Edward Capriolo Tue, 19 Jul 2011 06:41:53 -0700

On Tue, Jul 19, 2011 at 7:24 AM, Maxim Potekhin <potek...@bnl.gov> wrote:


> We have something of the order of 200GB load on each of 3 machines in a
> balanced cluster under 0.8.1.
> I started repair about 24hrs ago and did some moderate amount of inserts
> since then (a small fraction of
> data load). The repair still appears to be running. What could go wrong?
>
> Thanks,
> Maxim
>
>

Repair calculates a merkle tree on each node and then transmits differences
to its neighbours.  Because this process run on all your data it can take a
LONG time.
See http://wiki.apache.org/cassandra/LargeDataSetConsiderations

Use 'nodetool compactionstats' and 'nodetool streams' to check the progress.
My rule of thumb is do not have more storage on a node then you can compact
in 3 hours. So for that you either need fast disks or more nodes.

Re: Repair taking a long, long time

Reply via email to