Thanks for the pointer. Regarding your comment about disk speed:
I had already installed SSDs on these nodes.
Maxim
On 7/19/2011 9:41 AM, Edward Capriolo wrote:
On Tue, Jul 19, 2011 at 7:24 AM, Maxim Potekhin <potek...@bnl.gov> wrote:
We have a load on the order of 200 GB on each of 3 machines
in a balanced cluster under 0.8.1.
I started a repair about 24 hours ago and have done a moderate number of
inserts since then (a small fraction of
the data load). The repair still appears to be running. What could have
gone wrong?
Thanks,
Maxim
Repair calculates a Merkle tree on each node and then transmits
differences to its neighbours. Because this process runs on all your
data, it can take a LONG time.
See http://wiki.apache.org/cassandra/LargeDataSetConsiderations
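To illustrate why the Merkle tree step touches all of the data, here is a minimal sketch of the general idea in Python. It is not Cassandra's implementation: the bucketing by MD5 of the key, the fixed leaf count, and the names `bucket_of`, `leaf_hashes`, `build_tree`, and `differing_leaves` are all assumptions made for the example. Each node hashes every row into a leaf, the leaves are combined pairwise up to a root, and two trees are compared top-down so that only mismatching ranges need to be exchanged.

```python
import hashlib


def bucket_of(key, num_leaves):
    """Map a row key to a leaf index (hypothetical stand-in for a token range)."""
    return int(hashlib.md5(key.encode()).hexdigest(), 16) % num_leaves


def leaf_hashes(rows, num_leaves):
    """Hash every row into its leaf bucket; this is the full pass over the data."""
    buckets = [hashlib.sha256() for _ in range(num_leaves)]
    for key in sorted(rows):
        payload = key.encode() + b"\0" + rows[key].encode()
        buckets[bucket_of(key, num_leaves)].update(payload)
    return [b.digest() for b in buckets]


def build_tree(leaves):
    """Return the tree as a list of levels, leaves first and root last.

    The number of leaves must be a power of two.
    """
    levels = [leaves]
    while len(levels[-1]) > 1:
        prev = levels[-1]
        parents = [
            hashlib.sha256(prev[i] + prev[i + 1]).digest()
            for i in range(0, len(prev), 2)
        ]
        levels.append(parents)
    return levels


def differing_leaves(tree_a, tree_b):
    """Walk down from the root, descending only into subtrees that differ."""
    stack = [(len(tree_a) - 1, 0)]
    diffs = []
    while stack:
        level, i = stack.pop()
        if tree_a[level][i] == tree_b[level][i]:
            continue  # identical subtree, nothing to transfer
        if level == 0:
            diffs.append(i)
        else:
            stack.extend([(level - 1, 2 * i), (level - 1, 2 * i + 1)])
    return sorted(diffs)


# Two replicas that disagree on a single row.
node_a = {"k%d" % n: "v%d" % n for n in range(20)}
node_b = dict(node_a, k3="stale")
tree_a = build_tree(leaf_hashes(node_a, 8))
tree_b = build_tree(leaf_hashes(node_b, 8))
out_of_sync = differing_leaves(tree_a, tree_b)
```

In this sketch `out_of_sync` holds only the one leaf that contains `k3`, so the comparison itself is cheap. The expensive part is `leaf_hashes`, which has to read every row, and that is the part that scales with the amount of data on the node.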
Use 'nodetool compactionstats' and 'nodetool streams' to check the
progress. My rule of thumb is: do not keep more storage on a node than
you can compact in 3 hours. To stay within that, you need either fast
disks or more nodes.
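The 3-hour rule of thumb above can be turned into a quick back-of-the-envelope check. The sketch below is only arithmetic; the 16 MB/s sustained compaction rate is a made-up figure for illustration, not a measurement from this cluster, and the function names are hypothetical.

```python
def hours_to_compact(data_gb, throughput_mb_per_s):
    """Hours needed to compact data_gb at a sustained throughput in MB/s."""
    seconds = data_gb * 1024 / throughput_mb_per_s
    return seconds / 3600


def max_data_per_node_gb(throughput_mb_per_s, budget_hours=3):
    """Largest per-node data size (GB) that fits in the compaction time budget."""
    return throughput_mb_per_s * 3600 * budget_hours / 1024


# Assumed sustained compaction rate; measure your own before relying on this.
assumed_rate = 16  # MB/s, hypothetical

needed = hours_to_compact(200, assumed_rate)
limit = max_data_per_node_gb(assumed_rate)
print("200 GB at %d MB/s: %.2f hours" % (assumed_rate, needed))
print("3-hour budget at %d MB/s: %.2f GB per node" % (assumed_rate, limit))
```

If the computed time is above 3 hours, the rule says to raise the throughput (faster disks) or lower the per-node data size (more nodes).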