We have an application that has been designed to use potentially 100s of
keyspaces (one for each company).

One thing we are noticing is that nodetool repair across all of the
keyspaces seems to increase linearly based on the number of keyspaces. For
example, if we have a 6 node ec2 (m1.large) cluster across 3 Availability
Zones and create 20 keyspaces a nodetool repair -pr on one node takes 3
hours even with no data in any of the keyspaces. If I bump that up to 40
keyspaces it takes 6 hours.

Is this the behaviour you would expect?

Is there anything you can think of (short of redesigning the cluster to
limit keyspaces) to increase the performance of the nodetool repairs?

My obvious concern is that as this application grows and we get more
companies using our it we will eventually have too many keyspaces to
perform repairs on the cluster.

-- 
John Pyeatt
Singlewire Software, LLC
www.singlewire.com
------------------
608.661.1184
john.pye...@singlewire.com

Reply via email to