Rebalancing pain and 100 percentile stat

Elias Levy Tue, 10 Jan 2012 10:17:49 -0800

Good day,

Yesterday we went through the exercise of doubling the size of our initial
3 node cluster.  The rebalancing took four hours or so.  Each node now has
between 7 and 8 GB of data stored in the Leveldb backend with an n_val of
3.


During rebalancing the cluster became nearly useless. The CPUs where
largely pegged, and IO was hit hard, as could be expected.  Op throughput
dropped to the floor and services depending on Riak become unresponsive, as
they waited on Riak ops and then failed with timeouts.  This was in EC2
m1.large instances with ephemeral drives.

A few questions.

Is this expected behavior?  Granted, these are EC2 boxes and Leveldb
depends heavily on disk, but I can't imagine folks using Riak on production
his this type of performance hit resulting from rebalancing.

Are client ops prioritized higher than the rebalancing work?  If not, why
not?

I recall there was some mention that by using Riak EDS it was possible to
fill in a new node slowly, and then have it join the cluster when its
nearly in sync, thus limiting the stampede effect that regular rebalancing
seems to bring on.  Is that the case?  If so, is there any documentation on
this?


I am also starting to suspect the timing statistics that Riak reports.
 We've been charting them, and comparing them before and after the doubling
of the cluster size shows some odd results.  Nearly all stats in the older
nodes (mean, median, 95 percentile, 99 percentile, 100 percentile) are
about the same before and after the addition of new nodes.  This is the
case even though there are now double as many nodes to handle requests, and
each node has about half as much data as before.  The only one that did
change was the puts 100 percentile.  It increased during the rebalancing,
and then stayed at its higher value.  How is this possible?

In fact, I find the 100 percentile stat a bit incredible.  One of the boxes
is showing a 8,012 seconds 100 percentile stat (yes, I did divide by one
million to convert from microseconds).

Or are the time stats not rolling stats?  Are they computed since the start
of the node, meaning that the 100 percentile will never go down, only up?

Elias

_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Rebalancing pain and 100 percentile stat

Reply via email to