Hi Riak users, 

Before adding new nodes, the cluster only have five nodes. The member list are 
as below: 
10.21.136.66,10.21.136.71,10.21.136.76,10.21.136.81,10.21.136.86. 
We did not setup http proxy for the cluster, only one node of the cluster 
provide the http service. so the CPU load is always high on this node. 

After that, I added four nodes (10.21.136.[91-94]) to those cluster. During the 
ring/data balance progress, each node failed(riak stopped) because of disk 100% 
full. 
I used multi-disk path to "data_root" parameter in '/etc/riak/app.config'. Each 
disk is only 580MB size. 
As you know, bitcask storage engine did not support multi-disk path. After one 
of the disks is 100% full, it can not switch next idle disk. So the "riak" 
service is down. 

After that, I removed the new add four nodes at active nodes with "riak-admin 
cluster leave riak@'10.21.136.91' ". 
and then stop "riak" service on other active new nodes, reformat the above new 
nodes with LVM disk management (bind 6 disk with virtual disk group). 
Replace the "data-root" parameter with one folder, and then start "riak" 
service again. After that, the cluster began the data balance again. 
That's the whole story. 


Amao 

----- Original Message -----

From: "Dmitri Zagidulin" <dzagidu...@basho.com> 
To: "Changmao.Wang" <changmao.w...@datayes.com> 
Sent: Thursday, August 6, 2015 10:46:59 PM 
Subject: Re: why leaving riak cluster so slowly and how to accelerate the speed 

Hi Amao, 

Can you explain a bit more which steps you've taken, and what the problem is? 

Which nodes have been added, and which nodes are leaving the cluster? 

On Tue, Jul 28, 2015 at 11:03 PM, Changmao.Wang < changmao.w...@datayes.com > 
wrote: 


Hi Raik user group, 

I'm using riak and riak-cs 1.4.2. Last weekend, I added four nodes to cluster 
with 5 nodes. However, it's failed with one of disks 100% full. 
As you know bitcask storage engine can not support multifolders. 

After that, I restarted the "riak" and leave the cluster with the command 
"riak-admin cluster leave" and "riak-admin cluster plan", and the commit. 
However, riak is always doing KV balance after my submit leaving command. I 
guess that it's doing join cluster progress. 

Could you show us how to accelerate the leaving progress? I have tuned the 
"transfer-limit" parameters on 9 nodes. 

below is some commands output: 
riak-admin member-status 
================================= Membership ================================== 
Status Ring Pending Node 
------------------------------------------------------------------------------- 
leaving 6.3% 10.9% ' riak@10.21.136.91 ' 
leaving 9.4% 10.9% ' riak@10.21.136.92 ' 
leaving 6.3% 10.9% ' riak@10.21.136.93 ' 
leaving 6.3% 10.9% ' riak@10.21.136.94 ' 
valid 10.9% 10.9% ' riak@10.21.136.66 ' 
valid 12.5% 10.9% ' riak@10.21.136.71 ' 
valid 18.8% 10.9% ' riak@10.21.136.76 ' 
valid 18.8% 12.5% ' riak@10.21.136.81 ' 
valid 10.9% 10.9% ' riak@10.21.136.86 ' 

riak-admin transfer_limit 
=============================== Transfer Limit ================================ 
Limit Node 
------------------------------------------------------------------------------- 
200 ' riak@10.21.136.66 ' 
200 ' riak@10.21.136.71 ' 
100 ' riak@10.21.136.76 ' 
100 ' riak@10.21.136.81 ' 
200 ' riak@10.21.136.86 ' 
500 ' riak@10.21.136.91 ' 
500 ' riak@10.21.136.92 ' 
500 ' riak@10.21.136.93 ' 
500 ' riak@10.21.136.94 ' 

Any more details for your diagnosing the problem? 

Amao 

_______________________________________________ 
riak-users mailing list 
riak-users@lists.basho.com 
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com 





_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to