Hi Chaim,

Inline

> On Jul 6, 2014, at 1:13 AM, Chaim Solomon <ch...@itcentralstation.com> wrote:
> 
> I don't think I was quite clear in what I asked for.

My apologies for misunderstanding your previous query.

> 
> I am not asking for the ability to influence the hashing algorithm. That 
> would be a mess.
> But I would like to be able to have more influence on the distribution of 
> vnodes on the nodes - and that is something that RIAK already does.
> 
> So a command to bump a vnode off a particular node or reduce the number of 
> vnodes on a node or set the target percentage on a node would be nice. It 
> seems like the current algorithm already does something similar - but I 
> didn't see how one can influence that.
> 
> The other issue was that I would suggest taking the disk space into 
> consideration.
> If you have nodes that have different storage then balancing the data equally 
> between nodes may not be the best option. 
> It may be better to take the available disk space into consideration and move 
> vnodes to nodes that have free space if a node runs low on space.

What you refer to here would be nice, and is something referred to as "weighted 
claim." I know it's been discussed a bit in the past. Perhaps someone from 
Basho can chime in and let us know if it's on the roadmap for a future release?

> 
> One simple use case would be expanding a cluster with newer nodes (that have 
> more storage) and being able to utilise that storage. 
> 
> Another would be to be able to distribute larger partitions more evenly - in 
> particular if the size per partition is not evenly distributed.
> 
> Chaim Solomon
> 
> 
> 
>> On Thu, Jul 3, 2014 at 8:51 PM, Tom Santero <tsant...@gmail.com> wrote:
>> responses inline
>> 
>> 
>>> On Thu, Jul 3, 2014 at 2:45 AM, Chaim Solomon <ch...@itcentralstation.com> 
>>> wrote:
>>> Hi,
>>> 
>>> I'm running a 2.0.0b cluster (small) and have been running out of space on 
>>> one node. 
>>> I had expected that adding a node would lead to freeing up of space on 
>>> other nodes - but it's not working too fast.
>> 
>> Keep in mind that the speed of transfers is bound by the bandwidth available 
>> on the network as well as the speed at which you can actually read the data 
>> off disk. Once the transfers complete you should see the disk freed. 
>>  
>>> 
>>> I would suggest to add to RIAK a way to have the distribution algorithm 
>>> take free space into consideration and to move data to empty nodes fast. 
>>> Another issue is that adding the node moved most nodes from 25% to 18.8% - 
>>> but one stayed on 25% in the planner.
>> 
>> The algorithm Riak uses to determine vnode placement is non-deterministic; 
>> if you don't like any given staged vnode distribution I might suggest you 
>> run riak-admin cluster clear to undo any staged changed and attempt to add 
>> the node again, until you're content with the new plan. 
>>  
>>> 
>>> And I would also suggest adding some way to force a rebalancing of the 
>>> cluster to force nodes to take up more load if they don't have enough or 
>>> hand off load to others.
>> 
>> The hashing algorithm used by Riak to determine object placement in the ring 
>> is uniform--over time and with a greater number of total keys you'll start 
>> to see a smoother distribution across all partitions. 
>> 
>> On the fly rebalancing would be incredibly expensive, especially for users 
>> who have lots of nodes and petabytes of data stored in Riak. Ad-hoc 
>> partition handoff would most likely be brittle and error-prone, given the 
>> unreliability of the network.
>> 
>> In my humble opinion the engineers at Basho work harder than most other 
>> distributed systems developers, considering all the edge cases where systems 
>> can fail unexpectedly; I say this not to boost their egos, but rather to 
>> point out that their approach has the effect of making Riak more robust and 
>> resilient than most other distributed datastores. But such resiliency isn't 
>> free, and for these guarantees every user must pay the price. Riak might not 
>> be the fastest database, and it may even underutilize that really expensive 
>> hardware you might throw at it...but i'll be damned if it doesn't lie to me, 
>> lose my data or pretend that failures like network partitions don't happen. 
>>  
>>> 
>>> Chaim Solomon
>>> 
>>> 
>>> _______________________________________________
>>> riak-users mailing list
>>> riak-users@lists.basho.com
>>> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> 
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to