Hi Dave,

On Thu, Aug 16, 2012 at 9:41 AM, Dave Brady <dbr...@weborama.com> wrote:
> Thanks, Mark!
>
> I have a somewhat-related follow up question: is there a recommended maximum 
> for the amount of data held on a machine?
>
> I ask because at my previous company Cassandra was used, and we were advised 
> to put no more than 250 GB per physical box.  The reasoning was that in a failure
> situation, rebuilding any more than this amount of data would cause too great 
> a performance degradation.
>

So there's no hard-and-fast limit on the amount of data per node that
you shouldn't exceed when using Riak. Rather, it makes more sense to
keep in mind the resources needed when a node (or set of nodes)
becomes unreachable, dies, or is sold on eBay by a sysadmin trying to
raise money for a ping pong table. Plan appropriately for capacity and
Riak should operate as advertised. 250 GB/node shouldn't be an issue
as long as you understand the resources needed to keep the cluster
going in failure and growth scenarios. (Anecdotally, I can tell you I
know of one Riak user with >5 TB per physical node on a cluster with
tens of nodes.)
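
To make the "plan for the failure scenario" point concrete, here's a
back-of-envelope sketch (illustrative Python, not anything Riak ships;
the numbers are made up): when a node drops out, its replicas' load is
spread across the survivors, so the data each remaining node carries
goes up.

    def data_per_node_gb(total_unique_gb, n_val, nodes, failed=0):
        """Approximate replicated data per surviving node, assuming an
        even ring distribution. Real deployments also need headroom for
        hinted handoff, compaction, and growth."""
        return total_unique_gb * n_val / (nodes - failed)

    # e.g. 2 TB of unique data, n_val=3, on a 10-node cluster:
    data_per_node_gb(2048, 3, 10)      # ~614 GB/node, all nodes healthy
    data_per_node_gb(2048, 3, 10, 2)   # ~768 GB/node with two nodes down

The per-node figure you plan disk and memory around should be the
failure-scenario one, not the happy-path one.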

Ryan Zezeski also pointed out that Eric Brewer wrote a paper in 2001
[0] that touches on this and is worth a read (much like all the papers
he writes).

Hope that helps.

Mark
ricon2012.com

[0] http://www.cs.berkeley.edu/~brewer/Giant.pdf


> --
> Dave Brady
>
> ----- Original Message -----
> From: "Mark Phillips" <m...@basho.com>
> To: "Dave Brady" <dbr...@weborama.com>
> Cc: riak-users@lists.basho.com
> Sent: Tuesday, August 14, 2012 7:34:10 PM GMT +01:00 Amsterdam / Berlin / 
> Bern / Rome / Stockholm / Vienna
> Subject: Re: eLevelDB max_open_files in 1.2.0
>
> Hi Dave,
>
> On Sun, Aug 12, 2012 at 3:58 PM, Dave Brady <dbr...@weborama.com> wrote:
>> Hello,
>>
>> First I want to thank Basho for greatly expanding the documentation on the 
>> Wiki for configuring/tuning Riak and eLevelDB in 1.2.0!  Big improvement 
>> over 1.1.x.
>>
>> My question is about max_open_files: here the documentation is confusing to 
>> me.
>>
>> It says to allocate one open file per 2 MB, then divide by the number of 
>> partitions.  This is the same formula used in 1.1.x.
>>
>> It goes on to say that if you manually set this parameter in 1.1.x, you should 
>> divide that value by two for 1.2.0.
>>
>> Should not the formula for 1.2.0, in that case, read as use one file per 4 
>> MB?
>
> Long story short, the answer is "yes" :)
>
> With 1.2, 4 MB is the advised file size, and you should be running with
> no fewer than 20 open files per node. I'll take a pass at updating the docs to
> make this a bit easier to understand. Thanks for pointing that out.
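
(For anyone doing the arithmetic at home, here's a rough sketch of that
formula in illustrative Python; the function and example numbers are
mine, not something from the wiki:

    def max_open_files(data_per_node_gb, partitions_per_node, file_mb=4):
        """Rule of thumb from this thread for Riak 1.2: one open file
        per 4 MB of data, divided by the number of partitions (vnodes)
        the node hosts, with a floor of 20."""
        total_files = data_per_node_gb * 1024 / file_mb
        return max(20, int(total_files / partitions_per_node))

    # e.g. 250 GB on a node hosting 16 vnodes of a 64-partition ring:
    max_open_files(250, 16)   # -> 4000

Plugging in file_mb=2 gives the 1.1.x value, which is exactly the
divide-by-two relationship you noticed.)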
>
> Mark
>
>>
>> Thanks!
>>
>> --
>> Dave Brady
>>

_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
