OK, how about the case I described? It runs out of memory with a single 
pair of nodes...

(Or did you mean there's a connection between each pair of vnodes?)
On Wednesday, April 13, 2011 at 1:56 PM, Jon Meredith wrote:
> Hi Greg et al,
> 
> As you say, the largest known is not the largest possible. Internally within 
> Basho, the largest cluster we've experimented with so far had 50 nodes.
> 
> Beyond that, I can only speculate about pain points. 
> 
> 1) It is true that you need enough file descriptors to start up all 
> partitions when a node restarts - Riak checks if there is any handoff data 
> pending for each partition. We have work scheduled to address that in the 
> medium term. The plan is to spin up only the partitions the node owns, plus 
> any that were started as fallbacks and have not yet completed handoff. Until 
> that work is done, you will need a high ulimit with large ring sizes. 
> 
> 2) It is also true that Erlang runs a fully connected network, so there will 
> be connections between each node pair in the cluster. We haven't determined 
> the point at which it becomes a problem. 
> 
> So it looks like you'll be pushing the known limits. Basho will do our very 
> best to help overcome any obstacles as you encounter them.
> 
> Jon Meredith
> Basho Technologies.
> 
> On Wed, Apr 13, 2011 at 1:41 PM, Greg Nelson <gro...@dropcam.com> wrote:
> > The largest known riak cluster != the largest possible riak cluster. ;-)
> > 
> > The inter node communication of the cluster depends on the data set and 
> > usage pattern, doesn't it? Or is there some constant overhead that tops out 
> > at a few hundred nodes? I should point out that we'll have big data, but 
> > not a huge number of keys. 
> > 
> > The number of vnodes in the cluster should be equal to the 
> > ring_creation_size under normal circumstances, shouldn't it? So when I have 
> > a one node cluster, that node is running ring_creation_size vnodes... File 
> > descriptors probably aren't a problem -- these machines won't be doing 
> > anything else, and the limits are set to 65536. 
> > 
> > Thinking about the inter-node communication you mentioned, that's probably 
> > where the resources are going: socket buffers, etc.
> > 
> > Anyway, I'd also love to hear more from basho. :)
> > On Wednesday, April 13, 2011 at 12:33 PM, sicul...@gmail.com wrote:
> > > I'll just chime in and say that this is not practical for a few reasons. 
> > > The largest known Riak cluster has like 50 or 60 nodes. AFAIK, inter-node 
> > > communication in Erlang clusters tops out at a few hundred nodes. I'm also 
> > > under the impression that each physical node has to have enough file 
> > > descriptors to accommodate every virtual node in the cluster. 
> > > 
> > > I'd love to hear more from basho. 
> > > 
> > > -alexander 
> > > 
> > > 
> > > 
> > > -----Original Message-----
> > > From: Greg Nelson <gro...@dropcam.com>
> > > Sender: riak-users-boun...@lists.basho.com
> > > Date: Wed, 13 Apr 2011 12:13:34 
> > > To: <riak-users@lists.basho.com>
> > > Subject: Large ring_creation_size
> > > 
> > > _______________________________________________
> > > riak-users mailing list
> > > riak-users@lists.basho.com
> > > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> > > 
> > 
> > 
> 
> 

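A rough sketch of the arithmetic behind the two limits discussed in this
thread: a fully connected Erlang network has one connection per node pair,
and a restarting node currently starts every partition in the ring. The
function names and the fds_per_partition parameter are hypothetical, chosen
only for illustration. The thread does not say how many descriptors each
partition needs, so that value is an assumption to replace with a measured
one.

```python
def node_pair_connections(nodes: int) -> int:
    """Cluster-wide connections in a full mesh: one per unordered node pair."""
    return nodes * (nodes - 1) // 2


def connections_per_node(nodes: int) -> int:
    """Each node holds one connection to every other node."""
    return max(nodes - 1, 0)


def startup_fds_needed(ring_size: int, fds_per_partition: int = 1) -> int:
    """Descriptors needed if a restarting node starts every partition.

    fds_per_partition is an assumed figure, not something Riak reports.
    """
    return ring_size * fds_per_partition


def fits_fd_limit(ring_size: int, fd_limit: int, fds_per_partition: int = 1) -> bool:
    """True if the startup estimate stays within the given descriptor limit."""
    return startup_fds_needed(ring_size, fds_per_partition) <= fd_limit


if __name__ == "__main__":
    # 50 nodes is the largest cluster mentioned in the thread;
    # 65536 is the descriptor limit Greg quotes. The ring size of
    # 16384 and 4 descriptors per partition are made-up inputs.
    print(node_pair_connections(50), connections_per_node(50))
    print(fits_fd_limit(16384, 65536, fds_per_partition=4))
```

Note that the connection count depends only on the number of nodes, not on
ring_creation_size, while the descriptor estimate depends only on the ring
size, so the two limits can be checked separately.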