I thought I knew the cause of this problem. I do not. We need to await input from others.
My apologies. Other basic questions: what version of Riak are you running, what is in your app.config, how many servers/nodes do you have, and is there any reason this one node is "different"?

Matthew

On Mar 19, 2014, at 5:30 PM, Michael Dillon <mdillon...@pagefreezer.com> wrote:

> We are using Amazon EC2 m3.x2large nodes and while the freeze is occurring
> free reports
>
>              total       used       free     shared    buffers     cached
> Mem:      30623232    8818792   21804440          0      88092    4411832
> -/+ buffers/cache:    4318868   26304364
> Swap:            0          0          0
>
> The Erlang processes seem to be unkillable because "shutdown -r now" is also
> hanging. Right now these nodes are just being used for some testing, but
> eventually we will go into production and I really need to have a plan for
> how to detect and then deal with these Erlang freezes. Or better yet, a way
> to avoid them even if it means detecting some condition in advance and then
> rebooting the node.
>
> On Wed, Mar 19, 2014 at 2:07 PM, Matthew Von-Maszewski <matth...@basho.com>
> wrote:
>
> Any chance you are overflowing into swap? Or in the case of XEN have you
> exceeded the guaranteed RAM for the VM memory and moved into the disk backed
> portion of "ram"?
>
> What backend do you use within riak? Do you have memory statistics from
> before and after the seizure/freeze?
>
> Matthew
>
> On Mar 19, 2014, at 4:56 PM, Michael Dillon <mdillon...@pagefreezer.com>
> wrote:
>
> > I've run into a problem with Riak freezing completely on one node running
> > on Ubuntu 12.04 LTS on a XEN VM (EC2). If I ssh into the node and run "ps
> > ax" that shell session also freezes. I also tried another ssh session with
> > "netstat -lnp" to see if I could find the process ID to kill, but that also
> > froze.
> >
> > I must admit that I have seen a similar problem with RabbitMQ running on
> > Ubuntu 10 LTS on an OpenVPS VM a few years ago.
> >
> > I suppose this is an Erlang issue of some sort, but I would really like
> > some way to kill the Riak processes without a reboot if possible.
> >
> > --
> > PageFreezer.com
> > #200 - 311 Water Street
> > Vancouver, BC V6B 1B8
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com