Re: Crashed node has Bitcask merge errors on restart

Jeff Pollard Fri, 05 Aug 2011 08:00:50 -0700

Hey David,

Thanks again for all your help, much appreciated.  We've since restored from
our backup and the node appears healthy now.  I checked the logs and saw
successful bitcask merges and read repairs were happening, so all appears
well.


I also upped the ulimit -n value based on your observation.

On Fri, Aug 5, 2011 at 7:07 AM, David Smith <diz...@basho.com> wrote:

> On Fri, Aug 5, 2011 at 6:49 AM, Jeff Pollard <jeff.poll...@gmail.com>
> wrote:
> > Update: now the node has crashed, due to the following lines in the
> > sasl-error.log (see below).  I've also attached the crash dump to this
> > email.
> > Real quickly though, just to confirm - If we wanted to restore the node
> from
> > a recent backup, the procedure is as simple as:
> >
> > Stop the node.
> > Restore the bitcask and ring directories from a recent backup (~12 hours
> > old) to the node
> > Start the node
> >
> > That correct?  Any gotchas or anything else I should know about that
> > process?
>
> You really only need to do the bitcask dirs, not the ring; also make
> sure you move aside the bitcask files created interstitially.
>
> The {error, emfile} problem shows that you're hitting the ulimit -n
> for your process. Make sure your ulimit is set appropriately. It can
> also be helpful to remove any 0-length *.bitcask.data files.
>
> D.
>
> --
> Dave Smith
> Director, Engineering
> Basho Technologies, Inc.
> diz...@basho.com
>

_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Re: Crashed node has Bitcask merge errors on restart

Reply via email to