Hi Joel,

That's not a message I'd expect to see on a clean restart.  We'll need some
more information to diagnose it.  Next time it crashes, could you send the
contents of your ring file?  You can just grab the most recent one from
/var/lib/riak/ring (the location may vary depending on your platform).  It
would also be very helpful if you could modify your deploy script to capture
the file list for the leveldb directory on *all* of your nodes immediately
before you bounce Riak to do the update.  When it crashes, the console.log
from all the nodes would also be useful.  If any of those files contain
sensitive information, please contact me off list.
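
If it helps, here is a rough sketch of the capture step, written in Python
on the assumption that your deploy script can call out to it on each node
just before stopping Riak.  The function names and the output location are
made up for illustration; the leveldb path is the one from your error
message and the ring path is the default mentioned above, so adjust both
for your platform.

```python
import os
import tempfile
import time


def snapshot_file_list(data_dir, out_path):
    """Write one line per file under data_dir: size, mtime, relative path.

    Returns the number of files recorded.
    """
    lines = []
    for root, _dirs, files in os.walk(data_dir):
        for name in files:
            path = os.path.join(root, name)
            try:
                st = os.stat(path)
            except OSError:
                # The file disappeared between listing and stat; skip it.
                continue
            rel = os.path.relpath(path, data_dir)
            lines.append("%d\t%d\t%s" % (st.st_size, int(st.st_mtime), rel))
    # Sort by relative path so listings from different runs diff cleanly.
    lines.sort(key=lambda line: line.split("\t", 2)[2])
    with open(out_path, "w") as out:
        out.write("\n".join(lines) + ("\n" if lines else ""))
    return len(lines)


def newest_file(directory):
    """Return the most recently modified regular file in directory, or None."""
    if not os.path.isdir(directory):
        return None
    candidates = [os.path.join(directory, n) for n in os.listdir(directory)]
    candidates = [p for p in candidates if os.path.isfile(p)]
    return max(candidates, key=os.path.getmtime) if candidates else None


if __name__ == "__main__":
    # Assumed locations; change these to match your installation.
    leveldb_dir = "/var/lib/riak/leveldb"
    ring_dir = "/var/lib/riak/ring"
    stamp = time.strftime("%Y%m%dT%H%M%S")
    out_path = os.path.join(tempfile.gettempdir(),
                            "leveldb-files-%s.txt" % stamp)
    count = snapshot_file_list(leveldb_dir, out_path)
    print("recorded %d files in %s" % (count, out_path))
    print("newest ring file: %s" % newest_file(ring_dir))
```

Running it on every node before the bounce, and keeping the listings
alongside the console.log files, should let us see whether the MANIFEST
file was already missing before the restart.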

BR, Jon

On Tue, Nov 15, 2011 at 6:48 AM, Jeremy Raymond <jeraym...@gmail.com> wrote:

> I'm using Riak 1.0.1 and I have a script that deploys updates to each of
> my 3 nodes to update the Erlang mapred modules. What I do is stop a node,
> deploy the new mapred modules, restart the node, wait for the riak_kv
> service to start, then move onto the next node. Sometimes when I do this
> one of the nodes that is not the current one being updated will go down.
> Each time this has happened thus far it's been the same node that will go
> down (the last one). I see this error in the logs:
>
> [error] Failed to start riak_kv_eleveldb_backend Reason: {db_open,"IO
> error:
> /var/lib/riak/leveldb/913438523331814323877303020447676887284957839360/MANIFEST-000002:
> No such file or directory"}
>
> If I manually restart the node, things go back to normal. Any ideas on
> what's going on? I've attached the error log.
>
> --
>  Jeremy
>
> _______________________________________________
> riak-users mailing list
> riak-users@lists.basho.com
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>
>


-- 
Jon Meredith
Platform Engineering Manager
Basho Technologies, Inc.
jmered...@basho.com
