Hi,

I'm running a 4 member riak cluster that failed today for unknown reasons.
At a particular point in time, I noticed that the HTTP riak connector began
refusing connections to my client

java.lang.RuntimeException: com.basho.riak.client.RiakRetryFailedException:
com.basho.riak.client.http.response.RiakIORuntimeException:
org.apache.http.conn.HttpHostConnectException: *Connection to
http://localhost:XXXX refused*

I initiated a start command on all clusters just in case some of them were
down. Cluster 1 (main) and cluster 4 accepted the start command. The others
maintained that they were already running.

After the system was back up I began my hunt for error / crash reports and
found nothing substantial. The error.log file on cluster 1 contained this

2012-12-06 08:44:35.322 [error] <0.15539.62> gen_fsm <0.15539.62> in state
wait_pipeline_shutdown terminated with reason: *{sink_died,normal}*
2012-12-06 08:44:35.423 [error] <0.15539.62> CRASH REPORT Process
<0.15539.62> with 0 neighbours crashed with reason: {sink_died,normal}
2012-12-06 08:44:35.428 [error] <0.164.0> Supervisor riak_pipe_builder_sup
had child undefined started with {riak_pipe_builder,start_link,undefined}
at <0.15539.62> exit with reason {sink_died,normal} in context
child_terminated

However I believe this error is normal, as mentioned here -
http://riak-users.197444.n3.nabble.com/Safe-to-ignore-a-sink-died-normal-error-td3824989.html

I ran riaknostic on the failed cluster through the 'riak-admin diag'
command and it did not find any crash reports either. I'm
running riak-1.1.4.

Do you have any pointers / advise that I could use ? Any help is
appreciated.

Thank you
-Deepak
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to