Hi Sargun,

>>1) I recommend you have a 5-node cluster:

We'll add another node for sure.

>> 2) What version of Riak are you using?

2.0.1

>> 3) What backend(s) are you using?

leveldb

>> 4) What's the size of your keyspace?

Not sure what to answer here.


>> 5) Are you actively rewriting keys, or writing keys to the cluster?

Yes, plenty of other operations on this cluster for reading/writing etc.
This particular key does not get written to anymore once its initially
created.

>> 6) Do you know how much I/O the cluster is currently doing?

Very little at the moment (almost non-existant) as we haven't ported any of
our clients onto this new system just yet - we ran a load test against our
system and then 2 days later without any changes at all we started noticing
this timeout issue.


We turned on debug messages temporarily and have noticed this in the logs:

2014-12-29 11:51:44.957 [error] <0.14490.1> gen_fsm <0.14490.1> in state
waiting_vnode_r terminated with reason: bad argument in call to
erlang:hd([]) in riak_object:most_recent_content/1 line 228
2014-12-29 11:51:44.958 [error] <0.14490.1> CRASH REPORT Process
<0.14490.1> with 0 neighbours exited with reason: bad argument in call to
erlang:hd([]) in riak_object:most_recent_content/1 line 228 in
gen_fsm:terminate/7 line 622



Thanks,
Jason

[image: photo]
*Jason Ryan*
VP Engineering

Trustev
Real Time, Online Identity Verification

email: jason.r...@trustev.com
skype: jason_j_ryan
web: www.trustev.com

Trustev Ltd, 2100 Cork Airport Business Park, Cork, Ireland.

On 29 December 2014 at 11:55, Sargun Dhillon <sar...@sargun.me> wrote:

> Several things:
> 1) I recommend you have a 5-node cluster:
> http://basho.com/why-your-riak-cluster-should-have-at-least-five-nodes/
> 2) What version of Riak are you using?
> 3) What backend(s) are you using?
> 4) What's the size of your keyspace?
> 5) Are you actively rewriting keys, or writing keys to the cluster?
> 6) Do you know how much I/O the cluster is currently doing?
>
> On Mon, Dec 29, 2014 at 2:51 AM, Jason Ryan <jason.r...@trustev.com>
> wrote:
> > Hi,
> >
> > We are getting random timeouts from our application (>60seconds) when we
> try
> > to retrieve a key from our Riak cluster (4 nodes with a load balancer in
> > front of them). Our application just uses the standard REST API to query
> > Riak.
> >
> > We are pretty new to Riak - so would like to understand how best to debug
> > this issue? Is there any good pointers on what to start with? This is our
> > production cluster.
> >
> > Thanks,
> > Jason
> >
> >
> > This message is for the named person's use only. If you received this
> > message in error, please immediately delete it and all copies and notify
> the
> > sender. You must not, directly or indirectly, use, disclose, distribute,
> > print, or copy any part of this message if you are not the intended
> > recipient. Any views expressed in this message are those of the
> individual
> > sender and not Trustev Ltd. Trustev is registered in Ireland No. 516425
> and
> > trades from 2100 Cork Airport Business Park, Cork, Ireland.
> >
> >
> > _______________________________________________
> > riak-users mailing list
> > riak-users@lists.basho.com
> > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> >
>

-- 


This message is for the named person's use only. If you received this 
message in error, please immediately delete it and all copies and notify 
the sender. You must not, directly or indirectly, use, disclose, 
distribute, print, or copy any part of this message if you are not the 
intended recipient. Any views expressed in this message are those of the 
individual sender and not Trustev Ltd. Trustev is registered in Ireland No. 
516425 and trades from 2100 Cork Airport Business Park, Cork, Ireland.
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to