Not to jump to conclusions, but this sounds like a quorum issue: for example, your writes may be reaching only one or two of the three servers rather than all three. Handoff occurs when a node goes down unexpectedly. It never attempts to fetch data that was not written; it only fetches data from the other servers that were still up at the point the node went down. There are some configuration values you could experiment with that acknowledge a write only once it has reached a given number of durable-storage replicas. But keep in mind there's no guarantee that vnodes are assigned to unique servers.
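
For reference, here is a minimal sketch of the quorum arithmetic and of the HTTP calls for reading and setting a bucket's n / r / w values. It assumes Riak's bucket-properties endpoint (/buckets/<bucket>/props) on the default HTTP port 8098 of a local node. The bucket name "chat" and the helper names are hypothetical, and treating "dw" as the durable-write count mentioned above is my assumption. The requests are only built here, not sent.

```python
import json
from urllib.request import Request

# Assumption: a Riak node listening on the default HTTP port.
RIAK_URL = "http://127.0.0.1:8098"


def quorums_overlap(n_val, r, w):
    """True when every read quorum must intersect every write quorum."""
    return r + w > n_val


def build_get_props_request(bucket):
    """Build (but do not send) a request that reads a bucket's properties."""
    return Request(url=f"{RIAK_URL}/buckets/{bucket}/props", method="GET")


def build_set_props_request(bucket, n_val, r, w, dw):
    """Build (but do not send) a request that sets replication properties.

    dw is assumed to be the number of replicas that must reach durable
    storage before the write is acknowledged.
    """
    body = json.dumps(
        {"props": {"n_val": n_val, "r": r, "w": w, "dw": dw}}
    ).encode("utf-8")
    return Request(
        url=f"{RIAK_URL}/buckets/{bucket}/props",
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


if __name__ == "__main__":
    req = build_set_props_request("chat", n_val=3, r=2, w=2, dw=2)
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    print("quorums overlap:", quorums_overlap(3, 2, 2))
```

To actually send a request, pass it to urllib.request.urlopen. With n_val=3, choosing r=2 and w=2 satisfies r + w > n_val, but that overlap only helps if the three replicas really live on distinct servers, which, as noted above, is not guaranteed.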

On Tue, May 24, 2016 at 4:08 PM, Vikram Lalit <vikramla...@gmail.com> wrote:
> It's returning no object at all for the relevant key. That too is random -
> every few calls it returns but then it doesn't.
>
> On May 24, 2016 4:06 PM, "Sargun Dhillon" <sar...@sargun.me> wrote:
>>
>> What do you mean it's not returning? It's returning stale data? Or
>> it's erroring?
>>
>> On Tue, May 24, 2016 at 7:34 AM, Vikram Lalit <vikramla...@gmail.com>
>> wrote:
>> > Hi - I'd appreciate if someone can opine on the below behavior of Riak
>> > that I am observing... is that expected, or something wrong in my
>> > set-up / understanding?
>> >
>> > To summarize, I have a 3-node Riak cluster (separate EC2 AWS instances)
>> > with a separate chat server connecting to them. When I write data on
>> > the Riak nodes, the process is successful and I can read all data
>> > correctly. However, as part of my testing, if I deliberately bring down
>> > one node (and then remove it from the cluster using riak-admin cluster
>> > force-remove / plan / commit), the client API is not able to fetch all
>> > the written data. In fact, there is an alternation of success and
>> > failure which happens rather randomly.
>> >
>> > My initial suspicion was that it would be happening only during the
>> > time the rebalancing is occurring (i.e. riak-admin ring-status is not
>> > fully settled) but I've seen this sporadic behavior post the same too.
>> >
>> > Does this have to do with the n and r values for the cluster and given
>> > that 1 node is down, the cluster does not succeed in returning results
>> > reliably? Also, does this mean that during the time a cluster is being
>> > rebalanced (even incl. addition of new nodes), the results could be
>> > arbitrary - that doesn't sound correct to me?
>> >
>> > Appreciate if someone can throw some light here? Also, the HTTP API
>> > calls to retrieve and set the n / r / w values for a specific bucket -
>> > couldn't locate the same!
>> >
>> > Thanks much!
>> > Vikram
>> >
>> > _______________________________________________
>> > riak-users mailing list
>> > riak-users@lists.basho.com
>> > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com