Hi Praveen,

There are a few things that could be contributing to the 404s.
The most likely issue has to do with your "n" and "r" values. With an "r" of 1
and a laggy node in your cluster, you could end up in a situation that
resembles a netsplit. (At the moment, Riak copes better with downed nodes than
with laggy ones.) If you're only waiting for 1 of 2 existing replicas and have
"{notfound_ok, true}", a laggy response will result in a notfound being
returned to the client.

Potential solutions to this (assuming it's the problem):

* consider retrying a number of times in your client logic; there is some
  information about why you might want to do this on the wiki [0]
* change to "{notfound_ok, false}", because otherwise a notfound that comes
  back as the first response will be treated as valid
* increase R to 2

There are a few other potential culprits, but this is the most likely. Let us
know if any of that helps.

Mark

[0] http://wiki.basho.com/Client-Implementation-Guide.html

On Mon, Sep 17, 2012 at 4:01 AM, Praveen Baratam <praveen.bara...@gmail.com> wrote:
> The frequency of error is now more common. Upto 1 failed request in 10. This
> is breaking everything.
>
>
> On Mon, Sep 17, 2012 at 3:04 PM, Praveen Baratam <praveen.bara...@gmail.com>
> wrote:
>>
>> Here are some more details about the cluster.
>>
>> {ring_creation_size, 1024},
>>
>> {default_bucket_props, [
>>     {n_val, 2},
>>     {r, 1},
>>     {w, 1},
>>     {allow_mult, false},
>>     {last_write_wins, false},
>>     {precommit, []},
>>     {postcommit, []},
>>     {chash_keyfun, {riak_core_util, chash_std_keyfun}},
>>     {linkfun, {modfun, riak_kv_wm_link_walker, mapreduce_linkfun}}
>> ]},
>>
>> {storage_backend, riak_kv_multi_backend},
>> {multi_backend_default, <<"store">>},
>> {multi_backend, [
>>     {<<"store">>, riak_kv_bitcask_backend, [
>>         {data_root, "/var/lib/riak/bitcask/store"}
>>     ]},
>>     {<<"cache">>, riak_kv_bitcask_backend, [
>>         {data_root, "/var/lib/riak/bitcask/cache"},
>>         {expiry_secs, 21600}
>>     ]}
>> ]},
>>
>> On using Riaknostic I am seeing these errors.
>>
>> [warning] The following preflists do not satisfy the n_val:
>> [[{973382926425464638881751031164555683013033197568,'riak@178.238.237.241'},.....
>> a big list similar to this...]
>>
>> Please help. We are stuck.
>>
>> Praveen
>>
>>
>>
>> On Mon, Sep 17, 2012 at 4:18 AM, Praveen Baratam
>> <praveen.bara...@gmail.com> wrote:
>>>
>>> Hello Everybody,
>>>
>>> We are currently alpha testing out our Webapp which uses Riak for storing
>>> binary files (< 2 MB) with n_val as 2. We currently have 3 nodes in the
>>> cluster and all files are stored in the same bucket. We are using Bitcask
>>> engine.
>>>
>>> Sometimes we see 404 error cropping up when accessing these keys. This
>>> happens only once in a day or so and all existing keys are accessible most
>>> of the time. We are only trying to access the keys that we have put in Riak
>>> so there is no doubt of their existence.
>>>
>>> Can a network split cause such an error? Is there anything we can do to
>>> avoid such corner cases.
>>>
>>> Thank you
>>>
>>> Praveen
>>
>>
>
>
> _______________________________________________
> riak-users mailing list
> riak-users@lists.basho.com
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
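
P.S. For reference, here is roughly what the "{notfound_ok, false}" and
"increase R to 2" suggestions might look like when applied to the
default_bucket_props quoted above. This is only a sketch built from that quoted
config: the other entries are copied unchanged, and I am assuming the
settings live in app.config. The two changes were offered as separate
options, so either can be tried on its own.

```erlang
%% Sketch only: default_bucket_props from the quoted config with the
%% read-side changes suggested in this thread. All other entries unchanged.
{default_bucket_props, [
    {n_val, 2},
    {r, 2},                 %% was 1: wait for both replicas before replying
    {notfound_ok, false},   %% a notfound no longer counts as a valid R response
    {w, 1},
    {allow_mult, false},
    {last_write_wins, false},
    {precommit, []},
    {postcommit, []},
    {chash_keyfun, {riak_core_util, chash_std_keyfun}},
    {linkfun, {modfun, riak_kv_wm_link_walker, mapreduce_linkfun}}
]},
```

As far as I know, changes to default_bucket_props need a node restart to take
effect and only apply to buckets whose properties have not been set
explicitly, so it is worth checking the bucket's actual properties afterwards.
Also keep in mind that with n_val 2, an R of 2 means a read needs both
replicas to answer, which trades some availability for fewer false notfounds.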