Hi Dan,

Thanks for your reply.
Indeed, I saw the "insufficient_vnodes_available" error message with
only one node on the machine.
What we did was just install the official 1.0.2's deb version and
modifying vm.args and app.config.

The only changes in our setting are:
A. app.config:
- (http ip ..
- {pb_ib ..
- storage_backend points to storage_backend, riak_kv_eleveldb_backend

B. vm.args
-name
-setcookie

And the db ran smoothly for many days, then crashed for the 1st time.
It got recovered after we adjusted the ulimit.
Then it crashed again with the error message as above.

We don't have the other vnodes up-and-running in the machine at this
moment, though I was thinking it might be a solution and trying.
I tried to add an additional node and join the original node, but
fails. Then I leave it as it is.
Odd is, the DB comes back.

Here is our riak-admin info:

1. We don't have MapReduce procedures in it.

2. "#riak-admin status" shows:

Attempting to restart script through sudo -u riak
1-minute stats for 'riak@my-ip-address'
-------------------------------------------
vnode gets : 138
vnode_puts : 78
vnode_index_reads : 0
vnode_index_writes : 0
vnode_index_writes_postings : 0
vnode_index_deletes : 0
vnode_index_deletes_postings : 0
read_repairs : 1
vnode_gets_total : 490203
vnode_puts_total : 199436
vnode_index_reads_total : 0
vnode_index_writes_total : 0
vnode_index_writes_postings_total : 0
vnode_index_deletes_total : 0
vnode_index_deletes_postings_total : 0
node_gets : 42
node_gets_total : 163369
node_get_fsm_time_mean : 409
node_get_fsm_time_median : 397
node_get_fsm_time_95 : 674
node_get_fsm_time_99 : 734
node_get_fsm_time_100 : 748
node_puts : 24
node_puts_total : 66478
node_put_fsm_time_mean : 968
node_put_fsm_time_median : 1000
node_put_fsm_time_95 : 1599
node_put_fsm_time_99 : 1719
node_put_fsm_time_100 : 1735
node_get_fsm_siblings_mean : 1
node_get_fsm_siblings_median : 1
node_get_fsm_siblings_95 : 1
node_get_fsm_siblings_99 : 1
node_get_fsm_siblings_100 : 1
node_get_fsm_objsize_mean : 3609
node_get_fsm_objsize_median : 1280
node_get_fsm_objsize_95 : 29952
node_get_fsm_objsize_99 : 31385
node_get_fsm_objsize_100 : 31422
read_repairs_total : 1
coord_redirs_total : 0
cpu_nprocs : 441
cpu_avg1 : 51
cpu_avg5 : 33
cpu_avg15 : 54
mem_total : 8383463424
mem_allocated : 8280829952
disk : [{"/",724427316,3},
        {"/dev",4088508,1},
        {"/dev/shm",4093488,0},
        {"/var/run",4093488,1},
        {"/var/lock",4093488,0},
        {"/lib/init/rw",4093488,0},
        {"/var/lib/ureadahead/debugfs",724427316,3},
        {"/boot",256665,20}]
nodename : 'riak@my-ip-address'
connected_nodes : []
sys_driver_version : <<"1.5">>
sys_global_heaps_size : 0
sys_heap_type : private
sys_logical_processors : 8
sys_otp_release : <<"R14B03">>
sys_process_count : 1225
sys_smp_support : true
sys_system_version : <<"Erlang R14B03 (erts-5.8.4) [source] [64-bit]
[smp:8:8] [rq:8] [async-threads:64] [kernel-poll:true]">>
sys_system_architecture : <<"x86_64-unknown-linux-gnu">>
sys_threads_enabled : true
sys_thread_pool_size : 64
sys_wordsize : 8
ring_members : ['riak@my-ip-address']
ring_num_partitions : 64
ring_ownership : <<"[{'riak@my-ip-address',64}]">>
ring_creation_size : 64
storage_backend : riak_kv_eleveldb_backend
ssl_version : <<"4.1.5">>
public_key_version : <<"0.12">>
runtime_tools_version : <<"1.8.5">>
basho_stats_version : <<"1.0.1">>
riak_search_version : <<"1.0.2">>
merge_index_version : <<"1.0.1">>
luwak_version : <<"1.1.2">>
skerl_version : <<"1.1.0">>
riak_kv_version : <<"1.0.2">>
bitcask_version : <<"1.3.0">>
luke_version : <<"0.2.4">>
erlang_js_version : <<"1.0.0">>
mochiweb_version : <<"1.5.1">>
inets_version : <<"5.6">>
riak_pipe_version : <<"1.0.2">>
riak_core_version : <<"1.0.2">>
riak_sysmon_version : <<"1.0.0">>
webmachine_version : <<"1.9.0">>
crypto_version : <<"2.0.3">>
os_mon_version : <<"2.2.6">>
cluster_info_version : <<"1.2.0">>
sasl_version : <<"2.1.9.4">>
lager_version : <<"0.9.4">>
basho_metrics_version : <<"1.0.0">>
stdlib_version : <<"1.17.4">>
kernel_version : <<"2.14.4">>
executing_mappers : 0
memory_total : 1088568880
memory_processes : 1041629696
memory_processes_used : 1041593360
memory_system : 46939184
memory_atom : 1031585
memory_atom_used : 999738
memory_binary : 2593768
memory_code : 9301939
memory_ets : 656408


3. "# riak-admin ring_status" shows:

Attempting to restart script through sudo -u riak
================================== Claimant ===================================
Claimant:  'riak@my-ip-address'
Status:     up
Ring Ready: true

============================== Ownership Handoff ==============================
No pending changes.

============================== Unreachable Nodes ==============================
All nodes are up and reachable


4. "# riak-admin member_status" shows

Attempting to restart script through sudo -u riak
================================= Membership ==================================
Status     Ring    Pending    Node
-------------------------------------------------------------------------------
valid     100.0%      --      'riak@my-ip-address'
-------------------------------------------------------------------------------
Valid:1 / Leaving:0 / Exiting:0 / Joining:0 / Down:0


Thanks,
Walter

2011/11/30 Dan Reverri <d...@basho.com>:
> Hi Walter,
> It is not recommended to run more than 1 node per machine. Also, creating a
> new node by copying/pasting an existing node is not a good idea.
> Were you seeing "insufficient vnodes" before you tried adding nodes?
> Was the node under any load? MapReduce, ListKeys, etc.
> What does "riak-admin status" return on the original node?
> What does "riak-admin ring_status" return on the original node?
> What does "riak-admin "member_status" return on the original node?
> I am concerned that the addition of copied nodes may be creating additional
> issues. I would recommend getting the cluster back to a steady state with a
> single node. Once this is complete we can review the issues and determine
> what the problem might be.
> Thanks,
> Dan
> Daniel Reverri
> Developer Advocate
> Basho Technologies, Inc.
> d...@basho.com
>
>
> On Mon, Nov 28, 2011 at 3:06 PM, Chung-Hung (Walter) Tsai
> <waltert...@gmail.com> wrote:
>>
>> Hi Dan,
>>
>> Yes, I use non-default name parameter and reip it.
>>
>> Please find my setting and setup info as below:
>>
>> Version: 1.0.2, .deb
>> Linux: Unbuntu 10.4, 64bits
>> Vnode: I didn't specify the other 2 nodes specifically, just used
>> default setting and did riak start.
>> The DB has a bumping due to insufficient file descriptor and got
>> resolved after I set ulimit and sysctl for system-wide parameters.
>> It ran ok for several days but since yesterday, it gave us the error
>> message regarding insufficient vnodes errors.
>>
>> I'm trying to setup 2 additional nodes by copying and modding
>> necessary files (e.g. /usr/sbin/riak to /usr/sbin/riakN) manually now.
>> But they each become a single cluster with just a single member in it
>> and doesn't allow other vnodes to join.
>> When I launch "riak-admin", it shows something like " ... 100% ...
>> riak@MY-IP-ADDRESS"
>>
>>
>> Thanks,
>> Walter
>>
>> 2011/11/29 Dan Reverri <d...@basho.com>:
>> > Hi Walter,
>> > Have you made any changes to your configuration recently such as
>> > changing
>> > the "-name" parameter in vm.args?
>> > Thanks,
>> > Dan
>> > Daniel Reverri
>> > Developer Advocate
>> > Basho Technologies, Inc.
>> > d...@basho.com
>> >
>> >
>> > On Mon, Nov 28, 2011 at 1:04 AM, Chung-Hung (Walter) Tsai
>> > <waltert...@gmail.com> wrote:
>> >>
>> >> Hello everyone,
>> >>
>> >> We just saw many errors like this:
>> >>
>> >> 2011-11-27 23:02:01.129 [error] <0.700.0> webmachine error:
>> >> path="/buckets/.. (SKIPPED).....ui"
>> >> {error,{error,insufficient_vnodes_available}}
>> >>
>> >> The DB went well for several days, but now our riak cannot be
>> >> connected from our code since the errors appear.
>> >> Does anyone know how to resolve the issue?
>> >>
>> >>
>> >> Thanks,
>> >> Walter
>> >>
>> >> _______________________________________________
>> >> riak-users mailing list
>> >> riak-users@lists.basho.com
>> >> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>> >
>> >
>
>

_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to