Hi again everyone!

- The memory usage keeps growing day by day:
https://dl.dropboxusercontent.com/u/1962284/riak2.png

- The handoffs keep on going, with strange things like a transfer started
1.5 days ago:
riak-admin transfers
'riak@192.168.20.112' waiting to handoff 51 partitions
'riak@192.168.20.111' waiting to handoff 74 partitions
'riak@192.168.20.110' waiting to handoff 86 partitions
'riak@192.168.20.109' waiting to handoff 191 partitions
'riak@192.168.20.108' waiting to handoff 67 partitions
'riak@192.168.20.107' waiting to handoff 177 partitions

transfer type: hinted_handoff
vnode type: riak_kv_vnode
partition: 51380916937414555718098294900181824909778878464
started: 2015-02-11 21:54:07 [1.53 d ago]
last update: no updates seen
total size: unknown
objects transferred: unknown
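
In case it helps anyone watching the same thing on their own cluster, here is a minimal sketch for tallying the pending handoffs from the `riak-admin transfers` output above. It is plain Python and not part of Riak; the function name `pending_handoffs` and the `SAMPLE` text are only illustrative, and it assumes the "waiting to handoff" lines have exactly the format shown above.

```python
import re

# Matches lines like: 'riak@192.168.20.112' waiting to handoff 51 partitions
# (format assumed from the output pasted above)
WAITING_RE = re.compile(
    r"^'(?P<node>[^']+)' waiting to handoff (?P<count>\d+) partitions?$"
)


def pending_handoffs(transfers_output):
    """Return {node_name: partitions_waiting} parsed from the given text."""
    pending = {}
    for line in transfers_output.splitlines():
        match = WAITING_RE.match(line.strip())
        if match:
            pending[match.group("node")] = int(match.group("count"))
    return pending


# Sample copied from the output above; in practice the text would come from
# running the command and capturing its stdout.
SAMPLE = """\
'riak@192.168.20.112' waiting to handoff 51 partitions
'riak@192.168.20.111' waiting to handoff 74 partitions
'riak@192.168.20.110' waiting to handoff 86 partitions
'riak@192.168.20.109' waiting to handoff 191 partitions
'riak@192.168.20.108' waiting to handoff 67 partitions
'riak@192.168.20.107' waiting to handoff 177 partitions
"""

if __name__ == "__main__":
    pending = pending_handoffs(SAMPLE)
    # Print the nodes with the largest backlog first
    for node, count in sorted(pending.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{node}: {count}")
    print("total:", sum(pending.values()))
```

For the output above this gives a total of 646 partitions waiting, with 'riak@192.168.20.109' holding the largest backlog (191). Running it periodically would show whether the backlog is shrinking or growing.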

- I'm starting to see entries in the error log:
2015-02-12 19:58:54.026 [error]
<0.184.0>@riak_core_handoff_manager:handle_info:289 An outbound handoff of
partition riak_kv_vnode 936274486415109681974235595958868809467081785344
was terminated for reason: noproc
2015-02-12 20:27:34.092 [error]
<0.21096.1867>@riak_core_handoff_sender:start_fold:263 hinted_handoff
transfer of riak_kv_vnode from 'riak@192.168.20.112'
1210306043414653979137426502093171875652569137152 to 'riak@192.168.20.109'
1210306043414653979137426502093171875652569137152 failed because of TCP
recv timeout
2015-02-12 20:27:34.092 [error]
<0.184.0>@riak_core_handoff_manager:handle_info:289 An outbound handoff of
partition riak_kv_vnode 1210306043414653979137426502093171875652569137152
was terminated for reason: {shutdown,timeout}
2015-02-12 21:25:32.852 [error]
<0.184.0>@riak_core_handoff_manager:handle_info:289 An outbound handoff of
partition riak_kv_vnode 742168800207099138150308704113737470919028244480
was terminated for reason: noproc
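
To see which termination reasons dominate, the entries above can be counted with a short script. This is only a sketch in plain Python, not a Riak tool: `count_reasons` and `SAMPLE_LOG` are illustrative names, and it assumes the "was terminated for reason:" wording shown in the entries above. It joins wrapped lines first, because the entries are broken across lines in this email.

```python
import re
from collections import Counter

# Captures the token after "was terminated for reason:" (assumed wording,
# taken from the handoff manager entries quoted above)
REASON_RE = re.compile(r"was terminated for reason: (\S+)")


def count_reasons(log_text):
    """Count outbound handoff termination reasons in the given log text."""
    # Collapse all whitespace so entries wrapped across lines still match
    flat = " ".join(log_text.split())
    return Counter(REASON_RE.findall(flat))


# Sample copied from the error log entries above
SAMPLE_LOG = """\
2015-02-12 19:58:54.026 [error]
<0.184.0>@riak_core_handoff_manager:handle_info:289 An outbound handoff of
partition riak_kv_vnode 936274486415109681974235595958868809467081785344
was terminated for reason: noproc
2015-02-12 20:27:34.092 [error]
<0.184.0>@riak_core_handoff_manager:handle_info:289 An outbound handoff of
partition riak_kv_vnode 1210306043414653979137426502093171875652569137152
was terminated for reason: {shutdown,timeout}
2015-02-12 21:25:32.852 [error]
<0.184.0>@riak_core_handoff_manager:handle_info:289 An outbound handoff of
partition riak_kv_vnode 742168800207099138150308704113737470919028244480
was terminated for reason: noproc
"""

if __name__ == "__main__":
    for reason, count in count_reasons(SAMPLE_LOG).most_common():
        print(f"{reason}: {count}")
```

On the three handoff manager entries above this counts noproc twice and {shutdown,timeout} once.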


Can anyone please help me with this? I'm starting to get worried about
this behaviour. Let me know if you need more info!

Thanks and Best regards,
Edgar Veiga

On 10 February 2015 at 16:16, Edgar Veiga <edgarmve...@gmail.com> wrote:

> Hi all!
>
> I have a Riak cluster that has been running smoothly in production for
> about a year, with the following characteristics:
>
> - Version 1.4.12
>
> - 6 nodes
>
> - leveldb backend
>
> - replication (n) = 3
>
> - ~3 billion keys
>
> - ~1.2 TB per node
>
> - AAE disabled
>
>
> Two days ago I upgraded all 6 nodes from Riak v1.4.8 to v1.4.12,
> and two slightly odd things started happening:
>
> 1) The first is the memory consumption. Please check the following image to
> see what I mean:
>
> - https://dl.dropboxusercontent.com/u/1962284/riak.png
>
> 2) All of the machines keep logging hinted handoffs after the rolling
> restart. I did the upgrade during off-peak hours and made sure the rolling
> restart was only considered finished once all in-progress handoffs had
> completed, but when I checked the logs the next day I realised that they
> keep appearing. Here are some random examples:
>
> 2015-02-10 16:11:55.547 [info] 
> <0.3070.753>@riak_core_handoff_sender:start_fold:148 Starting hinted_handoff 
> transfer of riak_kv_vnode from 'riak@192.168.20.112' 
> 765004763290394496247241279624929393101152190464 to 'riak@192.168.20.109' 
> 765004763290394496247241279624929393101152190464
>
> 2015-02-10 16:11:55.548 [info] 
> <0.3070.753>@riak_core_handoff_sender:start_fold:236 hinted_handoff transfer 
> of riak_kv_vnode from 'riak@192.168.20.112' 
> 765004763290394496247241279624929393101152190464 to 'riak@192.168.20.109' 
> 765004763290394496247241279624929393101152190464 completed: sent 3.15 KB 
> bytes in 1 of 1 objects in 0.00 seconds (3.99 MB/second)
>
> 2015-02-10 16:12:05.803 [info] 
> <0.3434.753>@riak_core_handoff_sender:start_fold:148 Starting hinted_handoff 
> transfer of riak_kv_vnode from 'riak@192.168.20.112' 
> 902020541790166644828836732692080926193895866368 to 'riak@192.168.20.109' 
> 902020541790166644828836732692080926193895866368
>
> 2015-02-10 16:12:05.856 [info] 
> <0.3368.753>@riak_core_handoff_sender:start_fold:148 Starting hinted_handoff 
> transfer of riak_kv_vnode from 'riak@192.168.20.112' 
> 570899077082383952423314387779798054553098649600 to 'riak@192.168.20.111' 
> 570899077082383952423314387779798054553098649600
>
> 2015-02-10 16:12:05.860 [info] 
> <0.3434.753>@riak_core_handoff_sender:start_fold:236 hinted_handoff transfer 
> of riak_kv_vnode from 'riak@192.168.20.112' 
> 902020541790166644828836732692080926193895866368 to 'riak@192.168.20.109' 
> 902020541790166644828836732692080926193895866368 completed: sent 39.79 KB 
> bytes in 1 of 1 objects in 0.06 seconds (699.32 KB/second)
>
> 2015-02-10 16:12:05.886 [info] 
> <0.3368.753>@riak_core_handoff_sender:start_fold:236 hinted_handoff transfer 
> of riak_kv_vnode from 'riak@192.168.20.112' 
> 570899077082383952423314387779798054553098649600 to 'riak@192.168.20.111' 
> 570899077082383952423314387779798054553098649600 completed: sent 3.55 KB 
> bytes in 1 of 1 objects in 0.03 seconds (118.58 KB/second)
>
>
> Should I be worried or is this normal on this version?
>
>
> Best regards,
>
> Edgar
>
>
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com