RE: Install riak from source without github access

2012-05-08 Thread Walters Chris (NHS CONNECTING FOR HEALTH)
Hi Mark, Yes, that solution worked for me - thanks. Chris. From: Mark Phillips [m...@basho.com] Sent: 04 May 2012 16:58 To: Walters Chris (NHS CONNECTING FOR HEALTH) Cc: riak-users@lists.basho.com Subject: Re: Install riak from source without github access Hi Wa

Re: Questions about Riak Enterprise

2012-05-08 Thread Mark Rose
On Mon, May 7, 2012 at 9:25 PM, Andrew Thompson wrote: > > Hi Mark, I'm the riak EDS 'maintainer'. Sorry I didn't reply earlier, I > was travelling all week. > Hi Andrew, glad to hear from you! > I understand that the open source version of Riak's replication is > designed > > for single data ce

Riak cluster unresponsive after single node failure

2012-05-08 Thread Armon Dadgar
We are currently running a 4 node cluster with 1.1.2 on Ubuntu 10.04, and are experiencing an issue where losing a single node has cause the entire cluster to fail. Nagios reported that node 1 had failed, shortly after, all the logs are filled with: 2012-05-08 08:13:22.319 [error] <0.27873.2568>

Re: Riak cluster unresponsive after single node failure

2012-05-08 Thread Mark Phillips
Hey Armon, So "monitor busy_dist_port" means your nodes aren't talking but we need to figure out why. Specifically it looks like you're kv vnodes aren't able to communicate. First questions * Which backend are you using? * What OS? * What size are your values? * What is the typical traffic (ops/

Re: Riak cluster unresponsive after single node failure

2012-05-08 Thread Armon Dadgar
Hey, The cluster is back up and running by going through the following steps: 1) Force terminate east-riak-001 using the AWS console 2) "riak-admin down r...@east-riak-001.cluster.kiip.me" on ALL nodes 3) riak stop && riak start on ALL nodes All the nodes appeared to have been blocked tryi

Re: Riak cluster unresponsive after single node failure

2012-05-08 Thread Scott Lystig Fritchie
>>> "ar" == Armon Dadgar wrote: ar> All the nodes appeared to have been blocked trying to talk to riak ar> 001 which was the ring claimant at the time. Doing this seems to ar> have cleared the state enough for the cluster to make progress ar> again. Armon, it's quite unlikely that the ring claim

Re: Riak cluster unresponsive after single node failure

2012-05-08 Thread Armon Dadgar
Hey Scott, My mistake, I was not sure if the claimant was responsible for convergence. If this was a competition, it was not one that would ever finish… The node went down at about 1AM, and by 9AM when I started to resolve the issue it was in the same state. I was unable to investigate the stat

Re: Riak cluster unresponsive after single node failure

2012-05-08 Thread Armon Dadgar
Where by "competition", I meant "compaction". Derp. Best Regards, Armon Dadgar On Tuesday, May 8, 2012 at 3:54 PM, Armon Dadgar wrote: > Hey Scott, > > My mistake, I was not sure if the claimant was responsible for convergence. > > If this was a competition, it was not one that would eve

Efficient way of passing multiple arguments to mapreduce functions

2012-05-08 Thread Matthew Tovbin
Hi Riak-users, I'm looking for an efficient way of passing multiple arguments to mapreduce functions, Instead of passing a JSON (or other custom string representation of arguments) and being forced to parse it on every map/reduce function call, i.e.: Now: //Passing: arg = {"a":"0", "b":"1"} //Ex

Re: Questions about Riak Enterprise

2012-05-08 Thread Andrew Thompson
On Tue, May 08, 2012 at 10:59:19AM -0400, Mark Rose wrote: > > Approximately how fast is "eventually"? Is it based on the network > bandwidth/latency between the clusters or is there additional latency? > Basically, would I expect more or less than a second if the two clusters > were 250 ms away?