Hi Kota,
Our production nodes are Riak CS 1.5 and Riak 1.4.x -- they're running
haproxy 1.4.x, and it's all been happy for some time now.

Testing the new nodes, still same haproxy versions, but Riak CS 2.0.1 and
Riak 1.0.5.
Very confused as to why the connections are being dropped when going
through haproxy. The problem persists even after restarting CS.
I tried staggering the restarts.. increasing the PB request pool.. etc..
 no change.

But it works fine if CS connects directly to the localhost riak pb.
(Which isn't a great idea.. big Riak instances sometimes take too long to
start, and CS falls over because it started too fast and couldn't connect,
if you're going to localhost)

Confusing! I'm wondering if it's because the testing machines are in
virtual machines, compared to production which is real hardware.
But.. normally haproxy still works fine on VMs.

I'll continue to play around.. Must be something that's botched on the
testing setup... but don't want to replicate that into production!


On Fri, 5 Jun 2015 at 13:59 Kota Uenishi <k...@basho.com> wrote:

> Toby,
>
> As PB connection management haven't been changed between CS 1.5 and
> 2.0, I think it should work. What's the version the load balancing
> working stable? It depends of the reason why connection has been cut,
> but I would recommend you restart just the CS node and recreate the
> connection pool.
>
> On Thu, Jun 4, 2015 at 2:33 PM, Toby Corkindale <t...@dryft.net> wrote:
> > Hi,
> > I've been happily using haproxy in front of Riak and Riak CS 1.x in
> > production for quite a while.
> >
> > I've been trying to bring up a new cluster based on riak/cs 2.0.x
> recently,
> > as you've probably noticed from the flurry of emails to this list :)
> >
> > I'm discovering that if I have haproxy sitting between riak-cs and riak,
> > then I get a lot of errors about disconnections. Initially I thought this
> > must be related to pb backlogs or pool sizes or file handle limits -- but
> > I've played with all those things to no avail.
> >
> > I *have* noticed that if I get riak-cs to connect directly to a riak
> > (bypassing haproxy) then everything is fine, including with the original
> > default request pool and backlog sizes.
> >
> > I am essentially using the recommended haproxy.cfg, which has worked
> fine in
> > production elsewhere.
> >
> > Any suggestions?
> > Error message sample follows:
> >
> > 2015-06-04 15:26:16.447 [warning]
> > <0.283.0>@riak_cs_riak_client:get_user_with_pbc:293 Fetching user re
> > cord with strong option failed: disconnected
> > 2015-06-04 15:26:16.447 [warning]
> > <0.2095.0>@riak_cs_pbc:check_connection_status:97 Connection status
> > of <0.287.0> at maybe_create_user: {false,[]}
> >
> >
> >
> > Cheers
> > Toby
> >
> > _______________________________________________
> > riak-users mailing list
> > riak-users@lists.basho.com
> > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> >
>
>
>
> --
> Kota UENISHI / @kuenishi
> Basho Japan KK
>
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to