Hi Stuart,

I think you misinterpreted what I was asking.
Assume a pair of instances is already in a relationship. Let's say we have DRBD running between these two instances; call them A and B. If Juju starts a third instance, C, I want to make sure that C cannot join the pair, since DRBD is not supposed to have a third node (although in theory you can create a stacked node for a third or further backups). So when ha-relation-joined is triggered on A or B, how do I tell C that it is rejected from the join, without messing up Juju? I suppose A or B could do a "relation-set" of something that tells C it is rejected; A or B can then exit 0, and C can set itself to blocked and also exit 0 (see the first sketch below). Would that leave Juju in a satisfactory state, where no relationship is set but every hook exits cleanly?

As for your other suggestion of starting C and letting data sync to C before dropping one node, say A: that is possible as well; it just means you would enforce the limit at a higher unit count.

The issue for me is how to scale when specific data lives on a specific set of nodes. You can use Ceph, DRBD, or some other cluster: Ceph requires three nodes, DRBD two, and a Galera cluster perhaps three. My idea is that there is already a load balancer in place for scaling, and that each time you want to scale you add one or more pairs (assuming DRBD) to the already existing set of pairs. The load balancer just redirects traffic to a specific pair based on some logic, such as the modulus of the last octet of the customer IP, which can give you up to 256 pairs (see the routing sketch below). This is how we do it on physical machines; we haven't yet had a customer that requires more than 10,000 tps for RADIUS or 5 million concurrent sessions. (Note that I use "pair" loosely here: if a "pair" runs a Galera cluster, it is three nodes rather than two.)

I'm currently trying to figure out how to do the same on OpenStack. If you have any recommendations on what to read or watch about how people handle scaling for very high write IO to disk, I'd appreciate them. For RADIUS we are looking at roughly 95% writes and 5% reads; nobody reads the data unless someone wants to know whether user X is currently logged in. If the read/write IO requirements were the other way around, it would be much easier to scale.
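Something like this sketch is what I have in mind (Python with charm-helpers; the 'rejected-*' key and the "pair = two lowest-numbered units" rule are assumptions of mine, not an existing interface):

import sys
from charmhelpers.core import hookenv

MAX_PEERS = 2  # a DRBD pair

def unit_number(unit):
    return int(unit.split('/')[1])

def ha_relation_joined():
    # Runs on every member when a unit joins the peer relation.
    units = sorted(hookenv.related_units() + [hookenv.local_unit()],
                   key=unit_number)
    if len(units) <= MAX_PEERS:
        return  # pair not yet full, accept the join
    # Assumption: treat the two lowest-numbered units as the live pair.
    pair = units[:MAX_PEERS]
    newcomer = hookenv.remote_unit()
    if hookenv.local_unit() in pair and newcomer not in pair:
        # A/B publish a rejection flag for the surplus unit, then exit 0.
        hookenv.relation_set(relation_settings={
            'rejected-' + newcomer.replace('/', '-'): 'true'})

def ha_relation_changed():
    # Runs on the joining unit C: if a peer rejected us, set blocked and
    # exit 0, so Juju records no hook failure, just a blocked status.
    me = hookenv.local_unit().replace('/', '-')
    for unit in hookenv.related_units():
        if hookenv.relation_get('rejected-' + me, unit=unit):
            hookenv.status_set('blocked',
                               'DRBD pair already formed; remove this unit')
            sys.exit(0)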
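And for illustration, the routing rule on the load balancer is nothing more than this (names made up):

def pair_for_client(client_ip, num_pairs):
    # Shard by the last octet of the client IP; with num_pairs = 256
    # every octet value gets its own pair, otherwise values wrap around.
    last_octet = int(client_ip.split('.')[-1])
    return last_octet % num_pairs

pair_for_client('10.0.0.57', 4)  # -> 1, i.e. route to pair 1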
Regards,

Michael

-----Original Message-----
From: stu...@stuartbishop.net [mailto:stu...@stuartbishop.net] On Behalf Of Stuart Bishop
Sent: Wednesday, September 27, 2017 2:38 PM
To: Michael Van Der Beek <michael....@antlabs.com>
Cc: juju@lists.ubuntu.com
Subject: Re: Rejection of peer join.

On 27 September 2017 at 10:02, Michael Van Der Beek <michael....@antlabs.com> wrote:
> Hi All,
>
> I was thinking about a charm that, for a peer relationship, supports
> only two instances forming a peer.
>
> What would be the correct response to gracefully reject a third
> instance trying to join, so that Juju doesn't complain of a failed
> setup?
>
> Exit 1 will surely cause problems. Exit 0, but then Juju doesn't
> know that the relationship is not set.

Set the workload status of the unit that should not join to 'blocked' with a message explaining the problem, and exit 0 (using the 'status-set' hook environment tool, or hookenv.status_set() in charm-helpers). The live peers should just ignore the third unit.

If you do it right, it would be possible to add a third unit, then remove one of the live pair, and have everything migrate across to the new unit with only a small period of time without redundancy.

> The idea is that I want to do failover within a pair of instances.
>
> I have a radius load balancer that will split packets based on a
> modulus of the same framed IP (the IP of the client) or MAC address
> or something similar.
>
> So each pair of radius servers would only hold sessions related to
> that modulus result. So if I do modulus 4, I'll have 4 pairs of
> instances.
>
> I'm not sure whether each pair (active-standby) supporting 1,000 tps
> is viable up to a million sessions.

You may not be sure, but maybe the person who is using your charm is? Or wants to test it? If it is no more effort, I'd suggest supporting an arbitrary number of units rather than just one or two. It could also provide a mechanism to migrate the Juju application to new units without ever losing redundancy, by allowing the operator to bring a third unit online, wait for it to get in sync, and then drop one of the original units.

--
Stuart Bishop <stuart.bis...@canonical.com>
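A minimal sketch of the arbitrary-number-of-units approach Stuart describes, again with charm-helpers (the ranking rule and status messages are illustrative assumptions, not from his message): each unit ranks the peers, the two lowest-numbered units form the DRBD pair, extra units wait, and removing a pair member promotes the next unit, which gives the no-redundancy-loss migration he mentions.

from charmhelpers.core import hookenv

PAIR_SIZE = 2

def unit_number(unit):
    return int(unit.split('/')[1])

def ha_relation_changed():
    units = sorted(hookenv.related_units() + [hookenv.local_unit()],
                   key=unit_number)
    active = units[:PAIR_SIZE]
    if hookenv.local_unit() in active:
        # (Re)configure DRBD against the other active member here.
        hookenv.status_set('active', 'DRBD pair: ' + ', '.join(active))
    else:
        hookenv.status_set('waiting',
                           'standby; will join if a pair member departs')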