Re: [Pacemaker] Serious issue with booth for site failover

2013-02-06 Thread Lars Marowsky-Bree
On 2013-02-06T11:24:08, Yves Trudeau wrote:
> Hi Lars,
> the site supposedly down was still able to send packets to the arbitrator.
> That caused the ballot number to be increased on the arbitrator, so the valid
> ballot from the surviving site was discarded in acceptor_promise. Although
> an op

Re: [Pacemaker] Serious issue with booth for site failover

2013-02-06 Thread Yves Trudeau
Hi Lars, the site supposedly down was still able to send packets to the arbitrator. That caused the ballot number to be increased on the arbitrator, so the valid ballot from the surviving site was discarded in acceptor_promise. Although an operational mistake on my side, I wonder if this can
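To illustrate the failure mode described above, here is a minimal sketch of the classic Paxos promise rule, under which an acceptor rejects any prepare whose ballot is not higher than the one it has already promised. This is not booth's code: the `Acceptor` class, `on_prepare`, `simulate`, and the ballot values 4, 5 and 6 are all hypothetical, and booth's real acceptor_promise may differ in detail.

```python
from dataclasses import dataclass


@dataclass
class Acceptor:
    """Hypothetical Paxos acceptor; not booth's actual data structure."""
    promised: int = 0  # highest ballot number promised so far

    def on_prepare(self, ballot: int) -> bool:
        # Classic Paxos rule: promise only to a ballot higher than any
        # ballot already promised; otherwise reject the prepare.
        if ballot <= self.promised:
            return False
        self.promised = ballot
        return True


def simulate() -> list[bool]:
    arbitrator = Acceptor()
    results = []
    # The site that was meant to be isolated still reaches the arbitrator
    # and pushes the promised ballot up to 5 (illustrative value).
    results.append(arbitrator.on_prepare(5))
    # The surviving site, unaware of that, proposes with a lower ballot
    # and is rejected.
    results.append(arbitrator.on_prepare(4))
    # A retry with a ballot above 5 is promised again.
    results.append(arbitrator.on_prepare(6))
    return results


print(simulate())  # [True, False, True]
```

In this sketch the surviving site's prepare is refused only because the supposedly dead site kept raising the arbitrator's promised ballot, which matches the symptom reported in this message.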

Re: [Pacemaker] Serious issue with booth for site failover

2013-02-06 Thread Lars Marowsky-Bree
On 2013-01-19T12:19:46, Yves Trudeau wrote:
> Hi,
> Forget this, everything is fine. An iptables rule was missing in my
> failure test.

Hi Yves, which iptables rule was missing, if I may ask?

Regards,
Lars
--
Architect Storage/HA SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild

Re: [Pacemaker] Serious issue with booth for site failover

2013-01-19 Thread Yves Trudeau
Hi, Forget this, everything is fine. An iptables rule was missing in my failure test. Regards, Yves

On 2013-01-18 13:24, Yves Trudeau wrote:
Hi, learning about the paxos protocol, I realize the problem is not with the arbitrator, but with the surviving node. Here is its debug output: booth

Re: [Pacemaker] Serious issue with booth for site failover

2013-01-18 Thread Yves Trudeau
Hi, learning about the paxos protocol, I realize the problem is not with the arbitrator, but with the surviving node. Here is its debug output:

booth-site[2552]: 2013/01/18_11:26:36 debug: preposer prepare ...
booth-site[2552]: 2013/01/18_11:26:36 debug: enter lease_prepare
booth-site[2552]: 2013/01

[Pacemaker] Serious issue with booth for site failover

2013-01-18 Thread Yves Trudeau
Hi, working on a geo-redundant setup, I uncovered a problem with booth. In order to simplify, I did an experiment with only booth, no pacemaker. The behavior is the same with pacemaker.

Version used:

git log
commit 55ab027233407fd44850f0c4905b085205d55f64
Author: Xia Li
Date