Re: [Pacemaker] The question of a function which commits suicide according to STONITH by one node composition

2012-03-15 Thread Andrew Beekhof
2012/3/14 Yuusuke Iida : > Hi, Andrew > > I was going to confirm the suicide function by STONITH implemented by > the following changes. > https://github.com/ClusterLabs/pacemaker/commit/1a26605e1b5840f49d44b47a1f083c2cdb6f673b > > The environment is as follows. > Pacemaker-1.1.6(commit 31f6ca36cca

Re: [Pacemaker] Pacemaker restart resources when node joins cluster after failback

2012-03-15 Thread Andrew Beekhof
2012/3/6 José Alonso : > Hi all, > > I have 2 Debian nodes with heartbeat and pacemaker 1.1.6 installed, and > almost everything is working fine, I have only apache configured for > testing, when a node goes down the failover is done correctly, but there's a > problem when a node failbacks. > > For

Re: [Pacemaker] Migration atomicity

2012-03-15 Thread Andrew Beekhof
On Thu, Mar 15, 2012 at 3:22 PM, Vladislav Bogdanov wrote: > 15.03.2012 01:49, Andreas Kurz wrote: >> On 03/14/2012 08:40 AM, Vladislav Bogdanov wrote: >>> Hi, >>> >>> I'm observing a little bit unintuitive behavior of migration logic when >>> transition is aborted (due to CIB change) in the middl

Re: [Pacemaker] Migration atomicity

2012-03-15 Thread Andrew Beekhof
On Thu, Mar 15, 2012 at 9:53 AM, Lars Marowsky-Bree wrote: > On 2012-03-14T23:49:26, Andreas Kurz wrote: > >> > Is the current behavior intended? >> You mean that a migration is rolled-back due to a transition abort -- >> depending on its progress? I think that is the defined (and intended) >> be

Re: [Pacemaker] Migration atomicity

2012-03-15 Thread Andrew Beekhof
On Thu, Mar 15, 2012 at 6:55 PM, Andreas Kurz wrote: > On 03/15/2012 05:22 AM, Vladislav Bogdanov wrote: >> 15.03.2012 01:49, Andreas Kurz wrote: >>> On 03/14/2012 08:40 AM, Vladislav Bogdanov wrote: Hi, I'm observing a little bit unintuitive behavior of migration logic when tr

Re: [Pacemaker] 2 sbd devices and stonith-ng is showing (1 active devices)

2012-03-15 Thread Andrew Beekhof
On Fri, Mar 16, 2012 at 8:08 AM, Lars Marowsky-Bree wrote: > On 2012-03-15T15:39:23, "Janec, Jozef" wrote: > >> We have configured pacemaker on HAE  from novell: >> >> cat /etc/sysconfig/sbd >> SBD_DEVICE="/dev/mapper/SHARED1_part1;/dev/mapper/SHARED2_part1" >> SBD_OPTS="-W" >> >> I'm running  2

Re: [Pacemaker] How to stop a resource?

2012-03-15 Thread Lars Marowsky-Bree
On 2012-03-15T14:39:52, Andreas Kurz wrote: > I think your problem is, that a "crm resource stop" is asynchronous ... crm has a synchronous mode nowadays too. He should use that, and first "stop" the resource he wants to stop before starting to delete bits from the CIB, I think. Regards,

Re: [Pacemaker] 2 sbd devices and stonith-ng is showing (1 active devices)

2012-03-15 Thread Lars Marowsky-Bree
On 2012-03-15T15:39:23, "Janec, Jozef" wrote: > We have configured pacemaker on HAE from novell: > > cat /etc/sysconfig/sbd > SBD_DEVICE="/dev/mapper/SHARED1_part1;/dev/mapper/SHARED2_part1" > SBD_OPTS="-W" > > I'm running 2 instances of watcher > > root 9157 1 0 11:00 pts/000:

Re: [Pacemaker] How to stop a resource?

2012-03-15 Thread Arnold Krille
Hi, On 15.03.2012 14:39, Andreas Kurz wrote: On 03/15/2012 02:23 PM, Tim Ward wrote: From: Jake Smith [mailto:jsm...@argotec.com] Maybe totally in the wrong direction for what you want but... Put commands in a script and add a until loop with a pgrep test and sleep 1 till the specific resource

[Pacemaker] 2 sbd devices and stonith-ng is showing (1 active devices)

2012-03-15 Thread Janec, Jozef
Hello All, We have configured pacemaker on HAE from novell: cat /etc/sysconfig/sbd SBD_DEVICE="/dev/mapper/SHARED1_part1;/dev/mapper/SHARED2_part1" SBD_OPTS="-W" I'm running 2 instances of watcher root 9157 1 0 11:00 pts/000:00:00 sbd: inquisitor root 9158 9157 0 11:00 pt

Re: [Pacemaker] Nodes unable to connect / find each other

2012-03-15 Thread Regendoerp, Achim
That is the weird part I do not understand. Netstat shows the ports and process name, SELinux is disabled, local firewall is disabled, no other log entries other than those posted in initial mail. Which is why I do not understand why it's not working. UDP traffic only appears with multicast enab

Re: [Pacemaker] How to stop a resource?

2012-03-15 Thread Tim Ward
> From: Andreas Kurz [mailto:andr...@hastexo.com] > >> > >> Maybe totally in the wrong direction for what you want but... > >> Put commands in a script and add a until loop with a pgrep > test and > >> sleep 1 till the specific resource is stopped then continue? > > > > Yes, probably is in the

Re: [Pacemaker] How to stop a resource?

2012-03-15 Thread Andreas Kurz
On 03/15/2012 02:23 PM, Tim Ward wrote: >> From: Jake Smith [mailto:jsm...@argotec.com] >> >> Maybe totally in the wrong direction for what you want but... >> Put commands in a script and add a until loop with a pgrep >> test and sleep 1 till the specific resource is stopped then continue? > > Y

Re: [Pacemaker] Nodes unable to connect / find each other

2012-03-15 Thread Andreas Kurz
On 03/15/2012 01:57 PM, Regendoerp, Achim wrote: > As a status update, not got any further... > Confirmed with the Networks people that Multicast is enabled, but no luck. > Using unicast crm is not able to connect to the cluster (below is the unicast > config used). > With unicast, there's no traf

Re: [Pacemaker] How to stop a resource?

2012-03-15 Thread Tim Ward
> From: Jake Smith [mailto:jsm...@argotec.com] > > Maybe totally in the wrong direction for what you want but... > Put commands in a script and add a until loop with a pgrep > test and sleep 1 till the specific resource is stopped then continue? Yes, probably is in the wrong direction. I'm driv

Re: [Pacemaker] How to stop a resource?

2012-03-15 Thread Jake Smith
- Original Message - > From: "Tim Ward" > To: "The Pacemaker cluster resource manager" > Sent: Thursday, March 15, 2012 8:58:19 AM > Subject: Re: [Pacemaker] How to stop a resource? > > I think that describes an interactive session? > > I know I can do it interactively - just go and ha

Re: [Pacemaker] How to stop a resource?

2012-03-15 Thread Tim Ward
I think that describes an interactive session? I know I can do it interactively - just go and have a cup of coffee after the "stop" and before the "delete" and manually poll crm status - but what I need is a programmatic way of doing it. Plus /etc/init.d.myResource isn't going to do anything

Re: [Pacemaker] Nodes unable to connect / find each other

2012-03-15 Thread Regendoerp, Achim
As a status update, not got any further... Confirmed with the Networks people that Multicast is enabled, but no luck. Using unicast crm is not able to connect to the cluster (below is the unicast config used). With unicast, there's no traffic at all (observed via tcpdump) Beginning to wonder if t

Re: [Pacemaker] How to stop a resource?

2012-03-15 Thread Florian Crouzat
Le 15/03/2012 12:50, Tim Ward a écrit : So, does anyone have any ideas as to what is going on here, and/or how to actually stop and then delete something? - thanks! $ crm configure property maintenance-mode=true commit quit $ /etc/init.d/myResource stop $ crm resource cleanup myResource # so

[Pacemaker] Question about Pacemaker & mysql master/slave replication and DRBD replication

2012-03-15 Thread coma
Hello, I'm a new Pacemaker user and i trying to understand exactly what it can do / can't do in case of MySQL Replication or DRBD replication. I have two MySQl servers, for the moment with a simple Master/Slave replication, my need is to implement a high availability system with automated IP and

[Pacemaker] How to stop a resource?

2012-03-15 Thread Tim Ward
The description of resource agents at http://www.linux-ha.org/wiki/Resource_Agents includes "They are synchronous in nature. That is, you start them, and they complete some time later, and you are expected to wait for them to complete." I am assured by team members that we have verified that ou

Re: [Pacemaker] Nodes unable to connect / find each other

2012-03-15 Thread Regendoerp, Achim
Hi, According to the network guy the multicast is all there, but we've got no step further, so we're trying your suggested method now :) Didn't know that corosync now supports unicast too, must've missed that... Thanks for the heads up! Achim -Original Message- From: Andreas Kurz [m

Re: [Pacemaker] getting started - crm hangs when adding resources, even "crm ra classes" hangs

2012-03-15 Thread Florian Haas
On Wed, Mar 14, 2012 at 5:55 PM, Phillip Frost wrote: > On Mar 14, 2012, at 12:33 PM, Florian Haas wrote: > >>> However, sometimes pacemakerd will not stop cleanly. >> >> OK. Whether this is related to your original problem or not a complete >> open question, jftr. >> >>> I thought it might happen

Re: [Pacemaker] unbound resource agent

2012-03-15 Thread Dejan Muhamedagic
Hi, On Wed, Mar 14, 2012 at 10:35:01PM +0100, Arnold Krille wrote: > On Wednesday 14 March 2012 17:52:21 Dejan Muhamedagic wrote: > > On Wed, Mar 14, 2012 at 02:48:11PM +0100, Benjamin Kiessling wrote: > > > Hi, > > > > > > On 2012.03.14 14:24:10 +0100, Dejan Muhamedagic wrote: > > > > > dnsCache

Re: [Pacemaker] Migration atomicity

2012-03-15 Thread Vladislav Bogdanov
15.03.2012 10:55, Andreas Kurz wrote: > On 03/15/2012 05:22 AM, Vladislav Bogdanov wrote: >> 15.03.2012 01:49, Andreas Kurz wrote: >>> On 03/14/2012 08:40 AM, Vladislav Bogdanov wrote: Hi, I'm observing a little bit unintuitive behavior of migration logic when transition is abor

Re: [Pacemaker] Migration atomicity

2012-03-15 Thread Andreas Kurz
On 03/15/2012 05:22 AM, Vladislav Bogdanov wrote: > 15.03.2012 01:49, Andreas Kurz wrote: >> On 03/14/2012 08:40 AM, Vladislav Bogdanov wrote: >>> Hi, >>> >>> I'm observing a little bit unintuitive behavior of migration logic when >>> transition is aborted (due to CIB change) in the middle of the r