[Pacemaker] [Patch] Patch for crmd-transition-delay processing.

2012-03-21 Thread renayama19661014
Hi All, The crmd-transition-delay option waits for attribute updates that arrive late. However, crmd cannot wait for the attribute properly, because the timer is not reset when an attribute update is delayed after the timer has already been set. As a result, the resource may not be placed reliably. I w

[Pacemaker] unable to join cluster

2012-03-21 Thread Hisashi Osanai
Hello, I have a three-node cluster using pacemaker/corosync. When I reboot one node, the node is unable to join the cluster. I see this kind of split brain in 10-20% of cases (recurrence ratio) when I shut down a node. What do you think of this problem? My questions are: - Is this a known problem? - Any work aroun

Re: [Pacemaker] Always Run Clone Resource

2012-03-21 Thread Andrew Beekhof
Shouldn't it have sent you a message that it was being stopped? On Thu, Mar 22, 2012 at 4:51 AM, Andrew Martin wrote: > Hello, > > I have a pacemaker/heartbeat cluster that uses several DRBD primitives. The > cluster resources are all colocated and ordered to start after the > DRBD primitives. I h

Re: [Pacemaker] [Openstack] Howto Nova setup with HA?

2012-03-21 Thread Florian Haas
Hi everyone, apologies for the cross-post; I believe this might be interesting to people on both the openstack and the pacemaker lists. Please see below. On Tue, Feb 14, 2012 at 9:07 AM, i3D.net - Tristan van Bokkem wrote: > Hi Stackers, > > It seems running Openstack components in High Availabi

[Pacemaker] Always Run Clone Resource

2012-03-21 Thread Andrew Martin
Hello, I have a pacemaker/heartbeat cluster that uses several DRBD primitives. The cluster resources are all colocated and ordered to start after the DRBD primitives. I have configured an ocf:heartbeat:MailTo primitive and clone for notifying me of any changes in the cluster state: primiti

Re: [Pacemaker] inconsistence in crm_mon and crm resource show

2012-03-21 Thread Janec, Jozef
Fixed now. By mistake I removed the property stonith-enabled=false, and therefore the surviving node always tried to fence the second node, which had crashed or been rebooted. The result was that all resources were down, waiting until the fence operation returned. After I restored the parameter, the behavior is

Re: [Pacemaker] Using shadow configurations noninteractively

2012-03-21 Thread Phillip Frost
On Mar 19, 2012, at 4:30 PM, Florian Haas wrote: > On Mon, Mar 19, 2012 at 9:00 PM, Phil Frost wrote: >> On Mar 19, 2012, at 15:22 , Florian Haas wrote: >>> On Mon, Mar 19, 2012 at 8:00 PM, Phil Frost >>> wrote: Normally I'd expect some command-line option, but I can't find any. It >

Re: [Pacemaker] How to setup STONITH in a 2-node active/passive linux HA pacemaker cluster?

2012-03-21 Thread Andreas Kurz
On 03/21/2012 02:53 PM, Dejan Muhamedagic wrote: > On Tue, Mar 20, 2012 at 06:22:34PM +0100, Andreas Kurz wrote: >> On 03/20/2012 04:14 PM, Mathias Nestler wrote: >>> Hi Dejan, >>> >>> On 20.03.2012, at 15:25, Dejan Muhamedagic wrote: >>> Hi, On Tue, Mar 20, 2012 at 08:52:39AM +0100,

Re: [Pacemaker] How to setup STONITH in a 2-node active/passive linux HA pacemaker cluster?

2012-03-21 Thread Dejan Muhamedagic
On Tue, Mar 20, 2012 at 06:22:34PM +0100, Andreas Kurz wrote: > On 03/20/2012 04:14 PM, Mathias Nestler wrote: > > Hi Dejan, > > > > On 20.03.2012, at 15:25, Dejan Muhamedagic wrote: > > > >> Hi, > >> > >> On Tue, Mar 20, 2012 at 08:52:39AM +0100, Mathias Nestler wrote: > >>> On 19.03.2012, at 20

Re: [Pacemaker] inconsistence in crm_mon and crm resource show

2012-03-21 Thread Janec, Jozef
> > On 2012-03-21T09:42:26, "Janec, Jozef" wrote: > > > Node b300ple0: UNCLEAN (offline) > > rs_nw_dbjj7 (ocf::heartbeat:IPaddr) Started > > rs_nw_cijj7 (ocf::heartbeat:IPaddr) Started > > Node b400ple0: online > > sbd_fense_SHARED2 (stonith:external/sbd) St

Re: [Pacemaker] inconsistence in crm_mon and crm resource show

2012-03-21 Thread Lars Marowsky-Bree
On 2012-03-21T09:42:26, "Janec, Jozef" wrote: > Node b300ple0: UNCLEAN (offline) > rs_nw_dbjj7 (ocf::heartbeat:IPaddr) Started > rs_nw_cijj7 (ocf::heartbeat:IPaddr) Started > Node b400ple0: online > sbd_fense_SHARED2 (stonith:external/sbd) Started > > Inacti

[Pacemaker] inconsistence in crm_mon and crm resource show

2012-03-21 Thread Janec, Jozef
Hello All, I have a simple configuration of a 2-node cluster: b400ple0:(/root/home/root)(root)#crm configure show node b300ple0 node b400ple0 primitive rs_nw_cijj7 ocf:heartbeat:IPaddr \ operations $id="rs_nw_cijj7-operations" \ op monitor interval="5s" timeout="20s" \ params ip

Re: [Pacemaker] Resource Agent ethmonitor

2012-03-21 Thread Florian Haas
On Tue, Mar 20, 2012 at 4:18 PM, Fiorenza Meini wrote: > Hi there, > has anybody configured successfully the RA specified in the object of the > message? > > I got this error: if_eth0_monitor_0 (node=fw1, call=2297, rc=-2, > status=Timed Out): unknown exec error Your ethmonitor RA missed its 50-s