Re: [Pacemaker] Transition trigger for alarming

2013-04-11 Thread Andrew Beekhof
On 12/04/2013, at 6:52 AM, Keith Ouellette wrote: > We are using Nagios for network monitoring and for Pacemaker, we are using > check_crm to provide status for Nagios. This works very well if the failure > is still seen by Pacemaker at the time Nagios polls. If the failure > condition causes

Re: [Pacemaker] Pacemaker 1.1.8, Corosync, No CMAN, Promotion issues

2013-04-11 Thread Andrew Beekhof
On 12/04/2013, at 12:11 PM, pavan tc wrote: > Hi Andrew, > > Thanks much for looking at this. > > > > Then (after about 15 minutes), I see the following: > > There were no logs at all in between? > > Absolutely none in the syslog. Only the regular monitor logs from my resource > agent whic

Re: [Pacemaker] Pacemaker 1.1.8, Corosync, No CMAN, Promotion issues

2013-04-11 Thread pavan tc
Hi Andrew, Thanks much for looking at this. > Then (after about 15 minutes), I see the following: > > There were no logs at all in between? > Absolutely none in the syslog. Only the regular monitor logs from my resource agent which continued to report as secondary. I also checked /var/log/clust

Re: [Pacemaker] attrd waits one second before doing update

2013-04-11 Thread Andrew Beekhof
On 12/04/2013, at 7:17 AM, Rainer Brestan wrote: > In pacemaker 1.1.7-6 with corosync 1.4.1-7 update of attributes works almost > online. > Used with SysInfo resource agent and manual commands like "attrd_updater -U 4 > -n test". > > In the logfile there is one line > attrd[...] notice: attr

Re: [Pacemaker] Pacemaker 1.1.8, Corosync, No CMAN, Promotion issues

2013-04-11 Thread Andrew Beekhof
On 11/04/2013, at 8:15 AM, pavan tc wrote: > Hi, > > [I did go through the mail thread titled: "RHEL6 and clones: CMAN needed > anyway?", but was not sure about some answers there] > > I recently moved from pacemaker 1.1.7 to 1.1.8-7 on centos 6.2. I see the > following in syslog: > > coros

Re: [Pacemaker] compile issues with cluster-3.2.0 was: repetative membership messages

2013-04-11 Thread Andrew Beekhof
On 11/04/2013, at 2:02 PM, Daniel Black wrote: > > > > - Original Message - >> It wasn't as bad as I thought. > > Famous last words. > > running crm_mon (from pacemaker-1.1.7) reporting libcoroipcc.so.4 missing > (existed in the libcorosync4 days from corosync-1.4.2) > > Found clu

Re: [Pacemaker] Question about recovery policy after "Too many failures to fence"

2013-04-11 Thread Andrew Beekhof
On 11/04/2013, at 7:23 PM, Kazunori INOUE wrote: > Hi Andrew, > > (13.04.08 12:01), Andrew Beekhof wrote: >> >> On 27/03/2013, at 7:45 PM, Kazunori INOUE >> wrote: >> >>> Hi, >>> >>> I'm using pacemaker-1.1 (c7910371a5. the latest devel). >>> >>> When fencing failed 10 times, S_TRANSITION

Re: [Pacemaker] active standby failover

2013-04-11 Thread Andrew Beekhof
On 10/04/2013, at 7:30 PM, Rus Hughes wrote: > Hi, > > I hope I've got the right list, I'm still a little confused about where CMAN > ends and Pacemaker begins! Think of CMAN as some extra APIs for corosync. Anything you would configure in Pacemaker when using corosync is still configured th

Re: [Pacemaker] racing crm commands... last write wins?

2013-04-11 Thread Andrew Beekhof
On 11/04/2013, at 10:46 PM, Rasto Levrinc wrote: > On Thu, Apr 11, 2013 at 2:04 PM, Brian J. Murrell > wrote: >> On 13-04-11 07:37 AM, Brian J. Murrell wrote: >>> >>> In exploring all options, how about pcs? Does pcs' "resource create >>> ..." for example have the same read+modify+replace pr

Re: [Pacemaker] racing crm commands... last write wins?

2013-04-11 Thread Andrew Beekhof
On 11/04/2013, at 10:04 PM, Brian J. Murrell wrote: > On 13-04-11 07:37 AM, Brian J. Murrell wrote: >> >> In exploring all options, how about pcs? Does pcs' "resource create >> ..." for example have the same read+modify+replace problem as crm >> configure or does pcs resource create also only

[Pacemaker] attrd waits one second before doing update

2013-04-11 Thread Rainer Brestan
In pacemaker 1.1.7-6 with corosync 1.4.1-7 update of attributes works almost online. Used with SysInfo resource agent and manual commands like "attrd_updater -U 4 -n test".   In the logfile there is one line attrd[...] notice: attrd_trigger_update: Sending flush up to all hosts for: ... and a

[Pacemaker] Transition trigger for alarming

2013-04-11 Thread Keith Ouellette
We are using Nagios for network monitoring and for Pacemaker, we are using check_crm to provide status for Nagios. This works very well if the failure is still seen by Pacemaker at the time Nagios polls. If the failure condition causes a switchover, but the recovery is before the next Nagios pol

Re: [Pacemaker] issues when installing on pxe booted environment

2013-04-11 Thread John White
Ah, /dev/shm had root:root user writable only. Opening it up seems to have kicked something the right way. Thanks folks. John White HPC Systems Engineer (510) 486-7307 One Cyclotron Rd, MS: 50C-3209C Lawrence Berkeley National Lab Berkeley, CA 94720 On Apr 11, 2013, at 1:37 PM,

Re: [Pacemaker] issues when installing on pxe booted environment

2013-04-11 Thread John White
Yep, we've definitely got /dev/shm (this was done to fix an earlier problem). John White HPC Systems Engineer (510) 486-7307 One Cyclotron Rd, MS: 50C-3209C Lawrence Berkeley National Lab Berkeley, CA 94720 On Mar 27, 2013, at 4:46 PM, Andrew Beekhof wrote: > What about /dev/shm

Re: [Pacemaker] clustering with pacemaker

2013-04-11 Thread Digimer
That is a stable version. I suspect you have a configuration error. Please paste your configuration. Also please share the exact steps you are doing to try and start the cluster and what errors you get. It would also be good to share the log entries from /var/log/messages starting just before

Re: [Pacemaker] RA Supporting the reload action

2013-04-11 Thread Felix Zachlod
Hello! > -Original Message- > From: David Vossel [mailto:dvos...@redhat.com] > Sent: Thursday, 11 April 2013 16:57 > To: The Pacemaker cluster resource manager > Subject: Re: [Pacemaker] RA Supporting the reload action > > Can you paste your agent's metadata in here or in past

Re: [Pacemaker] cman based cluster fencindevice fence_pcmk

2013-04-11 Thread Florian Crouzat
On 11/04/2013 16:49, Wolfgang Routschka wrote: Hi all, one question today about cman based cluster on rhel6 and clone systems with fencingdeviceagent fence_pcmk. In my scenario the stonithdevice is an IBM based IPMI-Management Interface (IMM) so I want to use fence_ipmilan from package resource-

Re: [Pacemaker] cman based cluster fencindevice fence_pcmk

2013-04-11 Thread David Vossel
- Original Message - > From: "Wolfgang Routschka" > To: pacemaker@oss.clusterlabs.org > Sent: Thursday, April 11, 2013 9:49:10 AM > Subject: [Pacemaker] cman based cluster fencindevice fence_pcmk > > Hi all, > one question today about cman based cluster on rhel6 and clone systems with

Re: [Pacemaker] RA Supporting the reload action

2013-04-11 Thread David Vossel
- Original Message - > From: "Felix Zachlod" > To: "The Pacemaker cluster resource manager" > Sent: Thursday, April 11, 2013 8:47:50 AM > Subject: [Pacemaker] RA Supporting the reload action > > Hello again folks, > > I have been implementing a reload action for a resource agent. The act

[Pacemaker] cman based cluster fencindevice fence_pcmk

2013-04-11 Thread Wolfgang Routschka
Hi all, one question today about cman based cluster on rhel6 and clone systems with fencingdeviceagent fence_pcmk. In my scenario the stonithdevice is an IBM based IPMI-Management Interface (IMM) so I want to use fence_ipmilan from package resource-agents. After reading rhel quickstart guide ht

[Pacemaker] RA Supporting the reload action

2013-04-11 Thread Felix Zachlod
Hello again folks, I have been implementing a reload action for a resource agent. The action is advertised in the meta-data and it works when invoking it directly. According to http://linux-ha.org/wiki/OCF_Resource_Agents and http://clusterlabs.org/doc/en-US/Pacemaker/1.0/html/Pacemaker_Explained/s

Re: [Pacemaker] racing crm commands... last write wins?

2013-04-11 Thread Rasto Levrinc
On Thu, Apr 11, 2013 at 2:04 PM, Brian J. Murrell wrote: > On 13-04-11 07:37 AM, Brian J. Murrell wrote: >> >> In exploring all options, how about pcs? Does pcs' "resource create >> ..." for example have the same read+modify+replace problem as crm >> configure or does pcs resource create also onl

Re: [Pacemaker] racing crm commands... last write wins?

2013-04-11 Thread Brian J. Murrell
On 13-04-11 07:37 AM, Brian J. Murrell wrote: > > In exploring all options, how about pcs? Does pcs' "resource create > ..." for example have the same read+modify+replace problem as crm > configure or does pcs resource create also only send proper fragments to > update just the part of the CIB it

Re: [Pacemaker] racing crm commands... last write wins?

2013-04-11 Thread Brian J. Murrell
On 13-04-10 04:33 PM, Brian J. Murrell wrote: > > Does crm_resource suffer from this problem or does it properly only send > exactly the update to the CIB for the operation it's trying to achieve? In exploring all options, how about pcs? Does pcs' "resource create ..." for example have the same

Re: [Pacemaker] active standby failover

2013-04-11 Thread Rus Hughes
as an update node vfontopensips1 node vfontopensips2 primitive ClusterIPPres ocf:heartbeat:IPaddr2 \ params ip="10.30.0.176" cidr_netmask="32" \ op monitor interval="5s" primitive osp ocf:netdev:osp \ params interval="1s" \ op monitor interval="5s" \ meta allow-migrate="true" i

Re: [Pacemaker] problem with VM in pacemaker cluster

2013-04-11 Thread Yuriy Demchenko
Solved my problem. The first error was in the constraint: I had put the constraint on the "cxml" resource alone, not on the cloned "cxml-clone" - that's why "cxml" was moved first on the "standby" command. After redefining the constraint to "cxml-clone then testVM", putting the active node in standby went smoothly - VM moved

Re: [Pacemaker] node status does not change even if pacemakerd dies

2013-04-11 Thread Kazunori INOUE
Hi Andrew, (13.03.01 11:10), Andrew Beekhof wrote: > On Wed, Feb 13, 2013 at 8:14 PM, Kazunori INOUE > wrote: >> Hi Andrew, >> >> Yes, please see attached pacemaker.conf. It controls only pacemakerd. > > I've pushed up the basic one in > https://github.com/beekhof/pacemaker/commit/4bd8ac3 > > Onc