Re: [Pacemaker] promote is delayed more than 60 sec after stonith reset

2009-10-13 Thread Andrew Beekhof
On Wed, Oct 14, 2009 at 2:48 AM, hj lee wrote: > Hi, > > I configured a two-node cluster on RHEL 5.3 with the following resources. > Note that I am using pacemaker-1.0.6. > - IPMI stonith as a clone. Each IPMI clone is monitoring the other node. > - One Master/Slave resource: Master is running on n

Re: [Pacemaker] more package misery

2009-10-13 Thread Andrew Beekhof
On Tue, Oct 13, 2009 at 6:04 PM, Johan Verrept wrote: > > Hi, > > when I install the latest resource-agents, it installs > etc/ha.d/shellfuncs > > This file includes @OCF_RA_DIR@/heartbeat/.ocf-shellfuncs at the end. Not here. I see: # Now get the good stuff . /usr/lib/ocf/resource.d//heartbeat/

Re: [Pacemaker] Ping Node

2009-10-13 Thread Andrew Beekhof
On Tue, Oct 13, 2009 at 9:37 PM, Fausto Lombardi wrote: > But in the SLES 11 HA documentation I find > > OCF_RESKEY_host_list=Host list > > The list of ping nodes to count. Defaults to all configured ping > nodes. Rarely needs to be specified. This is left over from when the SLES stack used to be

[Pacemaker] promote is delayed more than 60 sec after stonith reset

2009-10-13 Thread hj lee
Hi, I configured a two-node cluster on RHEL 5.3 with the following resources. Note that I am using pacemaker-1.0.6. - IPMI stonith as a clone. Each IPMI clone is monitoring the other node. - One Master/Slave resource: Master is running on node1, Slave is running on node2. - One FakeIPMI resource.

Re: [Pacemaker] crm command syntax for defining multiple instance_attributes

2009-10-13 Thread hj lee
Thank you, it works OK now! On Mon, Oct 12, 2009 at 1:31 AM, Andrew Beekhof wrote: > On Fri, Oct 9, 2009 at 10:08 PM, hj lee wrote: > > Hi Andrew, > > > > According to the Pacemaker Configuration Explained, the meta attribute > and > > instance attribute object can have rule inside of it. But c

Re: [Pacemaker] Ping Node

2009-10-13 Thread Fausto Lombardi
But in the SLES 11 HA documentation I find: OCF_RESKEY_host_list=Host list. The list of ping nodes to count. Defaults to all configured ping nodes. Rarely needs to be specified. What is this? 2009/10/12 Andrew Beekhof: > On Mon, Oct 5, 2009 at 4:14 PM, Fausto Lombardi wrote: >> Hi all, >> I'm

Re: [Pacemaker] Recommended versions for the cluster stack?

2009-10-13 Thread Andrew Beekhof
Feel like adding that to the wiki? On Tue, Oct 13, 2009 at 7:04 PM, Jonathan wrote: > Thomas Georgiou wrote: >> I do not know what the recommended versions are, but I wrote ebuilds >> for the pacemaker/corosync cluster stack with the latest versions that >> were available at the time.  They are l

Re: [Pacemaker] Recommended versions for the cluster stack?

2009-10-13 Thread Andrew Beekhof
On Tue, Oct 13, 2009 at 7:49 PM, Jonathan wrote: >> There shouldn't be any dependency between ocfs2 and openais. >> > It was a while ago. The project has been on hold for a month or so, but > IIRC, the problem was that the control daemons (dlm_controld.pcmk and > ocfs2_controld.pcmk) had some depe

Re: [Pacemaker] Recommended versions for the cluster stack?

2009-10-13 Thread Jonathan
Andrew Beekhof wrote: > On Tue, Oct 13, 2009 at 6:39 PM, Jonathan wrote: > >> Andrew Beekhof wrote: >> openais >>> 0.80.5 >>> definitely do NOT use 1.x at this point. >>> >>> >> This is likely the source of my problems. I am currently using 1.0.x+ >> and coro

Re: [Pacemaker] Recommended versions for the cluster stack?

2009-10-13 Thread Jonathan
Thomas Georgiou wrote: > I do not know what the recommended versions are, but I wrote ebuilds > for the pacemaker/corosync cluster stack with the latest versions that > were available at the time. They are located here > http://code.tjhsst.edu/vm-overlay/vm-overlay. Unfortunately, I was not aware

Re: [Pacemaker] Recommended versions for the cluster stack?

2009-10-13 Thread Andrew Beekhof
On Tue, Oct 13, 2009 at 6:39 PM, Jonathan wrote: > Andrew Beekhof wrote: >>> openais >>> >> >> 0.80.5 >> definitely do NOT use 1.x at this point. >> > This is likely the source of my problems. I am currently using 1.0.x+ > and corosync. Yep. You can't shut down cluster nodes at all with that vers

[Pacemaker] updating mac address during fail-over

2009-10-13 Thread Billy Guthrie
Hello, I am using pacemaker-openais version 1.0.4.1 and I have a question: is it possible to update the MAC address on fail-over? I would like to keep the MAC address the same from one NIC to the other, as I have an application that binds the license key to the MAC address. If that is not possibl

Re: [Pacemaker] Human confirmation of dead node?

2009-10-13 Thread Dejan Muhamedagic
Hi, On Tue, Oct 13, 2009 at 05:57:25PM +0200, J Brack wrote: > On 10/13/09, Dejan Muhamedagic wrote: > > Hi, > > > > On Tue, Oct 13, 2009 at 03:23:11PM +0200, J Brack wrote: > >> Hi, > >> > >> I'm currently using heartbeat. I heard that I'm meant to be using > >> pacemaker. I will switch in a hea

Re: [Pacemaker] Recommended versions for the cluster stack?

2009-10-13 Thread Jonathan
Andrew Beekhof wrote: > On Sat, Oct 10, 2009 at 1:13 AM, Jonathan wrote: > >> Hello: >> >> I'm trying to set up a pacemaker/openais cluster on Gentoo, which >> requires building the entire stack from source. >> I have everything running, but the stack has some strange stability >> issues, (in p

Re: [Pacemaker] more package misery

2009-10-13 Thread Johan Verrept
On Tue, 2009-10-13 at 18:04 +0200, Johan Verrept wrote: > Hi, > > when I install the latest resource-agents, it installs > etc/ha.d/shellfuncs > > This file includes @OCF_RA_DIR@/heartbeat/.ocf-shellfuncs at the end. > > .ocf-shellfuncs was in heartbeat 2.99.2, but is no longer present in > hear

[Pacemaker] more package misery

2009-10-13 Thread Johan Verrept
Hi, when I install the latest resource-agents, it installs etc/ha.d/shellfuncs This file includes @OCF_RA_DIR@/heartbeat/.ocf-shellfuncs at the end. .ocf-shellfuncs was in heartbeat 2.99.2, but is no longer present in heartbeat 3.0.0. As far as I can tell this is now lib/heartbeat/ocf-shellfunc

Re: [Pacemaker] Human confirmation of dead node?

2009-10-13 Thread J Brack
On 10/13/09, Dejan Muhamedagic wrote: > Hi, > > On Tue, Oct 13, 2009 at 03:23:11PM +0200, J Brack wrote: >> Hi, >> >> I'm currently using heartbeat. I heard that I'm meant to be using >> pacemaker. I will switch in a heartbeat (sorry) if I can get pacemaker >> to do what I need. > > http://cluster

Re: [Pacemaker] Property of resource group

2009-10-13 Thread Johan Verrept
On Tue, 2009-10-13 at 17:49 +0200, Fausto Lombardi wrote: > Hi all, > can I create a group in which the resources start in sequence and, if > one of them fails, the others fail in sequence as well? This is how a "group" behaves. See the manual under "group". J.
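For illustration, here is a minimal crm shell sketch of such a group. The names p_first, p_second, p_third and g_example are hypothetical, and ocf:pacemaker:Dummy is only a placeholder agent standing in for real resources. Treat it as a sketch under those assumptions, not a tested configuration.

```
# Hypothetical placeholder resources; replace them with the real ones.
primitive p_first ocf:pacemaker:Dummy op monitor interval="10s"
primitive p_second ocf:pacemaker:Dummy op monitor interval="10s"
primitive p_third ocf:pacemaker:Dummy op monitor interval="10s"
# Members start in the listed order and stop in reverse order.
group g_example p_first p_second p_third
```

A group implies both ordering and colocation between its members, so no separate constraints are needed for them. If a member cannot run, the members listed after it are not started.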

Re: [Pacemaker] Human confirmation of dead node?

2009-10-13 Thread J Brack
On 10/13/09, Florian Haas wrote: > Please be introduced to the "meatware" stonith plugin. > > Cheers > Florian Unfortunately meatware is only for confirming a node has been reset. If your network goes down, the nodes still swap services momentarily. This is what I am trying to avoid. J

[Pacemaker] Property of resource group

2009-10-13 Thread Fausto Lombardi
Hi all, can I create a group in which the resources start in sequence and, if one of them fails, the others fail in sequence as well? I would like to do this to avoid creating a lot of constraints. Thanks. ___ Pacemaker mailing list Pacemaker@

Re: [Pacemaker] Human confirmation of dead node?

2009-10-13 Thread Dejan Muhamedagic
Hi, On Tue, Oct 13, 2009 at 03:23:11PM +0200, J Brack wrote: > Hi, > > I'm currently using heartbeat. I heard that I'm meant to be using > pacemaker. I will switch in a heartbeat (sorry) if I can get pacemaker > to do what I need. http://clusterlabs.org/wiki/Project_History > I have a clustered

Re: [Pacemaker] Updated packages for Debian w/ openais only

2009-10-13 Thread Raoul Bhatia [IPAX]
hi, On 10/13/2009 04:16 PM, Martin Gerhard Loschwitz wrote: > once again I announce another set of packages; given that corosync > has so far proven to be a moving target, I decided that at least > for the moment, the packages for lenny are going to be openais with > whitetank only. > > I have up

[Pacemaker] Updated packages for Debian w/ openais only

2009-10-13 Thread Martin Gerhard Loschwitz
Folks, once again I announce another set of packages; given that corosync has so far proven to be a moving target, I decided that at least for the moment, the packages for lenny are going to be openais with whitetank only. I have updated my packages at people.debian.org accordingly; they contain

Re: [Pacemaker] Human confirmation of dead node?

2009-10-13 Thread Florian Haas
Please be introduced to the "meatware" stonith plugin. Cheers Florian On 2009-10-13 15:23, J Brack wrote: > Hi, > > I'm currently using heartbeat. I heard that I'm meant to be using > pacemaker. I will switch in a heartbeat (sorry) if I can get pacemaker > to do what I need. > > I have a cluste

[Pacemaker] Human confirmation of dead node?

2009-10-13 Thread J Brack
Hi, I'm currently using heartbeat. I heard that I'm meant to be using pacemaker. I will switch in a heartbeat (sorry) if I can get pacemaker to do what I need. I have a clustered nfs server, primary is in datacenter1 close to the users, secondary is in datacenter2 not close to the users. There is

Re: [Pacemaker] Resource stickiness during working hours

2009-10-13 Thread Johan Verrept
On Tue, 2009-10-13 at 13:52 +0200, Dejan Muhamedagic wrote: > crm knows when the user's not in the interactive mood, so it may > behave accordingly. Though the error message is still going to > remain, it will be less obtrusive and go to stderr. Awesome! Thanks, J.

Re: [Pacemaker] Resource stickiness during working hours

2009-10-13 Thread Dejan Muhamedagic
Hi, On Tue, Oct 13, 2009 at 11:55:38AM +0200, Johan Verrept wrote: > Hi Dejan, > > On Wed, 2009-10-07 at 17:06 +0200, Dejan Muhamedagic wrote: > > Yes, that's no problem, it's just that I'm not sure about how to > > design it since the language is, well, rather flat. > > Might it be possible to

Re: [Pacemaker] Resource stickiness during working hours

2009-10-13 Thread Johan Verrept
Hi Dejan, On Wed, 2009-10-07 at 17:06 +0200, Dejan Muhamedagic wrote: > Yes, that's no problem, it's just that I'm not sure about how to > design it since the language is, well, rather flat. Might it be possible to at least let crm recognise the configuration as valid or ignore it even if it isn'

Re: [Pacemaker] why use ocf::linbit:drbd instead of ocf::heartbeat:drbd?

2009-10-13 Thread Andrew Beekhof
On Tue, Oct 13, 2009 at 9:40 AM, Johan Verrept wrote: > On Mon, 2009-10-12 at 09:06 +0200, Andrew Beekhof wrote: >> On Mon, Oct 12, 2009 at 8:43 AM, Florian Haas >> wrote: >> > Andrew, Dejan: as we consider the ocf:linbit:drbd RA stable as of the >> > DRBD 8.3.4 release, is it acceptable to remo

Re: [Pacemaker] why use ocf::linbit:drbd instead of ocf::heartbeat:drbd?

2009-10-13 Thread Johan Verrept
On Mon, 2009-10-12 at 09:06 +0200, Andrew Beekhof wrote: > On Mon, Oct 12, 2009 at 8:43 AM, Florian Haas wrote: > > Andrew, Dejan: as we consider the ocf:linbit:drbd RA stable as of the > > DRBD 8.3.4 release, is it acceptable to remove the legacy RA from the > > agents repository? > > I think ra

Re: [Pacemaker] why use ocf::linbit:drbd instead of ocf::heartbeat:drbd?

2009-10-13 Thread Johan Verrept
On Tue, 2009-10-13 at 09:51 +0200, Florian Haas wrote: > On 2009-10-13 09:40, Johan Verrept wrote: > > On Mon, 2009-10-12 at 09:06 +0200, Andrew Beekhof wrote: > >> On Mon, Oct 12, 2009 at 8:43 AM, Florian Haas > >> wrote: > >>> Andrew, Dejan: as we consider the ocf:linbit:drbd RA stable as of th

Re: [Pacemaker] Resource stickiness during working hours

2009-10-13 Thread Johan Verrept
On Sun, 2009-10-11 at 21:57 +0200, Andrew Beekhof wrote: > just fixed it now, thanks > also needed to change days -> weekdays and add a score to the rule itself. It works now. Thank you for your help and confirmation. J.

Re: [Pacemaker] build problem: HA_LIBDIR/HA_HBCONF_DIR redefined

2009-10-13 Thread Johan Verrept
On Sun, 2009-10-11 at 21:33 +0200, Andrew Beekhof wrote: > Then you didn't update glue properly :-) > glue_config.h doesn't contain HA_LIBDIR anymore. Correct. Updated glue and it builds fine now :) On Mon, 2009-10-12 at 10:59 +0200, Dejan Muhamedagic wrote: > This one's mine. Fixed today. Confi

[Pacemaker] openais not being correctly stopped upon shutdown/reboot on RH systems

2009-10-13 Thread Nikola Ciprich
Hello, I noticed that when I type poweroff/reboot on any of my cluster nodes, the node always changes state to UNCLEAN in crm_mon on the other nodes. While looking into it, I've discovered that there is a quite serious problem which prevents openais stop from being executed while switching runlevels. It

Re: [Pacemaker] why use ocf::linbit:drbd instead of ocf::heartbeat:drbd?

2009-10-13 Thread Johan Verrept
On Tue, 2009-10-13 at 09:52 +0200, Andrew Beekhof wrote: > On Tue, Oct 13, 2009 at 9:40 AM, Johan Verrept wrote: > > On Mon, 2009-10-12 at 09:06 +0200, Andrew Beekhof wrote: > >> On Mon, Oct 12, 2009 at 8:43 AM, Florian Haas > >> wrote: > >> > Andrew, Dejan: as we consider the ocf:linbit:drbd RA

Re: [Pacemaker] openais not being correctly stopped upon shutdown/reboot on RH systems

2009-10-13 Thread Nikola Ciprich
Hi Andrew, thanks a lot, I've fixed it myself already, just wanted to report the problem. Good to hear it's been already reported and fixed :) have a nice day. nik. On Tue, Oct 13, 2009 at 09:56:25AM +0200, Andrew Beekhof wrote: > Patch is here: > > https://lists.linux-foundation.org/pipermail

Re: [Pacemaker] openais not being correctly stopped upon shutdown/reboot on RH systems

2009-10-13 Thread Andrew Beekhof
Patch is here: https://lists.linux-foundation.org/pipermail/openais/2009-October/013126.html We need to get it into the build service packages. On Tue, Oct 13, 2009 at 9:24 AM, Nikola Ciprich wrote: > Hello, > I noticed that when I type poweroff/reboot on any of my cluster nodes, > the node alway

Re: [Pacemaker] why use ocf::linbit:drbd instead of ocf::heartbeat:drbd?

2009-10-13 Thread Florian Haas
On 2009-10-13 09:40, Johan Verrept wrote: > On Mon, 2009-10-12 at 09:06 +0200, Andrew Beekhof wrote: >> On Mon, Oct 12, 2009 at 8:43 AM, Florian Haas >> wrote: >>> Andrew, Dejan: as we consider the ocf:linbit:drbd RA stable as of the >>> DRBD 8.3.4 release, is it acceptable to remove the legacy R

Re: [Pacemaker] crmd segfault on opensuse 11.1

2009-10-13 Thread Andrew Beekhof
On Tue, Oct 13, 2009 at 8:11 AM, Stratos Zolotas wrote: > > > > After restarting in debug mode I have a segfault. > > I'm attaching a core file found in /var/lib/heartbeat/cores/hacluster. > > Hope it helps Nope, sorry. Cores are only viewable on the machine they were created on. Please subm