[Pacemaker] System Health backend part

2009-05-29 Thread Mark Hamzy
I would like to see a complete solution for system health shipped with pacemaker. Would you be opposed to including the backend parts that monitor system health into pacemaker such as daemons or command line programs? One of the ways to determine the health of a system is to listen to IPMI even

Re: [Pacemaker] lenny + clvm + pacemaker/openais...

2009-05-29 Thread Steven Dake
> > > > I did what you suggest (I used 1, 2 and 3) but the problem remains... I am > > not > > particularly tied to the legacy version. I'd rather try the newer stuff. The > > LVM people just added a corosync interface to clvm. Is pacemaker usable with > > the latest corosync or should I wait a bi

Re: [Pacemaker] lenny + clvm + pacemaker/openais...

2009-05-29 Thread Andrew Beekhof
On Fri, May 29, 2009 at 8:48 PM, Alain St-Denis wrote: > Le 29 mai 2009, Andrew Beekhof a écrit : >> Almost certainly the following is the problem >> >> > May 28 16:19:05.081036 [crm  ] info: pcmk_peer_update: MEMB: lab07 >> > 1796245390 May 28 16:19:05.081044 [crm  ] info: pcmk_peer_update: MEMB:

Re: [Pacemaker] lenny + clvm + pacemaker/openais...

2009-05-29 Thread Alain St-Denis
Le 29 mai 2009, Andrew Beekhof a écrit : > Almost certainly the following is the problem > > > May 28 16:19:05.081036 [crm ] info: pcmk_peer_update: MEMB: lab07 > > 1796245390 May 28 16:19:05.081044 [crm ] info: pcmk_peer_update: MEMB: > > lab09 1829799822 > > Try setting nodeid: in openais.conf

Re: [Pacemaker] [Linux-HA] new doc about stonith/fencing

2009-05-29 Thread Ryan Steele
Jan Kalcic wrote: Really interesting. I would have appreciated some more example (they are always welcome) but still very interesting. Thanks, Jan Dejan Muhamedagic wrote: Hi, Trying to make it a bit less mysterious, I wrote something about fencing and stonith quite a while ago and then forgo

[Pacemaker] hb_gui on C5/EL5 x86_64 broken?

2009-05-29 Thread Rainer Traut
Hi, am using: # rpm -qf /usr/bin/hb_gui pacemaker-mgmt-client-1.99.1-2.1 from opensuse build service. Any change I make with hb_gui results in wierd errors: Traceback (most recent call last): File "/usr/bin/hb_gui", line 2281, in on_apply self.update_crm_config(metadata, new_crm_config)

Re: [Pacemaker] Managing resources - classes

2009-05-29 Thread Dejan Muhamedagic
On Fri, May 29, 2009 at 01:27:07PM +0200, Andrew Beekhof wrote: > On Fri, May 29, 2009 at 12:16 PM, Dejan Muhamedagic > wrote: > > Another option would be to implement an alternative parser. > > At the shell level or in the cib? In the shell. I understand that the CIB remains XML. > > Somethin

Re: [Pacemaker] [Linux-HA] new doc about stonith/fencing

2009-05-29 Thread Jan Kalcic
Really interesting. I would have appreciated some more example (they are always welcome) but still very interesting. Thanks, Jan Dejan Muhamedagic wrote: > Hi, > > Trying to make it a bit less mysterious, I wrote something about > fencing and stonith quite a while ago and then forgot to share > t

Re: [Pacemaker] eth0:0: warning: name may be invalid

2009-05-29 Thread Neil Katin
Use "eth0", not "eth0:0" for the "nic" nvpair. As the script built-in docs say: The base network interface on which the IP address will be brought online. If left empty, the script will try and determine this from the routing table. Do NOT specify an alias interface in the form eth0

Re: [Pacemaker] Managing resources - classes

2009-05-29 Thread Andrew Beekhof
On Fri, May 29, 2009 at 12:16 PM, Dejan Muhamedagic wrote: > Another option would be to implement an alternative parser. At the shell level or in the cib? > Something like named.conf (C like). That one looks more pleasing > to me, but I may be biased ;-) Agreed, its also quite readable ___

Re: [Pacemaker] Managing resources - classes

2009-05-29 Thread Dejan Muhamedagic
On Fri, May 29, 2009 at 10:13:35AM +0200, Andrew Beekhof wrote: > On Thu, May 28, 2009 at 7:33 PM, Dejan Muhamedagic > wrote: > > Hi, > > > > On Thu, May 28, 2009 at 03:59:49PM +0200, Andrew Beekhof wrote: > >> On Wed, May 27, 2009 at 4:22 PM, Dejan Muhamedagic > >> wrote: > >> > Hi, > >> > > >

Re: [Pacemaker] eth0:0: warning: name may be invalid

2009-05-29 Thread Andrew Beekhof
2009/5/29 Димитър Бойн : > Thank you, Neil! > > I have already tried this but eth0 has IP Address "on boot" > The resource does not start if I change the "nic" value tp eth0 only :( > and I indeed need it to be just additional eth0:0. Thats what it would normally do. It adds an alias to an already

Re: [Pacemaker] lenny + clvm + pacemaker/openais...

2009-05-29 Thread Andrew Beekhof
Almost certainly the following is the problem > May 28 16:19:05.081036 [crm ] info: pcmk_peer_update: MEMB: lab07 1796245390 > May 28 16:19:05.081044 [crm ] info: pcmk_peer_update: MEMB: lab09 1829799822 Try setting nodeid: in openais.conf with a single digit value. Various parts of the kernel

Re: [Pacemaker] Managing resources - classes

2009-05-29 Thread Andrew Beekhof
On Thu, May 28, 2009 at 7:33 PM, Dejan Muhamedagic wrote: > Hi, > > On Thu, May 28, 2009 at 03:59:49PM +0200, Andrew Beekhof wrote: >> On Wed, May 27, 2009 at 4:22 PM, Dejan Muhamedagic >> wrote: >> > Hi, >> > >> > On Wed, May 27, 2009 at 08:42:06AM -0400, Eliot Gable wrote: >> >> http://cluster

Re: [Pacemaker] OCF_RESKEY_CRM_meta* envars

2009-05-29 Thread Andrew Beekhof
On Thu, May 28, 2009 at 10:32 PM, Florian Haas wrote: > Andrew, > > would you mind pointing me to wherever in the code the > OCF_RESKEY_CRM_meta* variables are set and passed into the RA > environment? I'd like to understand where and how this happens, have > been unable to find documentation, and

Re: [Pacemaker] Wiki example problems

2009-05-29 Thread Andrew Beekhof
Try this: http://clusterlabs.org/wiki/FAQ#I_Killed_a_Node_but_the_Cluster_Didn.27t_Recover On Thu, May 28, 2009 at 10:35 PM, Ryan Steele wrote: > After following the wiki example for sharing an IP address > (http://clusterlabs.org/wiki/Example_configurations), I'm able to manually > fail over

Re: [Pacemaker] eth0:0: warning: name may be invalid

2009-05-29 Thread Ty! Boyack
Interesting. Sorry to have offered things that you already knew :-) I think you may have a bug case -- I would imagine that systems with two interfaces on the same subnet are less tested than other scenarios. I know I had a similar situation on a HA pair for a firewall, and I just ended up w

Re: [Pacemaker] Bug in crm_verify

2009-05-29 Thread Andrew Beekhof
On Thu, May 28, 2009 at 10:56 PM, Dan Urist wrote: > This is minor, but crm_verify from v.1.0.3 of pacemaker apparently > doesn't support --verbose as stated. The short option (-V) works. > My apologies if this isn't the right place to report a bug-- I didn't > see a bug tracker at clusterlabs.org

Re: [Pacemaker] [LIST:Pacemaker] eth0:0: warning: name may be invalid

2009-05-29 Thread Димитър Бойн
Thanks a lot! Indeed! - I do have two interfaces on the same network... :) Should this inability be considered a bug? Best Regards! ___ Dimitar Boyn - Original Message - From: Ty! Boyack To: 'pacema...@clusterlabs.org' Sent: Thu May 28 23:36:39 2009 Subject: Re: [Pacemaker] [LIST:Pa