[Pacemaker] why dead node bring back ?

2012-10-11 Thread Mars gu
Hi all, my cluster is corosync-2.0.1(used votequorum) and pacemaker-1.1.7(come from fedora 17). There are three nodes in it (h10_147,h10_148,h10_149). I executed the following CLI on h10_147. /etc/init.d/pacemaker stop /etc/init.d/corosync stop th

Re: [Pacemaker] chicken-egg-problem with libvirtd and a VM within cluster

2012-10-11 Thread Andrew Beekhof
This has been a topic that has popped up occasionally over the years. Unfortunately we still don't have a good answer for you. The "least worst" practice has been to have the RA return OCF_STOPPED for non-recurring monitor operations (aka. startup probes) IFF its pre-requistites (ie. binaries, or

[Pacemaker] Patrik Rapposch is out of the office

2012-10-11 Thread Patrik . Rapposch
Ich werde ab 12.10.2012 nicht im Büro sein. Ich kehre zurück am 13.10.2012. Please note, that I am not available. Please always use "ksi.netw...@knapp.com", which ensures that one of our network adminsitrators takes care of your interest. ___ Pacemak

Re: [Pacemaker] STONITHed node cannot rejoin cluster for over 1000 elections

2012-10-11 Thread Andrew Beekhof
On Fri, Oct 12, 2012 at 7:12 AM, Andrew Martin wrote: > Hello, > > > I am running a 3-node Corosync+Pacemaker cluster with 2 "real" nodes running > resources (storage0 and storage1) and a quorum node (storagequorum) in > standby mode. All of the nodes run Ubuntu 12.04 server amd64. There are two

[Pacemaker] STONITHed node cannot rejoin cluster for over 1000 elections

2012-10-11 Thread Andrew Martin
Hello, I am running a 3-node Corosync+Pacemaker cluster with 2 "real" nodes running resources (storage0 and storage1) and a quorum node (storagequorum) in standby mode. All of the nodes run Ubuntu 12.04 server amd64. There are two corosync rings: rrp_mode: active interface {

Re: [Pacemaker] centos 6 fence_apc parameter error

2012-10-11 Thread Michael Brennen
On Thu, 11 Oct 2012, Dejan Muhamedagic wrote: On Wed, Oct 10, 2012 at 09:45:46AM -0500, Michael Brennen wrote: On Wed, 10 Oct 2012, Dejan Muhamedagic wrote: Hi, On Tue, Oct 09, 2012 at 06:47:30PM -0500, Michael Brennen wrote: Hello all, I have built a two node apache/mysql cluster, with dr

[Pacemaker] chicken-egg-problem with libvirtd and a VM within cluster

2012-10-11 Thread Tom Fernandes
Hi all, I have a 2-node-cluster running DRBD, libvirtd and a virtual machine. I observed that when I stop and start corosync on one of the nodes, pacemaker (when starting corosync again) wants to check the status of the vm before starting libvirtd. This check fails as libvirtd needs to be runni

Re: [Pacemaker] ClusterLabs.org Documentation Update

2012-10-11 Thread Lars Marowsky-Bree
On 2012-10-11T10:52:29, Andrew Beekhof wrote: > In addition to some updates for 1.1.8, the documentation at > http://www.clusterlabs.org/doc/ now comes in two flavours. > Clusters from Scratch (and to a lesser extent Pacemaker Explained) now come > in "pcs" and "crmsh" editions. > > So regardl

Re: [Pacemaker] A patch for stonith external/libvirt

2012-10-11 Thread Dejan Muhamedagic
Hi Owen, On Wed, Oct 10, 2012 at 10:07:41AM +0100, Owen Le Blanc wrote: > I attach a patch for the stonith agent external/libvirt. This agent > was failing on our machines because for rebooting machines it tried to > stop and then start them, which doesn't work on our system, while > rebooting th

Re: [Pacemaker] centos 6 fence_apc parameter error

2012-10-11 Thread Dejan Muhamedagic
Hi Rasto, On Thu, Oct 11, 2012 at 11:31:10AM +0200, Rasto Levrinc wrote: > On Thu, Oct 11, 2012 at 10:29 AM, Dejan Muhamedagic > wrote: > > On Wed, Oct 10, 2012 at 09:45:46AM -0500, Michael Brennen wrote: > >> On Wed, 10 Oct 2012, Dejan Muhamedagic wrote: > > >> >> > >> >>==

Re: [Pacemaker] high cib load on config change

2012-10-11 Thread James Harper
FWIW, I'm running ocfs2 and looking through the logs a bit more my symptoms seem to match those discussed here - http://www.mentby.com/Group/linux-ha/crmd-31942-warn-decodetransitionkey-bad-uuid-crm-resource-25438-in-sscanf-result-3-for-00crm-resource-25438.html And my test cluster (built on a b

Re: [Pacemaker] centos 6 fence_apc parameter error

2012-10-11 Thread Rasto Levrinc
On Thu, Oct 11, 2012 at 10:29 AM, Dejan Muhamedagic wrote: > On Wed, Oct 10, 2012 at 09:45:46AM -0500, Michael Brennen wrote: >> On Wed, 10 Oct 2012, Dejan Muhamedagic wrote: >> >> >> >>= >> >>crm configure primitive f_apc stonith:fence_apc params \ >> >>

Re: [Pacemaker] How to gang multiple APC AP7901 outlets together under the same name

2012-10-11 Thread Dejan Muhamedagic
On Thu, Oct 11, 2012 at 11:34:20AM +1100, Andrew Beekhof wrote: > Nice guide. You should consider adding it to the wiki. Indeed. You can apply for an account at http://www.linux-ha.org/wiki Cheers, Dejan > On Thu, Oct 4, 2012 at 12:20 AM, Epps, Josh wrote: > > Procedure: How to gang multiple A

Re: [Pacemaker] Announce: pcs-0.9.26

2012-10-11 Thread LGL Extern
I checked the source code and found that the current version uses a mixture of fixed pathes for the binaries and calls without paths. E.g. "cibadmin" or "/usr/sbin/crm_mon". Do you have plans to support installations where the software is installed in /opt/ha? This is the case when the option --p

Re: [Pacemaker] centos 6 fence_apc parameter error

2012-10-11 Thread Dejan Muhamedagic
On Wed, Oct 10, 2012 at 09:45:46AM -0500, Michael Brennen wrote: > On Wed, 10 Oct 2012, Dejan Muhamedagic wrote: > > >Hi, > > > >On Tue, Oct 09, 2012 at 06:47:30PM -0500, Michael Brennen wrote: > >>Hello all, > >> > >>I have built a two node apache/mysql cluster, with drbd syncing the > >>two. I