Re: [Pacemaker] Failing back a multi-state resource eg. DRBD

2011-03-11 Thread Dejan Muhamedagic
Hi Holger, On Fri, Mar 11, 2011 at 02:45:07PM +0100, Holger Teutsch wrote: > On Mon, 2011-03-07 at 14:21 +0100, Dejan Muhamedagic wrote: > > Hi, > > > > On Fri, Mar 04, 2011 at 09:12:46AM -0500, David McCurley wrote: > > > Are you wanting to move all the resources back or just that one resource?

Re: [Pacemaker] Failing back a multi-state resource eg. DRBD

2011-03-11 Thread Holger Teutsch
On Mon, 2011-03-07 at 14:21 +0100, Dejan Muhamedagic wrote: > Hi, > > On Fri, Mar 04, 2011 at 09:12:46AM -0500, David McCurley wrote: > > Are you wanting to move all the resources back or just that one resource? > > > > I'm still learning, but one simple way I move all resources back from nodeb

Re: [Pacemaker] Failback problem with active/active cluster

2011-03-11 Thread Andrew Beekhof
On Fri, Mar 11, 2011 at 2:19 PM, Charles KOPROWSKI wrote: > On 11/03/2011 11:47, Andrew Beekhof wrote: >> Essentially you have encountered a limitation in the allocation >> algorithm for clones in 1.0.x >> The recently released 1.1.5 has the behavior you're looking for, but >> the patch is far

Re: [Pacemaker] Failback problem with active/active cluster

2011-03-11 Thread Charles KOPROWSKI
On 11/03/2011 11:47, Andrew Beekhof wrote: On Thu, Mar 10, 2011 at 1:50 PM, Charles KOPROWSKI wrote: Hello, I set up a two-node cluster (active/active) to build an http reverse proxy/firewall. There is one VIP shared by both nodes and an apache instance running on each node. Here is the con

[Pacemaker] proper "dampen" value for ping resource

2011-03-11 Thread Klaus Darilion
Hi! I wonder what a proper value for "dampen" would be. Dampen is documented as: # attrd_updater --help|grep dampen -d, --delay=value The time to wait (dampening) in seconds further changes occur So, I would read this as the delay to forward changes, e.g. to not trigger fail-over on the f

Re: [Pacemaker] Failure after intermittent network outage

2011-03-11 Thread Pavel Levshin
Hi Andrew. I'm sorry, but I cannot agree. Look again at the DC log. Here it says: "Action lost". This is why I use this term. Then it declares every monitor action as having failed with rc=1, which is not true. Note that even those actions which were directed to a nonexistent RA are listed a

Re: [Pacemaker] Node remains offline (was Node remains online)

2011-03-11 Thread Andrew Beekhof
On Fri, Mar 11, 2011 at 12:12 PM, Bart Coninckx wrote: > Hi Andrew, > > thank you for taking the time to answer. > > On Friday 11 March 2011 10:57:36 Andrew Beekhof wrote: > >> Nothing you've shown here seems to indicate it's offline - what leads >> you to that conclusion? > > both crm_mon and hb_g

[Pacemaker] Drbd on a asymmetric cluster

2011-03-11 Thread Arthur B. Olsen
My config is: node sql01 attributes standby="off" node sql02 attributes standby="off" primitive drbd_mysql ocf:linbit:drbd params drbd_resource="r0" op monitor interval="15s" primitive fs_mysql ocf:heartbeat:Filesystem params device="/dev/drbd0" directory="/datastore01" fstype="ext4" primitive i

Re: [Pacemaker] Node remains offline (was Node remains online)

2011-03-11 Thread Bart Coninckx
Hi Andrew, thank you for taking the time to answer. On Friday 11 March 2011 10:57:36 Andrew Beekhof wrote: > Nothing you've shown here seems to indicate it's offline - what leads > you to that conclusion? Both crm_mon and hb_gui show this. Thank you, B.

Re: [Pacemaker] Patch for bugzilla 2541: Shell should warn if parameter uniqueness is violated

2011-03-11 Thread Dejan Muhamedagic
Hi, On Thu, Mar 10, 2011 at 07:08:20PM +0100, Holger Teutsch wrote: > Hi Dejan, > On Thu, 2011-03-10 at 10:14 +0100, Dejan Muhamedagic wrote: > > Hi Holger, > > > > On Wed, Mar 09, 2011 at 07:58:02PM +0100, Holger Teutsch wrote: > > > Hi Dejan, > > > > > > On Wed, 2011-03-09 at 14:00 +0100, Deja

Re: [Pacemaker] problem with apache coming up

2011-03-11 Thread Andrew Beekhof
On Wed, Feb 16, 2011 at 4:50 PM, Testuser SST wrote: > Failed actions: >    Apache_start_0 (node=astinos, call=19, rc=1, status=complete): unknown > error > > Any suggestions? Apache works normally with a service httpd > stop/start command. > Well, that's not the same script that the

Re: [Pacemaker] Failback problem with active/active cluster

2011-03-11 Thread Andrew Beekhof
On Thu, Mar 10, 2011 at 1:50 PM, Charles KOPROWSKI wrote: > Hello, > > I set up a two-node cluster (active/active) to build an http reverse > proxy/firewall. There is one VIP shared by both nodes and an apache instance > running on each node. > > Here is the configuration : > > node lpa \ >

Re: [Pacemaker] shutting down pacemaker/corosync without shutting down the services

2011-03-11 Thread Andrew Beekhof
is-managed-default=false On Fri, Mar 11, 2011 at 11:29 AM, Klaus Darilion wrote: > Hi! > > For maintenance reasons (e.g. updating pacemaker) it might be necessary > to shut down pacemaker. But in such cases I want the services to > keep running. > > Is it possible to shut down pacemaker but

Re: [Pacemaker] Failure after intermittent network outage

2011-03-11 Thread Andrew Beekhof
On Thu, Mar 10, 2011 at 1:03 PM, Pavel Levshin wrote: > Hi, > > No, I think you've missed the point. RA did not answer at all. Monitor > actions had been lost due to a cluster transition: You are incorrect. While it is true that some actions were NACKs (not lost), such NACKs do not make it into

Re: [Pacemaker] shutting down pacemaker/corosync without shutting down the services

2011-03-11 Thread Michael Schwartzkopff
On Friday 11 March 2011 11:29:47 Klaus Darilion wrote: > Hi! > > For maintenance reasons (e.g. updating pacemaker) it might be necessary > to shut down pacemaker. But in such cases I want the services to > keep running. > > Is it possible to shut down pacemaker but keep the current service >

[Pacemaker] shutting down pacemaker/corosync without shutting down the services

2011-03-11 Thread Klaus Darilion
Hi! For maintenance reasons (e.g. updating pacemaker) it might be necessary to shut down pacemaker. But in such cases I want the services to keep running. Is it possible to shut down pacemaker but keep the current service state, i.e. all services should keep running on their current node. th

Re: [Pacemaker] Node remains offline (was Node remains online)

2011-03-11 Thread Andrew Beekhof
On Thu, Mar 10, 2011 at 9:10 PM, Bart Coninckx wrote: > Hi all, > > I have a three node cluster and while introducing the third node, it > remains offline no matter what I do. Nothing you've shown here seems to indicate it's offline - what leads you to that conclusion? > Another symptom is that s