Re: [Pacemaker] Preventing Automatic Failback

2014-01-21 Thread Michael Monette
This is the last time ill update this thread. I made some guesses in my last one but everything is clear now. I am still learning lots. I had two problems. I thought they were related but they were not. The DRBD problem was I had the wfc-timeout value to 30 in the drbd.conf and Pacemaker is def

Re: [Pacemaker] pgsql RA - slave is in HS:ASYNC status and won; t promote

2014-01-21 Thread 東一彦
Hi, Tomáš. > my slave node is still async, event the select you mentioned shows async .. Is "synchronous_standby_names" set correctly ? synchronous_standby_names is PostgreSQL's parameter. That is a list of standby names that can support synchronous replication(*). (* please see below for deta

Re: [Pacemaker] Preventing Automatic Failback

2014-01-21 Thread Michael Monette
I just wanted to update you on this I checked into the scores you were talking about, and I really thought you were right! I set the score on node-1 to 1001 through the DRBD linbit script on node-1 and left the other at 1. Restarting node-1 made it all happen again. I even realized that thi

Re: [Pacemaker] [Question] About replacing in resource_set of the order limitation.

2014-01-21 Thread renayama19661014
Hi All, My test seemed to include a mistake. It seems to be replaced by two limitation. > However, I think that symmetircal="false" is applied to all order limitation > in this. > (snip) >       >         >           >         >         >           >           ... >           >      

Re: [Pacemaker] Manual fence confirmation by stonith_admin doesn't work again.

2014-01-21 Thread Bob Haxo
I have filed a bug, including crm_report, against my encounter with what seems to be the issue being referred to in this thread. Bug 5191 - pcs stonith confirm server doesn't "stonith_admin --confirm=server" appears to behave the same as "pcs stonith confirm server" I found "fence_ack_manual s

Re: [Pacemaker] crm_resource -L not trustable right after restart

2014-01-21 Thread Brian J. Murrell (brian)
On Thu, 2014-01-16 at 14:49 +1100, Andrew Beekhof wrote: > > What crm_mon are you looking at? > I see stuff like: > > virt-fencing (stonith:fence_xvm):Started rhos4-node3 > Resource Group: mysql-group > mysql-vip(ocf::heartbeat:IPaddr2): Started rhos4-node3 > mysql

Re: [Pacemaker] Preventing Automatic Failback

2014-01-21 Thread Michael Monette
Also one final thing I want to add. Corosync and pacemaker are enabled with chkconfig. So a hard reboot is esentually restarting the services too. The moment pacemaker is started at boot, this happens. (Although I've tried disabling and manually starting the services after I recover the serve

Re: [Pacemaker] Preventing Automatic Failback

2014-01-21 Thread Michael Monette
Hi David, Thanks for your reply. Just to clear it up: If everything is running on node-1 and I do a "crm node standby node-1", everything goes to node-2. When I "crm node online node-1" everything is perfectly fine and things do not get disrupted on node-2. The services remain on node-2 until

Re: [Pacemaker] Preventing Automatic Failback

2014-01-21 Thread David Vossel
- Original Message - > From: "Michael Monette" > To: pacemaker@oss.clusterlabs.org > Sent: Monday, January 20, 2014 8:22:25 AM > Subject: [Pacemaker] Preventing Automatic Failback > > Hi, > > I posted this question before but my question was a bit unclear. > > I have 2 nodes with DRBD w

Re: [Pacemaker] Stonith logging question

2014-01-21 Thread Néstor C .
Can you trace the resource? crm resource trace ... Maybe, if you can do it you get more info. 2014/1/21 Robert Lindgren > Hi, > > I'm trying to get rid of some stonith info logging but I fail :( > > The log-lines are like this in syslog: > Jan 21 13:24:15 wolf1 stonith-ng: [6349]: info: stoni

[Pacemaker] Stonith logging question

2014-01-21 Thread Robert Lindgren
Hi, I'm trying to get rid of some stonith info logging but I fail :( The log-lines are like this in syslog: Jan 21 13:24:15 wolf1 stonith-ng: [6349]: info: stonith_command: Processed st_execute from lrmd: rc=-1 Jan 21 13:24:15 wolf1 external/ipmi[11606]: [11616]: debug: ipmitool output: Chassis P

[Pacemaker] Get group behaviour with Master slave or clones envolved

2014-01-21 Thread Néstor C .
Hello. When you need that some primitives switch in block you can use a group. There is a manner to get this when you have a clone or a master/slave involved? For example: Imagine a drbd disk (DR), a filesystem over ti (FS) and a service over all (SRV). The first one is a ms resource, and the o