Re: [Pacemaker] Best way to recover from failed STONITH?

2012-12-21 Thread Andreas Kurz
On 12/21/2012 07:47 PM, Andrew Martin wrote: > Andreas, > > Thanks for the help. Please see my replies inline below. > > - Original Message - >> From: "Andreas Kurz" >> To: pacemaker@oss.clusterlabs.org >> Sent: Friday, December 21, 2012 10:11:08 AM >> Subject: Re: [Pacemaker] Best way t

Re: [Pacemaker] Multi-state slave resource promoted when node was not quorate, expected?

2012-12-21 Thread Jesse Hathaway
On Mon, Dec 17, 2012 at 7:19 PM, Andrew Beekhof wrote: > No. That sounds like a bug. Can you attach a crm_report tarball to a > bugzilla entry please? Thanks Andrew, Bug reported created, please let me know if you need any other diagnostic information: https://developerbugs.linuxfoundation.o

Re: [Pacemaker] Best way to recover from failed STONITH?

2012-12-21 Thread Andrew Martin
Andreas, Thanks for the help. Please see my replies inline below. - Original Message - > From: "Andreas Kurz" > To: pacemaker@oss.clusterlabs.org > Sent: Friday, December 21, 2012 10:11:08 AM > Subject: Re: [Pacemaker] Best way to recover from failed STONITH? > > On 12/21/2012 04:18 PM,

Re: [Pacemaker] Best way to recover from failed STONITH?

2012-12-21 Thread Andreas Kurz
On 12/21/2012 04:18 PM, Andrew Martin wrote: > Hello, > > Yesterday a power failure took out one of the nodes and its STONITH device > (they share an upstream power source) in a 3-node active/passive cluster > (Corosync 2.1.0, Pacemaker 1.1.8). After logging into the cluster, I saw that > the S

[Pacemaker] Best way to recover from failed STONITH?

2012-12-21 Thread Andrew Martin
Hello, Yesterday a power failure took out one of the nodes and its STONITH device (they share an upstream power source) in a 3-node active/passive cluster (Corosync 2.1.0, Pacemaker 1.1.8). After logging into the cluster, I saw that the STONITH operation had given up in failure and that none of

Re: [Pacemaker] booth is the state of "started" on pacemaker before booth write ticket info in cib.

2012-12-21 Thread Jiaju Zhang
On Fri, 2012-12-21 at 15:44 +0900, Yuichi SEINO wrote: > Hi Jiaju, > > 2012/12/18 Jiaju Zhang : > > On Mon, 2012-12-17 at 10:40 +0900, Yuichi SEINO wrote: > >> Hi Jiaju, > >> > >> >> >> > >> >> >> Perhaps, this problem didn't happen before the following commit. > >> >> >> https://github.com/jjzha

[Pacemaker] Log STDERR from OCF scripts

2012-12-21 Thread Michal Fiala
Hallo, we use corosyng logging via syslog (to_logfile: no; to_syslog: yes; syslog_facility: local0; debug: on). Some OCF scripts do not use OCF API to execute commands. I mean function ocf_run, which capture STDOUT and STDERR. For example linbit/drbd uses its own function to execute do_cmd(), whic