Re: [Pacemaker] "pcs cluster status" options seems to not work

2014-02-21 Thread Bob Haxo
show ". But not much easier: using "grep --after-context=1" works for the checking the output of "pcs status". A bit of mental laziness on my part. And I had gotten accustomed to the crm output. Bob Haxo On Fri, 2014-02-21 at 01:47 -0500, Chris Feist wrote: >

Re: [Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-21 Thread Bob Haxo
https://savannah.nongnu.org/bugs/index.php?41660 On Fri, 2014-02-21 at 10:53 +0100, Kristoffer Grönlund wrote: > On Fri, 21 Feb 2014 08:51:58 +0100 > Lars Marowsky-Bree wrote: > > > On 2014-02-20T16:03:36, Bob Haxo wrote: > > > > > Sooo, seems that we need to k

Re: [Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-21 Thread Bob Haxo
Kristoffer, Lars, I will file the issue. We need to track this and link with our internal bug report. Thanks Bob Haxo On Fri, 2014-02-21 at 10:53 +0100, Kristoffer Grönlund wrote: > On Fri, 21 Feb 2014 08:51:58 +0100 > Lars Marowsky-Bree wrote: > > > On 2014-02-20T16:

[Pacemaker] "pcs cluster status" options seems to not work

2014-02-20 Thread Bob Haxo
Resources configured PCSD Status: Error: no nodes found in corosync.conf Bob Haxo ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting

Re: [Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-20 Thread Bob Haxo
On Fri, 2014-02-21 at 10:46 +1100, Andrew Beekhof wrote: > On 21 Feb 2014, at 10:25 am, Bob Haxo wrote: > > > Andrew, > > > >> From my vantage point, it will be most unfortunate if pcs and crm cannot > > both be used for routine start and stop of resou

Re: [Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-20 Thread Bob Haxo
fic interfaces. Regards, Bob Haxo On Fri, 2014-02-21 at 09:26 +1100, Andrew Beekhof wrote: > On 21 Feb 2014, at 4:30 am, Bob Haxo wrote: > > > Andrew, Lars, > > > > Yes, I have determined that this error is the result of mixing crmsh > stop/start with pcs disable/enabl

Re: [Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-20 Thread Bob Haxo
olved. Bob Haxo On Thu, 2014-02-20 at 07:56 -0800, Bob Haxo wrote: > Andrew, > > I think that this is what you identified: > > This error appears to be related to the two instances of > with > libvirtd-clone, both which contain a name="target-role" value

Re: [Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-20 Thread Bob Haxo
stop" rather than a "pcs disable". Regardless, even if RH is not going to support crmsh, at least RH needs to be sensitive to the use of crmsh with their PCS Pacemaker implementation. I'll test/investigate the "crm stop" and see whether this creates the additional entry

Re: [Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-19 Thread Bob Haxo
Chris, was easy to duplicate ... I thought that I had cleared the error, but that had not happened. Bob Haxo [root@mici-admin ~]# pcs resource disable virt [root@mici-admin ~]# pcs resource disable libvirtd-clone Error: Error performing operation: Invalid argument Multiple attributes match name

Re: [Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-19 Thread Bob Haxo
Chris, I'll try to get the cluster back into the state where the issue was observed. Bob Haxo On Wed, 2014-02-19 at 15:23 -0600, Chris Feist wrote: > On 02/19/2014 11:16 AM, Bob Haxo wrote: > > Encountered this error with pcs but not with crm. Looks like a regex > > erro

[Pacemaker] possible regex error in "pcs resource enable/disable"

2014-02-19 Thread Bob Haxo
Error performing operation: Invalid argument Multiple attributes match name=target-role Value: Stopped(id=libvirtd-clone-meta-target-role) Value: Stopped(id=libvirtd-clone-meta_attributes-target-role) xml: R

Re: [Pacemaker] Manual fence confirmation by stonith_admin doesn't work again.

2014-01-21 Thread Bob Haxo
found "fence_ack_manual server" to successfully provided the needed ack that stonith has been successful. Bob Haxo On Mon, 2014-01-13 at 14:58 +1100, Andrew Beekhof wrote: > On 10 Jan 2014, at 3:54 pm, Nikita Staroverov wrote: > > > > >>>> There is

Re: [Pacemaker] command to dump cluster configuration in "pcs" format?

2014-01-16 Thread Bob Haxo
On Fri, 2014-01-17 at 09:15 +1100, Andrew Beekhof wrote: > On 17 Jan 2014, at 9:05 am, Lars Marowsky-Bree wrote: > > > On 2014-01-17T07:40:34, Andrew Beekhof wrote: > > > >>> Well, unless RHT states that installing crmsh on top of their > >>> distribution invalidates support for the pacemaker

Re: [Pacemaker] command to dump cluster configuration in "pcs" format?

2014-01-16 Thread Bob Haxo
Hi Lars, On Thu, 2014-01-16 at 12:59 +0100, Lars Marowsky-Bree wrote: > On 2014-01-15T20:25:30, Bob Haxo wrote: > > > Unfortunately, it configuration has taken me weeks to develop (what now > > seems to be) a working configuration (including mods to the > > VirtualDomain

Re: [Pacemaker] command to dump cluster configuration in "pcs" format?

2014-01-15 Thread Bob Haxo
On Thu, 2014-01-16 at 12:32 +1100, Andrew Beekhof wrote: > On 16 Jan 2014, at 11:49 am, Bob Haxo wrote: > > >> On 01/15/2014 05:02 PM, Bob Haxo wrote: > >> > Greetings, > >> > > >> > The command "crm configure show" dumps the clus

Re: [Pacemaker] command to dump cluster configuration in "pcs" format?

2014-01-15 Thread Bob Haxo
> On 01/15/2014 05:02 PM, Bob Haxo wrote: > > Greetings, > > > > The command "crm configure show" dumps the cluster configuration in a > > format > > that is suitable for use in configuring a cluster. > > > > The command "pcs config&

[Pacemaker] command to dump cluster configuration in "pcs" format?

2014-01-15 Thread Bob Haxo
er. Is there a "pcs" command analogous to the "crm" command that dumps the cluster configuration in "pcs" format? Regards, Bob Haxo ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.

Re: [Pacemaker] reboot of non-vm host results in VM restart -- of chickens and eggs and VMs

2014-01-07 Thread Bob Haxo
to > cleanup and my vm has rebooted. And the same issue raised by myself earlier: > From: Bob Haxo > Subject: [Pacemaker] GFS2 with Pacemaker on RHEL6.3 restarts with > reboot > Date: Wed, 8 Aug 2012 19:14:31 -0700 > ... > > Problem: When the the non-VM-host is rebooted

Re: [Pacemaker] "stonith_admin -F node" results in a pair of reboots

2014-01-07 Thread Bob Haxo
with HA software. I'll let you know how this goes. Thanks, Bob Haxo On Tue, 2014-01-07 at 09:21 +0100, Fabio M. Di Nitto wrote: > On 1/6/2014 6:24 PM, Bob Haxo wrote: > > Hi Fabio, > > > >>> There is an example on how to configure gfs2 also in the rhel6.5 >

Re: [Pacemaker] "stonith_admin -F node" results in a pair of reboots

2014-01-06 Thread Bob Haxo
Hi Fabio, > > There is an example on how to configure gfs2 also in the rhel6.5 > > pacemaker documentation, using pcs. Super! Please share the link to this documentation. I only discovered the gfs2+pcs example with the rhel7 beta docs. Bob Haxo On Sat, 2014-01-04 at 16:56 +0

Re: [Pacemaker] "stonith_admin -F node" results in a pair of reboots

2014-01-03 Thread Bob Haxo
Digimer, Yes, for the configuration that includes drbd, 'crm-fence-peer.sh' and 'resource-and-stonith' are included in the configuration. Thanks, Bob Haxo On Wed, 2014-01-01 at 01:04 -0500, Digimer wrote: > Did you hook DRBD into pacemaker's fencing using '

Re: [Pacemaker] "stonith_admin -F node" results in a pair of reboots

2013-12-31 Thread Bob Haxo
nding" elsewhere (crm_mon?) without considering that it might be fencing that is pending. Interesting. Thanks & my best wishes for a healthy new year. Bob Haxo On Wed, 2014-01-01 at 00:19 -0500, Digimer wrote: > This is probably because cman (which is it's own cluster stac

[Pacemaker] "stonith_admin -F node" results in a pair of reboots

2013-12-31 Thread Bob Haxo
ays the reboot of the fencing node happens? 2) Or, maybe I didn't correctly duplicate the Chapter 6 example? 3) Or, perhaps something is wrong or omitted from the Chapter 6 example? Suggestions will be much appreciated. Thanks, Bob Haxo RHEL6.5 pacemaker-cli-1.1.10-14.el6_5.1.x86_64 crmsh

Re: [Pacemaker] reboot of non-vm host results in VM restart -- of chickens and eggs and VMs

2013-12-19 Thread Bob Haxo
rt of libvirt, with more care. Thanks, Bob Haxo On Thu, 2013-12-19 at 21:30 +0100, emmanuel segura wrote: > remove the libvirtd from pacemaker and chkconfig libvirtd on every > node, like that the cluster just manage the vm, maybe i wrong but i > don't see any reason for put libvirt

Re: [Pacemaker] reboot of non-vm host results in VM restart -- of chickens and eggs and VMs

2013-12-19 Thread Bob Haxo
ls with the failing logic loop, resulting in a re-start of the VM. I hoping that "Unfortunately we still don't have a good answer for you." is no longer the case, and that there is a fix or that there is a community accepted workaround for the issue. Regards, Bob Haxo On Thu, 20

[Pacemaker] reboot of non-vm host results in VM restart -- of chickens and eggs and VMs

2013-12-19 Thread Bob Haxo
t of the VM does not occur with the reboot of the non-host system? Regards, Bob Haxo ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org G

Re: [Pacemaker] racing crm commands... last write wins?

2013-03-20 Thread Bob Haxo
Any suggestions for avoiding replacing rather than incrementally modifying the configuration? Thanks, Bob Haxo SGI On Mon, 2013-03-04 at 17:25 +0100, Lars Marowsky-Bree wrote: > On 2013-03-04T17:14:28, Dejan Muhamedagic wrote: > > > > Thought so at the time, yes. And I do think

Re: [Pacemaker] crm in RHEL 6.4 ... where are you?

2013-02-21 Thread Bob Haxo
Digimer Thanks for the info. I missed this change, and my codes are now borken. Huge mistake by me. Hopefully the crm syntax has not changed, or not changed significantly. Also, thanks for the "2-Node Red Hat KVM Cluster Tutorial". Extremely helpful for some of our work. Bob Hax

[Pacemaker] crm in RHEL 6.4 ... where are you?

2013-02-21 Thread Bob Haxo
Greetings, Anyone know where "crm" is in RHEL 6.4, or in the most recent set of RHEL 6.3 updates? crm is not included in the latest pacemaker-cli package: pacemaker-cli-1.1.8-7.el6.x86_64.rpm Bob Haxo SGI ___ Pacemaker mailing list:

[Pacemaker] remaining node hang & sysconfig/cman CMAN_LEAVE_OPTS="remove"

2012-08-20 Thread Bob Haxo
ng node cluster functions hanging when the other node is halted or rebooted. For the record, "service cman stop remove" works. The Pacemaker no-quorum-policy="ignore" seems to not affect this behavior. One could imagine that setting that policy on a CMAN/Pacemaker cluster would

Re: [Pacemaker] chain/cascade stonith agents?

2012-08-16 Thread Bob Haxo
On Thu, 2012-08-16 at 09:37 +1000, Andrew Beekhof wrote: > On Thu, Aug 16, 2012 at 1:59 AM, Bob Haxo wrote: > > HI All, > > > > Is chaining/cascading of stonith agents implemented? > > Yes. But you'll want to use the current git HEAD > > > If yes,

[Pacemaker] chain/cascade stonith agents?

2012-08-15 Thread Bob Haxo
d return a "stonith success"? I was thinking that I would chain stonith_ipmilan with the ever popular stonith_null to achieve this end. Cheers, Bob Haxo bhaxo at sgi.com ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clu

Re: [Pacemaker] GFS2 with Pacemaker on RHEL6.3 restarts with reboot

2012-08-12 Thread Bob Haxo
Thanks Andrew, I'll check out the latest git and give building a try. Cheers, Bob Haxo On Mon, 2012-08-13 at 12:09 +1000, Andrew Beekhof wrote: > On Mon, Aug 13, 2012 at 11:27 AM, Bob Haxo wrote: > > I had > > not connected the dots regarding use of cman being an *earlier

Re: [Pacemaker] GFS2 with Pacemaker on RHEL6.3 restarts with reboot

2012-08-12 Thread Bob Haxo
On Fri, 2012-08-10 at 12:21 +1000, Andrew Beekhof wrote: > On Thu, Aug 9, 2012 at 12:14 PM, Bob Haxo wrote: > > Greetings. > > > > I have followed the setup instructions of Clusters From Scratch : > > Creating Active/Passive and Active/Active Clusters on Fedora, Editio

[Pacemaker] GFS2 with Pacemaker on RHEL6.3 restarts with reboot

2012-08-08 Thread Bob Haxo
: migration-threshold=100 + (12) start: rc=0 (ok) + (13) monitor: interval=24ms rc=0 (ok) virt: migration-threshold=100 + (120) start: rc=0 (ok) + (121) monitor: interval=1ms rc=0 (ok) Thanks for reading ... Bob Haxo bh...@sgi.com node hikari node hikari2 primitive

Re: [Pacemaker] Patch for bugzilla 2541: Shell should warn if parameter uniqueness is violated

2011-06-29 Thread Bob Haxo
Dejan, Thanks ... you are correct ... probably a little Force is acceptable as a quick-and-dirty fix. I'll get a bug filed with Novell. Bob Haxo On Wed, 2011-06-29 at 14:35 +0200, Dejan Muhamedagic wrote: > Hi, > > On Tue, Jun 28, 2011 at 06:10:20PM -0700, Bob Haxo wrote: &

Re: [Pacemaker] Patch for bugzilla 2541: Shell should warn if parameter uniqueness is violated

2011-06-28 Thread Bob Haxo
eness seems to solve the issue. For example, setting unique to 0: Is proper protocol for me to file a bug with Novell as this is code from Novell? Regards, Bob Haxo On Tue, 2011-06-28 at 17:53 -0700, Bob Haxo wrote: > All, > > I'm seeing a problem referred to in this thread:

Re: [Pacemaker] Patch for bugzilla 2541: Shell should warn if parameter uniqueness is violated

2011-06-28 Thread Bob Haxo
id": "X" Do you still want to commit? yes crm(live)configure# The problem is that I want to perform this operation as "crm < commands", which returns the warnings and assumes a "NO" to the question. I have current Novell updates applied, which includes th

Re: [Pacemaker] Node doesn't rejoin automatically after reboot - POSSIBLE CAUSE

2011-01-14 Thread Bob Haxo
headnode of these HA clusters, but no "crm_abort" there either. Are there (by default) any logs other than in /var/log? Bob Haxo On Fri, 2011-01-14 at 13:50 +0100, Andrew Beekhof wrote: > On Thu, Jan 13, 2011 at 9:31 PM, Bob Haxo wrote: > > Hi Tom (and Andrew), > > &

Re: [Pacemaker] Howto write a STONITH agent

2011-01-13 Thread Bob Haxo
Hi Christoph, Have you taken a look in /usr/lib64/stonith/plugins/external? The "ipmi" plugin might serve as a coding example/template. Or maybe the "drac5" plugin. At first glance, "drac5" appears to be using ssh. Bob Haxo On Thu, 2011-01-13 at 21:09 +0100,

Re: [Pacemaker] Node doesn't rejoin automatically after reboot - POSSIBLE CAUSE

2011-01-13 Thread Bob Haxo
pengine streams and that too came online. Regards, Bob Haxo On Thu, 2011-01-13 at 11:15 -0800, Bob Haxo wrote: > So, Tom ...how do you get the failed node online? > > I've re-installed with the same image that is running on three other > nodes, but still fails. This node was qu

Re: [Pacemaker] Node doesn't rejoin automatically after reboot

2011-01-13 Thread Bob Haxo
pacemaker-updates through > the zypper-update-channel available (sles11-sp1). > > Regards, > Tom > > > 2011/1/13 Bob Haxo : > > Tom, others, > > > > Please, what was the solution to this issue? > > > > Thanks, > > Bob Haxo > > >

Re: [Pacemaker] Node doesn't rejoin automatically after reboot

2011-01-13 Thread Bob Haxo
Tom, others, Please, what was the solution to this issue? Thanks, Bob Haxo On Mon, 2010-09-06 at 09:50 +0200, Tom Tux wrote: > Yes, corosync is running after the reboot. It comes up with the > regular init-procedure (runlevel 3 in my case). > > 2010/9/6 Andrew Beekhof : >

Re: [Pacemaker] streamed writes fail with migration for NFS v3 over TCP

2009-05-20 Thread Bob Haxo
uot; ... as in an obvious culprit to the crime, and my manipulations of "timeo" and "retrans" have not solved the issue. Anyone have any ideas why NFSv3 over TCP reads should be successful across 100s of migrations and failovers, but writes bomb? Thanks, Bob Haxo SGI On Wed,

Re: [Pacemaker] trigger STONITH for testing purposes

2009-05-20 Thread Bob Haxo
led to filesystem corruption. Very messy. (this is a test-only setup, so no real data is present) So, no, the change that I made was neither deleting nor setting no-quorum-policy=stop. Setting no-quorum-policy=ignore seems to be required for the cluster to support migrations and failovers. Chee

Re: [Pacemaker] streamed writes fail with migration for NFS v3 over TCP

2009-05-20 Thread Bob Haxo
reads and writes are handled. Someone with more NFS internals experience maybe could point out where the problem is occurring and propose a workaround or fix. Anyone have any suggestions? Cheers, Bob Haxo SGI On Tue, 2009-05-19 at 21:04 -0500, Karl Katzke wrote: > Bob - > > > No,

Re: [Pacemaker] trigger STONITH for testing purposes

2009-05-19 Thread Bob Haxo
the suggestions. Cheers, Bob Haxo On Tue, 2009-05-19 at 14:03 +0200, Andrew Beekhof wrote: > On Mon, May 18, 2009 at 8:12 PM, Bob Haxo wrote: > > > > Any suggestions as to what needs changing so that the stonith deathmarch can > > be avoided? > > If you only have

[Pacemaker] streamed writes fail with migration for NFS v3 over TCP

2009-05-19 Thread Bob Haxo
migrations working for NFS over TCP? Suggestions? Cheers, Bob haxo SGI ___ Pacemaker mailing list Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Re: [Pacemaker] trigger STONITH for testing purposes

2009-05-18 Thread Bob Haxo
And, the two stonith resources: And the relevant pair of constraints: Any suggestions as to what needs changing so that the stonith deathmarch can be avoided? Cheers and thanks, Bob Haxo SGI On Fri, 2009-05

Re: [Pacemaker] trigger STONITH for testing purposes

2009-05-15 Thread Bob Haxo
uggestions? Cheers, Bob Haxo SGI On Fri, 2009-05-15 at 16:53 -0700, Bob Haxo wrote: > Greetings, > > What manual administrative actions can be used to trigger STONITH > action? > > I have created a pair of STONITH resources (external/ipmi) and would > like to test that these re

[Pacemaker] trigger STONITH for testing purposes

2009-05-15 Thread Bob Haxo
Greetings, What manual administrative actions can be used to trigger STONITH action? I have created a pair of STONITH resources (external/ipmi) and would like to test that these resources work as expected (which, if I understand the default correctly, is to reboot the node). Thanks, Bob Haxo