Re: [Pacemaker] [Problem] The attrd does not sometimes stop.

2012-01-09 Thread renayama19661014
Hi Lars, I attach strace file when a problem reappeared at the end of last year. I used glue which applied your patch for confirmation. It is the file which I picked with attrd by strace -p command right before I stop Heartbeat. Finally SIGTERM caught it, but attrd did not stop. The attrd stopp

Re: [Pacemaker] SBD stonith issues in RHEL cluster

2012-01-09 Thread Qiu Zhigang
Hi, Could anybody help me ? Thank u. Best Regards, Qiu Zhigang From: Qiu Zhigang [mailto:qiuzhig...@fronware.com] Sent: Monday, January 09, 2012 4:30 PM To: 'The Pacemaker cluster resource manager' Subject: [Pacemaker] SBD stonith issues in RHEL cluster Hi, All I want to use

Re: [Pacemaker] SBD stonith issues in RHEL cluster

2012-01-09 Thread Qiu Zhigang
Hi, Forgot the version of RHCS. corosync-1.4.1-3.el6.x86_64 pacemaker-1.1.5-8.el6.x86_64 Best Regards, Qiu Zhigang From: Qiu Zhigang [mailto:qiuzhig...@fronware.com] Sent: Monday, January 09, 2012 4:30 PM To: 'The Pacemaker cluster resource manager' Subject: [Pacemaker] SBD stoni

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Jake Smith
Andrew, Curious - Did you check the version before you applied the fix I posted? No idea on the long shutdown time (mine wasn't even shutting down before the fix - now it's under 30 seconds)... What version of corosync? pacemaker? Jake - Original Message - > From: "Andrew Martin"

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Andrew Martin
Hi Jake, I applied the fix you posted, rebooted my system, and I am now able to create primitives from the crm shell! I have confirmed that the ppa version of libglib2.0-0 is the one that is installed (2.24.1-0ubuntu1.1~ppa1). I am not sure if it is a side-effect of this problem or not but

Re: [Pacemaker] Remote CRM shell from LCMC

2012-01-09 Thread Rasto Levrinc
On Mon, Jan 9, 2012 at 10:23 PM, Dejan Muhamedagic wrote: > Hi Rasto, > > On Wed, Dec 28, 2011 at 12:57:33AM +0100, Rasto Levrinc wrote: >> Hi, >> >> this being a slow news day, There is this great new feature in LCMC, but >> probably completely useless. :) The LCMC used to show for testing purpos

Re: [Pacemaker] Remote CRM shell from LCMC

2012-01-09 Thread Dejan Muhamedagic
Hi Rasto, On Wed, Dec 28, 2011 at 12:57:33AM +0100, Rasto Levrinc wrote: > Hi, > > this being a slow news day, There is this great new feature in LCMC, but > probably completely useless. :) The LCMC used to show for testing purposes > the CRM shell configuration, but people started to use it, so

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Jake Smith
Andrew - my solution below may have been a premature answer. It may have only applied to something on my system that wasn't right. First thing would be to check and see if you have the correct libglib2.0-0 version: 2.24.1-0ubuntu1.1~ppa1 If you do than disregard below. Jake - Original M

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Jake Smith
- Original Message - > From: "Rasto Levrinc" > To: "The Pacemaker cluster resource manager" > Sent: Monday, January 9, 2012 2:12:54 PM > Subject: Re: [Pacemaker] Cannot Create Primitive in CRM Shell > > On Mon, Jan 9, 2012 at 3:34 PM, Andrew Martin > wrote: > > Hi Florian, > > > > Thank

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Rasto Levrinc
On Mon, Jan 9, 2012 at 3:34 PM, Andrew Martin wrote: > Hi Florian, > > Thanks for the quick response. This is a fresh install of > pacemaker/heartbeat on two VMs so it should not have any previous/corrupted > configuration (Ubuntu 10.04 amd64). I had previously deployed pacemaker on > alternative

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Jake Smith
> Date: Mon, 9 Jan 2012 16:37:58 +0100 > From: Florian Haas > To: The Pacemaker cluster resource manager > > Subject: Re: [Pacemaker] Cannot Create Primitive in CRM Shell > Message-ID: > > Content-Type: text/plain; charset=UTF-8 > > On Mon, Jan 9, 2012 at 4:10 PM, Andrew Martin > w

Re: [Pacemaker] syslog full of redundand link messages

2012-01-09 Thread Andreas Kurz
Hello, On 01/09/2012 04:43 PM, Attila Megyeri wrote: > Hi, > > Thanks Florian, Dan. > > Yes, there was a mistake, I changed the bindaddress to 10.100.1.0 - but it > wasn't an issue as the subnet is /8 for some other reasons. > > Anyway those errors are still coming once a second, but not on ev

Re: [Pacemaker] syslog full of redundand link messages

2012-01-09 Thread Attila Megyeri
Hi, Thanks Florian, Dan. Yes, there was a mistake, I changed the bindaddress to 10.100.1.0 - but it wasn't an issue as the subnet is /8 for some other reasons. Anyway those errors are still coming once a second, but not on every node. Any indication where I should start troubleshooting? When ar

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Florian Haas
On Mon, Jan 9, 2012 at 4:10 PM, Andrew Martin wrote: > Perhaps as a corollary problem I have noticed that I cannot seem to start or > restart pacemaker: > # service pacemaker restart > Starting Pacemaker Cluster Manager: [FAILED] You're running on Heartbeat. The "pacemaker" init script manages th

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Andrew Martin
Perhaps as a corollary problem I have noticed that I cannot seem to start or restart pacemaker: # service pacemaker restart Starting Pacemaker Cluster Manager: [FAILED] # tail /var/log/daemon.log Jan 9 09:04:37 webapps1 pacemakerd: [5725]: info: Invoked: pacemakerd Jan 9 09:04:37 webapps1 pa

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Andrew Martin
Hi Florian, Thanks for the quick response. This is a fresh install of pacemaker/heartbeat on two VMs so it should not have any previous/corrupted configuration (Ubuntu 10.04 amd64). I had previously deployed pacemaker on alternative copies of these VM images, but both of those have since been

Re: [Pacemaker] syslog full of redundand link messages

2012-01-09 Thread Florian Haas
On Mon, Jan 9, 2012 at 3:15 PM, Attila Megyeri wrote: > Hi, > > I might be taking something wrong, but, > > bindnetaddr: 10.100.1.255 > > does not mean it will listen on this address, but will listen on every > interface where this mask matches. > This is just to make the config file simpler and

Re: [Pacemaker] syslog full of redundand link messages

2012-01-09 Thread Attila Megyeri
Hi, I might be taking something wrong, but, bindnetaddr: 10.100.1.255 does not mean it will listen on this address, but will listen on every interface where this mask matches. This is just to make the config file simpler and common for all nodes in the same subnet. Or am I taking something t

Re: [Pacemaker] Resource "ping" fails on passive node after upgrading to second nic

2012-01-09 Thread Florian Haas
On Mon, Jan 9, 2012 at 2:01 PM, Senftleben, Stefan (itsc) wrote: > This is the cibadmin dump of the active one: > http://pastebin.com/Yg4Jsaxy You would see this in a "crm_mon -rf": Failed actions: pri_ping:1_start_0 (node=lxds05, call=-1, rc=1, status=Timed Out): unknown error "Timed out"

[Pacemaker] NFSv4 Cluster - Creating NFS resource in fedora 16 fails

2012-01-09 Thread Vogelsang, Andreas
Hello together! I’m trying to set up an NFSv4 Cluster. As operating system I choose Fedora 16. I’m following this Manual from LINBIT: “Highly available NFS storage with DRBD and Pacemaker” (http://www.linbit.com/en/education/tech-guides/highly-available-nfs-with-drbd-and-pacemaker/) But at the

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Dan Frincu
Hi, On Mon, Jan 9, 2012 at 1:44 PM, Florian Haas wrote: > On Mon, Jan 9, 2012 at 11:42 AM, Dan Frincu wrote: >> Hi, >> >> On Fri, Jan 6, 2012 at 11:24 PM, Andrew Martin wrote: >>> Hello, >>> >>> I am working with DRBD + Heartbeat + Pacemaker to create a 2-node >>> highly-available cluster. I ha

Re: [Pacemaker] Resource "ping" fails on passive node after upgrading to second nic

2012-01-09 Thread Senftleben, Stefan (itsc)
Hello Florian, okay, I will try to improve in giving better error reports. That is an error in the corosync log on the active node: Jan 09 13:49:51 lxds07 crmd: [1360]: info: te_rsc_command: Initiating action 2: stop pri_ping:1_stop_0 on lxds05 "corosync-objctl | grep member" brings no output on

Re: [Pacemaker] Resource "ping" fails on passive node after upgrading to second nic

2012-01-09 Thread Florian Haas
Stefan, sorry, your report triggers a complete -EPARSE in my brain. On Mon, Jan 9, 2012 at 10:38 AM, Senftleben, Stefan (itsc) wrote: > Hello everybody, > > last week I installed and configured in each cluster node a second network > interface. > After configuring the corosync.cfg the passive n

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Florian Haas
On Mon, Jan 9, 2012 at 11:42 AM, Dan Frincu wrote: > Hi, > > On Fri, Jan 6, 2012 at 11:24 PM, Andrew Martin wrote: >> Hello, >> >> I am working with DRBD + Heartbeat + Pacemaker to create a 2-node >> highly-available cluster. I have been following this official guide on >> DRBD's website for conf

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Dan Frincu
Hi, On Fri, Jan 6, 2012 at 11:24 PM, Andrew Martin wrote: > Hello, > > I am working with DRBD + Heartbeat + Pacemaker to create a 2-node > highly-available cluster. I have been following this official guide on > DRBD's website for configuring all of the components: > http://www.linbit.com/fileadm

Re: [Pacemaker] syslog full of redundand link messages

2012-01-09 Thread Dan Frincu
Hi, On Sun, Jan 8, 2012 at 1:59 AM, Attila Megyeri wrote: > Hi All, > > > > My syslogs are full of messages like this: > > > > Jan  7 23:55:47 oa2 corosync[362]:   [TOTEM ] received message requesting > test of ring now active > > Jan  7 23:55:48 oa2 corosync[362]:   [TOTEM ] received message req

[Pacemaker] Resource "ping" fails on passive node after upgrading to second nic

2012-01-09 Thread Senftleben, Stefan (itsc)
Hello everybody, last week I installed and configured in each cluster node a second network interface. After configuring the corosync.cfg the passive node stops the primative ping (three ping targets). totem { version: 2 token: 3000 token_retransmits_before_loss_const: 10

[Pacemaker] SBD stonith issues in RHEL cluster

2012-01-09 Thread Qiu Zhigang
Hi, All I want to use SBD device as a stonith device in RHCS, but how could I configure sbd resource agent? I use the following command, primitive sbd_fence stonith:external/sbd params sbd_device="/dev/disk/by-id/scsi-3300035230a3a" but a error occurred, ERROR: sbd_fence: p