Re: [Pacemaker] do_lrm_control: Failed to sign on to the LRM repeatedly!

2010-11-19 Thread Andrew Beekhof
On Fri, Nov 19, 2010 at 12:08 AM, Dave Williams wrote: > I have a problem with a cluster that won't start up. It is running a 2 node > failover (master slave) clustered ftp server using drbd to duplicate the > filesystem. > > Upgraded from 10.04 Lucid to 10.10 Maverick to obtain support for > upstart

[Pacemaker] About the log output in the V1 mode of pacemaker 1.1

2010-11-19 Thread nozawat
Hi, I write logs to /var/log/ha.log with the following settings. Nothing is written to /var/log/ha.log when I run this in V1 mode. Are these settings not enough for V1 mode? --- /etc/corosync/corosync.conf --- service { name: pacemaker ver: 0 } logging { fileline: on to

Re: [Pacemaker] [FYI] Required errata for RHEL5

2010-11-19 Thread Pavlos Parissis
On 19 November 2010 05:21, Keisuke MORI wrote: > Hi all, > > For the information of Red Hat users: > > As a conclusion of testing in my company, we consider that > the following errata should be applied on RHEL5.5 or below in order to > get Pacemaker to work more stably. > http://rhn.redhat.com/erra

Re: [Pacemaker] do_lrm_control: Failed to sign on to the LRM repeatedly!

2010-11-19 Thread Dave Williams
> > This install is seriously sick. > Multiple copies of all our daemons. > > If I had to guess, I'd say there were version incompatibilities > between the various cluster packages. > OK - How best to investigate? I quoted the versions of the packages in my original post.

[Pacemaker] Problem with CRMD restart

2010-11-19 Thread JiaQiang Xu
Hi, I'm using pacemaker 1.0.9 and corosync 1.2.7. Recently I found a problem with CRMD restart. If CRMD crashes or is manually killed, corosync will currently try to restart it up to 100 times (done in lib/ais/plugin.c). But what if CRMD becomes so buggy (or due to some environmental factor) that it

[Pacemaker] UDPU transport patch added, when will the RPMs be available

2010-11-19 Thread Dan Frincu
Hi, The subject is pretty self-explanatory but I'll ask anyway, the patch for UDPU has been released, this adds the ability to set unicast peer addresses of nodes in a cluster, in network environments where multicast is not an option. When will it be available as an RPM? If I'm barking up th

Re: [Pacemaker] About the log output in the V1 mode of pacemaker 1.1

2010-11-19 Thread Dejan Muhamedagic
Hi, On Fri, Nov 19, 2010 at 04:48:16PM +0900, nozawat wrote: > Hi, > > I write logs to /var/log/ha.log with the following settings. > Nothing is written to /var/log/ha.log when I run this in V1 mode. > Are these settings not enough for V1 mode? V1 mode is Heartbeat only, no pacemaker,

Re: [Pacemaker] crm resource restart fails to restart the service

2010-11-19 Thread Dejan Muhamedagic
Hi, On Thu, Nov 18, 2010 at 01:35:24PM -0500, Vadym Chepkov wrote: > On Wed, Nov 17, 2010 at 1:03 PM, Dejan Muhamedagic > wrote: > >> > > >> > Funny, it worked here for me every time for apache, Dummy, > >> > Delay, stonith resources. With both pacemaker 1.0 and 1.1. > >> > > >> >> To test it ri

Re: [Pacemaker] do_lrm_control: Failed to sign on to the LRM repeatedly!

2010-11-19 Thread Dejan Muhamedagic
Hi, On Fri, Nov 19, 2010 at 08:26:59AM +, Dave Williams wrote: > > > > This install is seriously sick. > > Multiple copies of all our daemons. > > > > If I had to guess, I'd say there were version incompatibilities > > between the various cluster packages. > > > > OK - How best to investig

Re: [Pacemaker] do_lrm_control: Failed to sign on to the LRM repeatedly!

2010-11-19 Thread Dave Williams
On 14:29, Fri 19 Nov 10, Dejan Muhamedagic wrote: > Hi, > > On Fri, Nov 19, 2010 at 08:26:59AM +, Dave Williams wrote: > > > > > > This install is seriously sick. > > > Multiple copies of all our daemons. > > > > > > If I had to guess, I'd say there were version incompatibilities > > > betwe

Re: [Pacemaker] do_lrm_control: Failed to sign on to the LRM repeatedly!

2010-11-19 Thread Dejan Muhamedagic
Hi, On Fri, Nov 19, 2010 at 01:43:44PM +, Dave Williams wrote: > On 14:29, Fri 19 Nov 10, Dejan Muhamedagic wrote: > > Hi, > > > > On Fri, Nov 19, 2010 at 08:26:59AM +, Dave Williams wrote: > > > > > > > > This install is seriously sick. > > > > Multiple copies of all our daemons. > > >

Re: [Pacemaker] About the log output in the V1 mode of pacemaker 1.1

2010-11-19 Thread nozawat
Hi Dejan, I am sorry that my message was hard to understand. I used the second configuration described on the following page. However, no log was output at all. < http://theclusterguy.clusterlabs.org/post/907043024/introducing-the-pacemaker-master-control-process-for > The configuration is

Re: [Pacemaker] About the log output in the V1 mode of pacemaker 1.1

2010-11-19 Thread Dejan Muhamedagic
Hi, On Fri, Nov 19, 2010 at 11:12:09PM +0900, nozawat wrote: > Hi Dejan, > > I am sorry that my message was hard to understand. > I used the second configuration described on the following page. > However, no log was output at all. > < > http://theclusterguy.clusterlabs.org/post/907043024/i

Re: [Pacemaker] Stonith Device APC AP7900

2010-11-19 Thread Dejan Muhamedagic
Hi, On Wed, Nov 17, 2010 at 05:01:20PM -0600, Andrew Daugherity wrote: > > Message: 3 > > Date: Tue, 16 Nov 2010 11:24:26 +0100 > > From: Dejan Muhamedagic > > To: The Pacemaker cluster resource manager > > > > Subject: Re: [Pacemaker] Stonith Device APC AP7900 > > Message-ID: <2010111610242

Re: [Pacemaker] Stonith Device APC AP7900

2010-11-19 Thread Dejan Muhamedagic
Hi, On Thu, Nov 18, 2010 at 06:45:01AM +0200, Chris Picton wrote: > > > On 2010/11/18 1:01 AM, Andrew Daugherity wrote: > In production I am planning to have 2 separate AP7900 units each plugged > into 2 different APC UPS units to achieve that. I would then have the > single node name

Re: [Pacemaker] UDPU transport patch added, when will the RPMs be available

2010-11-19 Thread Andrew Beekhof
On Fri, Nov 19, 2010 at 11:38 AM, Dan Frincu wrote: > Hi, > > The subject is pretty self-explanatory but I'll ask anyway, the patch for > UDPU has been released, this adds the ability to set unicast peer addresses > of nodes in a cluster, in network environments where multicast is not an > option.

[Pacemaker] stonith monitor timeout not restartable

2010-11-19 Thread Ron Kerry
I have a customer running pacemaker/openais from the SLE11-HAE distribution. On occasion a stonith clone instance times out. It becomes unrestartable and is not generally recoverable unless we completely stop openais on both nodes and restart. I have an example from the logs below. Is th

Re: [Pacemaker] Stonith Device APC AP7900

2010-11-19 Thread Rick Cone
Andrew, Thanks for the good information. I am able to make this work. I set up groups on the 2 AP7900's with Outlet 1 for the first system and Outlet 2 the second (on each) and gave them the node names. Then, I created a group for each (with the same node name), and also made all the ports

Re: [Pacemaker] UDPU transport patch added, when will the RPMs be available

2010-11-19 Thread Steven Dake
On 11/19/2010 11:42 AM, Andrew Beekhof wrote: > On Fri, Nov 19, 2010 at 11:38 AM, Dan Frincu wrote: >> Hi, >> >> The subject is pretty self-explanatory but I'll ask anyway, the patch for >> UDPU has been released, this adds the ability to set unicast peer addresses >> of nodes in a cluster, in net