Re: [Pacemaker] how to test network access and fail over accordingly?

2010-10-06 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 10:21 PM, Craig Hurley wrote: > I tried using ping instead of pingd and I added "number" to the > evaluation, I get the same results :/ > > primitive p_ping ocf:pacemaker:ping params host_list=172.20.0.254 > clone c_ping p_ping meta globally-unique=false > location loc_ping

Re: [Pacemaker] resource stop timeout broken in 1.0 branch tip

2010-10-06 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 11:29 AM, Keisuke MORI wrote: > 2010/10/6 Andrew Beekhof : >>> Is there more changesets >>> that need to be backported regarding to this issues? >> >> There is now that Andreas brought the problem to my attention :-) >>   http://hg.clusterlabs.org/pacemaker/1.1/rev/e097c7022

[Pacemaker] [Problem]The monitor that start-delay is long does not stop.

2010-10-06 Thread renayama19661014
Hi, I operated the next to confirm the contribution of the mailing list. * http://www.gossamer-threads.com/lists/linuxha/pacemaker/66939 Step1) I prepare cib.xml having monitor which set start-delay than five minutes.. Step2) I start two nodes and send cib. Last updated: Thu Oct

Re: [Pacemaker] pacemaker version

2010-10-06 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 5:04 PM, Gianluca Cecchi wrote: > On Wed, Oct 6, 2010 at 4:25 PM, Shravan Mishra > wrote: >> That is what I heard too, that's the reason for this question. >> > > On June, inside a complex thread regarding "colocation -inf", Andrew > reported the link and also several clar

Re: [Pacemaker] pacemaker version

2010-10-06 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 1:51 PM, Vadym Chepkov wrote: > > On Oct 6, 2010, at 2:48 AM, Andrew Beekhof wrote: > >> On Tue, Oct 5, 2010 at 7:53 PM, Shravan Mishra >> wrote: >>> Hi, >>> >>> I was interested in knowing that if I have to choose between pacemaker >>> 1.0 vs 1.1 which one should I use. >

Re: [Pacemaker] hgrc of the online repository

2010-10-06 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 12:23 PM, Bernd Schubert wrote: > Hello Andrew, > > any chance you could add a few lines to the .hg/hgrc of the online repository? > Or to /etc/mercurial/hgrc or /etc/mercurial/hgrc.d? > > Reading patches is more easy if function names are provided... > > [diff] > git = True

[Pacemaker] ActiveMQ on pacemaker

2010-10-06 Thread Ivo Rodrigues
Hello guys, I'm trying to make activeMQ working on pacemaker (master/slave) with DRBD for the kahaDB. This way, if a node goes down the second will step up. I created a symbolic link for activemq start script on etc/init.d/ and defined a lsb:activemq resource.. but when the node start shutting do

Re: [Pacemaker] custom path to LSB compliant script in primitive?

2010-10-06 Thread Craig Hurley
Thank you. On 7 October 2010 10:05, Lars Ellenberg wrote: > On Thu, Oct 07, 2010 at 09:36:27AM +1300, Craig Hurley wrote: >> Hello, >> >> Is it possible to provide a custom path to a lsb compliant script in a >> primitive?  To get it working I copied my script to /etc/init.d/rp and >> I'm using t

Re: [Pacemaker] custom path to LSB compliant script in primitive?

2010-10-06 Thread Lars Ellenberg
On Thu, Oct 07, 2010 at 09:36:27AM +1300, Craig Hurley wrote: > Hello, > > Is it possible to provide a custom path to a lsb compliant script in a > primitive? To get it working I copied my script to /etc/init.d/rp and > I'm using the following config: > > primitive p_rp lsb:rp \ > op mon

Re: [Pacemaker] custom path to LSB compliant script in primitive?

2010-10-06 Thread Serge Dubrouski
LSB scripts do not accept any parameters. You have to convert to OCF. Since you already started changing it, you are half way done :-) On Wed, Oct 6, 2010 at 2:36 PM, Craig Hurley wrote: > Hello, > > Is it possible to provide a custom path to a lsb compliant script in a > primitive?  To get it wo

[Pacemaker] custom path to LSB compliant script in primitive?

2010-10-06 Thread Craig Hurley
Hello, Is it possible to provide a custom path to a lsb compliant script in a primitive? To get it working I copied my script to /etc/init.d/rp and I'm using the following config: primitive p_rp lsb:rp \ op monitor interval="30s" \ meta target-role="Started" ... but I'd like to

Re: [Pacemaker] how to test network access and fail over accordingly?

2010-10-06 Thread Vadym Chepkov
On Oct 6, 2010, at 4:21 PM, Craig Hurley wrote: > I tried using ping instead of pingd and I added "number" to the > evaluation, I get the same results :/ > > primitive p_ping ocf:pacemaker:ping params host_list=172.20.0.254 > clone c_ping p_ping meta globally-unique=false > location loc_ping g_c

Re: [Pacemaker] how to test network access and fail over accordingly?

2010-10-06 Thread Craig Hurley
I tried using ping instead of pingd and I added "number" to the evaluation, I get the same results :/ primitive p_ping ocf:pacemaker:ping params host_list=172.20.0.254 clone c_ping p_ping meta globally-unique=false location loc_ping g_cluster_services rule -inf: not_defined p_ping or p_ping number

Re: [Pacemaker] stonith resource issue

2010-10-06 Thread Shravan Mishra
hb_report -V cluster-glue: 1.0.6 (5b6c964bbf97ec3710224774c5e22de59724b118) On Wed, Oct 6, 2010 at 1:32 PM, Shravan Mishra wrote: > Please fine hb_report. > > Thanks > Shravan > > On Wed, Oct 6, 2010 at 1:14 PM, Shravan Mishra > wrote: >> Hi, >> >> Please find the stonith and crmd logs attache

Re: [Pacemaker] stonith resource issue

2010-10-06 Thread Shravan Mishra
Please fine hb_report. Thanks Shravan On Wed, Oct 6, 2010 at 1:14 PM, Shravan Mishra wrote: > Hi, > > Please find the stonith and crmd logs attached. > > I have pruned stonith.logs as it contained lots of repeatable messages. > > I 'm in the process of installing Date::parse perl module then I'l

Re: [Pacemaker] clone

2010-10-06 Thread Cédric Castanet
Le 06/10/2010 17:06, Dejan Muhamedagic a écrit : Oops. Well, turns out that crm doesn't really like non-ascii chars. Two of your quotes above are funny. At globally-unique and clone-max. Replace them with ascii " and you should be OK. Thanks, Dejan Oups ! I'm confused. I wasn't able to see

Re: [Pacemaker] stonith resource issue

2010-10-06 Thread Dejan Muhamedagic
Hi, On Wed, Oct 06, 2010 at 11:04:34AM -0400, Shravan Mishra wrote: > Hi guys, > > I'm having a weird problem with my stonith resources.They are > constantly starting and stopping. > > I'm using: > > pacemaker=1.1.3 > corosync=1.2.8 > glue=glue_1.0-10 Hmm, which version is this really? Can you

Re: [Pacemaker] configuration problem: migration no working always

2010-10-06 Thread Dejan Muhamedagic
Hi, On Wed, Oct 06, 2010 at 01:11:20PM +0200, Claus Denk wrote: > Dear all, > > I don't understand the following behaviour. We have and > active/passive cluster with drbd, mysql and apache, I attach the > configuration. > I have devided the resources in two groups (filesystem+mysql in one > "Basi

Re: [Pacemaker] pacemaker version

2010-10-06 Thread Gianluca Cecchi
On Wed, Oct 6, 2010 at 4:25 PM, Shravan Mishra wrote: > That is what I heard too, that's the reason for this question. > On June, inside a complex thread regarding "colocation -inf", Andrew reported the link and also several clarifications after some questions of mine... See in particular: http:

Re: [Pacemaker] clone

2010-10-06 Thread Dejan Muhamedagic
Hi, On Wed, Oct 06, 2010 at 11:55:53AM +0200, Cédric Castanet wrote: > , *Michael Schwartzkopff* a écrit : > >Try to configure the resource-stickiness within the resource and not within > >the > >clone. > > > Sorry not answering directly to the thread, but i've done a mistake > in the mailing lis

[Pacemaker] stonith resource issue

2010-10-06 Thread Shravan Mishra
Hi guys, I'm having a weird problem with my stonith resources.They are constantly starting and stopping. I'm using: pacemaker=1.1.3 corosync=1.2.8 glue=glue_1.0-10 2.6.29.6-0.6.smp.gcc4.1.x86_64 My configuration looks like this: === node ha1.itactics.com node ha2.itactics.

Re: [Pacemaker] clone

2010-10-06 Thread Michael Schwartzkopff
On Wednesday 06 October 2010 16:45:19 Cédric Castanet wrote: > Do you think the folowing cib-bootstrap-options error is linked to my > probleme ? > > r...@node1:~# crm configure verify > WARNING: MonSite: default timeout 20s for start is smaller than the > advised 40s > WARNING: MonSite: default t

Re: [Pacemaker] clone

2010-10-06 Thread Cédric Castanet
Do you think the folowing cib-bootstrap-options error is linked to my probleme ? r...@node1:~# crm configure verify WARNING: MonSite: default timeout 20s for start is smaller than the advised 40s WARNING: MonSite: default timeout 20s for stop is smaller than the advised 60s ERROR: cib-bootstr

Re: [Pacemaker] pacemaker version

2010-10-06 Thread Shravan Mishra
That is what I heard too, that's the reason for this question. Thanks Shravan On Wed, Oct 6, 2010 at 7:51 AM, Vadym Chepkov wrote: > > On Oct 6, 2010, at 2:48 AM, Andrew Beekhof wrote: > >> On Tue, Oct 5, 2010 at 7:53 PM, Shravan Mishra >> wrote: >>> Hi, >>> >>> I was interested in knowing t

Re: [Pacemaker] Online and Offline status when doing crm_mon

2010-10-06 Thread Mike A Meyer
Pavlos, Thanks for the information.  No I didn't know we had to follow that procedure.  That did solve my issue. Mike From: Pavlos Parissis To: The Pacemaker cluster resource manager Date: 10/06/2010 01:55 AM Subject: Re: [Pacemaker] Online and Offline status when doing crm_mon On

Re: [Pacemaker] how to test network access and fail over accordingly?

2010-10-06 Thread Vadym Chepkov
On Oct 6, 2010, at 3:43 AM, Jayakrishnan wrote: > > Hello, > > Guess the change:-- > location loc_pingd g_cluster_services rule -inf: not_defined pingd or pingd > number:lte 0 > > should work > > > ocf:pacemaker:ping is recommended as a replacement for pingd RA Both RA define node a

Re: [Pacemaker] pacemaker version

2010-10-06 Thread Vadym Chepkov
On Oct 6, 2010, at 2:48 AM, Andrew Beekhof wrote: > On Tue, Oct 5, 2010 at 7:53 PM, Shravan Mishra > wrote: >> Hi, >> >> I was interested in knowing that if I have to choose between pacemaker >> 1.0 vs 1.1 which one should I use. > > Have a read of: > > http://theclusterguy.clusterlabs.org

[Pacemaker] configuration problem: migration no working always

2010-10-06 Thread Claus Denk
Dear all, I don't understand the following behaviour. We have and active/passive cluster with drbd, mysql and apache, I attach the configuration. I have devided the resources in two groups (filesystem+mysql in one "BasicServices", cluster ip and apache in the other, "ClusterServices"). Now, if

[Pacemaker] hgrc of the online repository

2010-10-06 Thread Bernd Schubert
Hello Andrew, any chance you could add a few lines to the .hg/hgrc of the online repository? Or to /etc/mercurial/hgrc or /etc/mercurial/hgrc.d? Reading patches is more easy if function names are provided... [diff] git = True nodates = True showfunc = True Thanks, Bernd -- Bernd Schubert D

Re: [Pacemaker] clone

2010-10-06 Thread Cédric Castanet
, *Michael Schwartzkopff* a écrit : Try to configure the resource-stickiness within the resource and not within the clone. Sorry not answering directly to the thread, but i've done a mistake in the mailing list configuration. hope this is ok now. I tried but it doesn't change anything:

Re: [Pacemaker] resource stop timeout broken in 1.0 branch tip

2010-10-06 Thread Keisuke MORI
2010/10/6 Andrew Beekhof : >> Is there more changesets >> that need to be backported regarding to this issues? > > There is now that Andreas brought the problem to my attention :-) >   http://hg.clusterlabs.org/pacemaker/1.1/rev/e097c70226fe > >> If not, I think that the Andreas' patch should be ap

Re: [Pacemaker] clone

2010-10-06 Thread Michael Schwartzkopff
On Wednesday 06 October 2010 11:19:05 Cédric Castanet wrote: > Hi everybody, > I'm discovering pacemaker and cluster. I've Succed to build a > avtive/passive cluster with apache and a custom app. I want to switch to > active/active. so i guess i need to clone the resource. I had a probleme > with d

[Pacemaker] clone

2010-10-06 Thread Cédric Castanet
Hi everybody, I'm discovering pacemaker and cluster. I've Succed to build a avtive/passive cluster with apache and a custom app. I want to switch to active/active. so i guess i need to clone the resource. I had a probleme with de crm configure clone. I'm not able to find a way to solve it. Can

Re: [Pacemaker] Can somebody please explain pengine's urge to move all resources?

2010-10-06 Thread Keisuke MORI
Hi Raoul, 2010/9/28 Andrew Beekhof : > On Tue, Sep 28, 2010 at 11:48 AM, Raoul Bhatia [IPAX] > wrote: >> On 09/23/2010 09:28 AM, Andrew Beekhof wrote: >>> The good news is that 1.1.3 doesn't have that behavior. >>> Lets see how 1.0 goes once all the relevant patches have been backported. >> >> t

Re: [Pacemaker] resource stop timeout broken in 1.0 branch tip

2010-10-06 Thread Andrew Beekhof
On Mon, Oct 4, 2010 at 1:18 PM, Keisuke MORI wrote: > 2010/10/2 Andreas Hofmeister : >> Hi, >> >> it seems to me that patch >> >>  http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/8241f689bf9f >> >> broke  timeouts for stop operations. The observable effect is that the >> timeout for stop operat

Re: [Pacemaker] how to test network access and fail over accordingly?

2010-10-06 Thread Jayakrishnan
Hello, Guess the change:-- location loc_pingd g_cluster_services rule -inf: not_defined pingd or pingdnumber:lte 0 should work -- Regards, Jayakrishnan. L Visit: www.foralllinux.blogspot.com www.jayakrishnan.bravehost.com On Wed, Oct 6, 2010 at 11:56 AM, Claus Denk wrote: > I am having