Re: [Pacemaker] 2 node failover cluster + MySQL Master-Master replica setup

2010-11-11 Thread Ruzsinszky Attila
>You're not making sense, first you say MySQL Master-Master, then you >mention master mysqld on clusterB and slave mysqld on clusterA. So, >which one is it: Yes, it is true. If I stop openais and I start mysql without openais the config is M-M (or Multi-Master). When pacemaker starts mysql process

Re: [Pacemaker] symmetric anti-collocation

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 9:02 PM, Alan Jones wrote: > How to I express symmetric anti-collocation in Pacemaker 1.0.9.1? > I'd like to write two rules: > colocation X-Y -2: X Y > colocation Y-X -2: Y X the second one is implied by the first and is therefore redundant > The idea is that external co

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
Ah, silly me. After +iter->offset = 0; you need: +iter->values = NULL; On Fri, Nov 12, 2010 at 2:07 AM, nozawat wrote: > Hi Andrew, > >  I show below a result of print. > > 1)print *iter > (gdb) print *iter > $1 = {offset = 2, hash = 0x12aa7ec0, values = 0x7fff1568e580} > - > > 2)p

Re: [Pacemaker] Balancing of clone resources (globally-unique=true)

2010-11-11 Thread Chris Picton
I have attached the output as requested On Thu, 11 Nov 2010 11:21:51 +0100, Andrew Beekhof wrote: >>> what version is this? >> >> >> This is 1.0.9 > > Odd. I wouldn't have expected this behavior. Can you attach the > output > from cibadmin -Ql please? > > >>> On Tue, Nov 9, 2010 at 5:51 PM,

[Pacemaker] start filesystem like this is right?

2010-11-11 Thread jiaju liu
start resource steps step(1) crm configure primitive vol_mpath0 ocf:heartbeat:Filesystem meta target-role=stopped params device=/dev/mapper/mpath0 directory=/mnt/mapper/mpath0 fstype='lustre' op start timeout=300s  op stop timeout=120s op monitor timeout=120s interval=60s op notify timeout=60s

Re: [Pacemaker] stand_alone_ping stop Node start

2010-11-11 Thread jiaju liu
tch.pdf > Bugs: > http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker > > -- next part -- An HTML attachment was scrubbed... URL: <http://oss.clusterlabs.org/pipermail/pacemaker/attachments/2010/4d7f3ea1/attachment-0001.htm> ---

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread nozawat
Hi Andrew, I show below a result of print. 1)print *iter (gdb) print *iter $1 = {offset = 2, hash = 0x12aa7ec0, values = 0x7fff1568e580} - 2)print *values (gdb) print *values $2 = {data = 0x7fff1568e5c0, next = 0x2b247ebb85a1, prev = 0x1} - Regards, Tomo 2010/11/12 Andrew Beekhof >

[Pacemaker] symmetric anti-collocation

2010-11-11 Thread Alan Jones
How to I express symmetric anti-collocation in Pacemaker 1.0.9.1? I'd like to write two rules: colocation X-Y -2: X Y colocation Y-X -2: Y X The idea is that external conditions could place either resource and I'd like Pacemaker to place the other accordingly. Unfortunately, Pacemaker will only app

Re: [Pacemaker] Infinite fail-count and migration-threshold after node fail-back

2010-11-11 Thread Dan Frincu
Hi, Pavlos Parissis wrote: On 11 November 2010 13:04, Dan Frincu wrote: Hi, Andrew Beekhof wrote: On Mon, Oct 11, 2010 at 9:40 AM, Dan Frincu wrote: Hi all, I've managed to make this setup work, basically the issue with a symmetric-cluster="false" and specifying the resources' locatio

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 3:50 PM, nozawat wrote: > Hi Andrew > >  Sorry,pengine output a core. could you go up to frame #1 and run: print *iter print *values > > - > gdb) where > #0  0x2b247fa8b53a in g_list_nth_data () from /lib64/libglib-2.0.so.0 > #1  0x2b247ebc5027 in g_hash_table

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread nozawat
Hi Andrew Sorry,pengine output a core. - gdb) where #0 0x2b247fa8b53a in g_list_nth_data () from /lib64/libglib-2.0.so.0 #1 0x2b247ebc5027 in g_hash_table_iter_next (iter=0x7fff1568e4c0, key=0x0, value=0x7fff1568e4e0) at ../include/crm/common/util.h: 348 #2 0x2b247ebc9301

Re: [Pacemaker] Infinite fail-count and migration-threshold after node fail-back

2010-11-11 Thread Pavlos Parissis
On 11 November 2010 13:04, Dan Frincu wrote: > Hi, > > Andrew Beekhof wrote: > > On Mon, Oct 11, 2010 at 9:40 AM, Dan Frincu wrote: > > > Hi all, > > I've managed to make this setup work, basically the issue with a > symmetric-cluster="false" and specifying the resources' location manually > mean

[Pacemaker] drbd-xen and fencing

2010-11-11 Thread Vadym Chepkov
Hi, I posted a less elaborate version of this question to drbd mail-list, but, unfortunately, didn't get a reply, maybe audience of this list has more experience. I am trying to make xen live migration to work reliably, but wasn't successful so far. Here is the problem. In a cluster configura

Re: [Pacemaker] using xml for rules

2010-11-11 Thread Pavlos Parissis
I removed "score=2" from 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 and now I only get, from these I can't figure

Re: [Pacemaker] Infinite fail-count and migration-threshold after node fail-back

2010-11-11 Thread Dan Frincu
Hi, Andrew Beekhof wrote: On Mon, Oct 11, 2010 at 9:40 AM, Dan Frincu wrote: Hi all, I've managed to make this setup work, basically the issue with a symmetric-cluster="false" and specifying the resources' location manually means that the resources will always obey the location constraint,

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 12:31 PM, nozawat wrote: > Hi Andrew, > >  I ran it. However, an error has been output. >  Probably I have a feeling that glib does not move well. >  I attached ha-log. > >  I feel like cannot read a library well. >  It is contents of core as follows. you'll need the debug

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread nozawat
Hi Andrew, I'm sorry. The contents of the core file were wrong. I am as follows definitely. - $ gdb /usr/lib64/heartbeat/pengine core.27920 GNU gdb Fedora (6.8-37.el5) Copyright (C) 2008 Free Software Foundation, Inc. License GPLv3+: GNU GPL version 3 or later

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread nozawat
Hi Andrew, I ran it. However, an error has been output. Probably I have a feeling that glib does not move well. I attached ha-log. I feel like cannot read a library well. It is contents of core as follows. $ gdb /usr/sbin/corosync core.27920 GNU gdb Fedora (6.8-37.el5) Copyright (C) 200

Re: [Pacemaker] [Question]About the recovery procedure from the state that a node was divided.

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 4, 2010 at 2:44 AM, wrote: > Hi All, > > We tested it about the recovery procedure from the state that a node was > divided. > (As for four nodes, three nodes are active, and one node is constitution of > the standby.) > > It is the restoration from a state divided by two nodes that

Re: [Pacemaker] [Problem]Number of times control of the fail-count is late.

2010-11-11 Thread Andrew Beekhof
On Wed, Nov 10, 2010 at 5:20 AM, wrote: > Hi, > > We constituted a cluster by two node constitution. > The migration-threshold set it to 2. > > We confirmed a phenomenon in the next procedure. > > Step1) Start two nodes and send config5.crm. (The clnDiskd-resources is > original.) > > ==

Re: [Pacemaker] start error because "not installed" - stop fails with "not installed" - stonith

2010-11-11 Thread Andrew Beekhof
On Sat, Oct 9, 2010 at 12:36 AM, Andreas Kurz wrote: > Hello, > > if a resource has encounters a start error with rc=5 "not installed" the > stop action is not skipped before a restart is tried. I'd not expect a stop action at all. What version? > > Typically in such a situation the stop will a

Re: [Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 11:38 AM, Simon Jansen wrote: > Hi Andrew, > > thank you for your answer. > >> Does the ocf:heartbeat:Rsyslog script call crm_master? >> It needs to to tell pacemaker which instance to promote. > > Yes it does. But I forgot to call crm_master with the option -D in the stop

Re: [Pacemaker] Infinite fail-count and migration-threshold after node fail-back

2010-11-11 Thread Andrew Beekhof
On Mon, Oct 11, 2010 at 9:40 AM, Dan Frincu wrote: > Hi all, > > I've managed to make this setup work, basically the issue with a > symmetric-cluster="false" and specifying the resources' location manually > means that the resources will always obey the location constraint, and (as > far as I coul

Re: [Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Simon Jansen
Hi Andrew, thank you for your answer. Does the ocf:heartbeat:Rsyslog script call crm_master? > It needs to to tell pacemaker which instance to promote. > Yes it does. But I forgot to call crm_master with the option -D in the stop action. I think that this was the error. After correcting this issu

Re: [Pacemaker] stand_alone_ping stop Node start

2010-11-11 Thread Andrew Beekhof
On Thu, Oct 14, 2010 at 3:47 AM, jiaju liu wrote: > Hi > I reboot my node, and it appears > node2 pingd: [3932]: info: stand_alone_ping: Node 192.168.10.100 is > unreachable (read) > and the node could not start > > 192.168.10.100 is ib network I will start ib after the node start, so do > you

Re: [Pacemaker] using xml for rules

2010-11-11 Thread Andrew Beekhof
On Sun, Oct 17, 2010 at 7:26 PM, Pavlos Parissis wrote: > Hi, > > I am trying to make a rule to control the failback on the resources. I want > during working days from 06:00 to 23:00 and on weekend from 08:00 to 16:00 > to have > resource-stickiness 1000 and on the left hours zero, so cluster can

Re: [Pacemaker] Cotrolled migration of resources

2010-11-11 Thread Andrew Beekhof
On Fri, Oct 29, 2010 at 4:18 AM, Matthew F. Hymowitz wrote: > I am building a two node active/passive cluster supporting > iSCSITarget,iSCSILogicalUnit and ocf:linbit:drbd.   I can hard power down a > primary node (i.e. pull the power cord ) and all my resource migrate > correctly to the other nod

Re: [Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Dejan Muhamedagic
Hi, On Thu, Nov 11, 2010 at 11:11:14AM +0100, Andrew Beekhof wrote: > On Thu, Nov 11, 2010 at 9:59 AM, Simon Jansen > wrote: > > Hi everybody, > > > > I wrote a multistate resource agent to manage the Rsyslog service in a > > master/slave setup. > > In the master role the service is started with

Re: [Pacemaker] crazy notify with master/slave of group

2010-11-11 Thread Andrew Beekhof
On Tue, Nov 2, 2010 at 1:04 AM, Alan Jones wrote: > Hi, > Pacemaker 1.0.9.1, Corosync 1.2.7 > I have a sane master/slave configuration that gives me normal looking > notify() calls when I standby each node in turn. > However, when I configure the master/slave on a group of three > resources, thing

Re: [Pacemaker] Balancing of clone resources (globally-unique=true)

2010-11-11 Thread Andrew Beekhof
On Wed, Nov 10, 2010 at 1:51 PM, Chris Picton wrote: > On Wed, 10 Nov 2010 09:32:00 +0100, Andrew Beekhof wrote: > >> what version is this? > > > This is 1.0.9 Odd. I wouldn't have expected this behavior. Can you attach the output from cibadmin -Ql please? >> >> On Tue, Nov 9, 2010 at 5:51 PM,

Re: [Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 9:59 AM, Simon Jansen wrote: > Hi everybody, > > I wrote a multistate resource agent to manage the Rsyslog service in a > master/slave setup. > In the master role the service is started with a predefined > rsyslog_master.conf. In the slave role with a predefined rsyslog_sla

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 10:26 AM, nozawat wrote: > Hi Andrew, > >  Thanks for a revision. >  I confirmed completion of compiling it. >  I revised it a little, I attach a patch. Thanks! Did you try running it? > > Regards, > Tomo > > > 2010/11/11 Andrew Beekhof >> >> This might be a little bett

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread nozawat
Hi Andrew, Thanks for a revision. I confirmed completion of compiling it. I revised it a little, I attach a patch. Regards, Tomo 2010/11/11 Andrew Beekhof > This might be a little better: > > diff -r dd75da218e4f configure.ac > --- a/configure.ac Fri Oct 29 12:12:45 2010 +0200 > +++ b

Re: [Pacemaker] How can I restart a clone resource on a specific node ?

2010-11-11 Thread Andrew Beekhof
On Tue, Nov 9, 2010 at 5:56 PM, oaidel wrote: > Hello, > > I have a cluster 2 nodes (RL 5.2) using pacemaker (1.0.9) with corosync > > > > > value="1.0.9-89bd754939df5150de7cd76835f98fe90851b677"/> > name="cluster-infrastructure" value="openais"/> > name="expected-quorum-votes" value="2"/> >

[Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Simon Jansen
Hi everybody, I wrote a multistate resource agent to manage the Rsyslog service in a master/slave setup. In the master role the service is started with a predefined rsyslog_master.conf. In the slave role with a predefined rsyslog_slave.conf configuration file. The start operation of the agent star

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
This might be a little better: diff -r dd75da218e4f configure.ac --- a/configure.ac Fri Oct 29 12:12:45 2010 +0200 +++ b/configure.ac Tue Nov 09 13:20:55 2010 +0100 @@ -654,7 +654,7 @@ AC_MSG_RESULT(using $GLIBCONFIG) AC_CHECK_LIB(glib-2.0, g_hash_table_get_values) if test "x$ac_cv_li

Re: [Pacemaker] crm_mon and pingd

2010-11-11 Thread Keisuke MORI
No objections. I've push the changeset below: http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/53132ed532ea But it would be still preferable to rely on -A, particularly if you want to use two or more pingd resources or change the attribute name. I'd also agree for the filtering feature as an e