Re: [Pacemaker] Which Linux to use for cluster

2010-03-17 Thread Norbert Winkler
Thank you al for answering wtih really good statements especially thanks to Dr. Schwartzkopff and to Dominik Klein both helps me a lot Thank you Norbert Winkler Am 17.03.2010 10:37, schrieb Michael Schwartzkopff: Am Mittwoch, 17. März 2010 09:01:11 schrieb Andrew Beekhof: On Tue, Mar 16

[Pacemaker] Announce: pacemaker-mgmt 2.0.0 released

2010-03-17 Thread Yan Gao
Hi, Because there has been a specific change of pacemaker-mgmt for working with pacemaker-1.1 and devel branch, which makes the code of pacemaker-mgmt no longer compatible with pacemaker-1.0 series. And probably there would be more that kind of changes. So I tagged and released pacemaker-mgmt-2.0

Re: [Pacemaker] need info multicast-corosync

2010-03-17 Thread Michael Schwartzkopff
Am Mittwoch, 17. März 2010 22:46:32 schrieb Winkler Norbert: > Hello again > At first sorry for this simple (stupid?) question in this special forum > so please don't lough: > I am still learning a lot trying to configure a cluster. > I am unterstand the sense of multicastadress but i have still

Re: [Pacemaker] clone resource doesn't stop during node standby

2010-03-17 Thread Junko IKEDA
Hi, the previous hb_report was broken. I attached syslog agin. (log's time stamp is updated) Thanks, Junko On Thu, 18 Mar 2010 11:49:45 +0900, Junko IKEDA wrote: Hi, I run the following resources on two node. # crm_mon -1 Last updated: Thu Mar 18 11:26:54 2010 Stack: openais

[Pacemaker] clone resource doesn't stop during node standby

2010-03-17 Thread Junko IKEDA
Hi, I run the following resources on two node. # crm_mon -1 Last updated: Thu Mar 18 11:26:54 2010 Stack: openais Current DC: cspm01 - partition with quorum Version: 1.0.8-2a76c6ac04bc stable-1.0 tip 2 Nodes configured, 2 expected votes 2 Resources configured. Online:

Re: [Pacemaker] 10.8 and pingd problem

2010-03-17 Thread Serge Dubrouski
Both version of pingd expose the same problem for me. After fresh start CIB doesn't get updated with pingd attributes for the cluster nodes so my location rule: location connected myGroup \ rule $id="connected-rule" -inf: not_defined pingd or pingd lte 0 prevents resources from starting.

Re: [Pacemaker] the behavior of clone resource

2010-03-17 Thread Junko IKEDA
Hi Dejan, Thank you for your advice, I removed /usr/etc/logd.cf first, and set use_logd off in corosync.conf. logging setting in corosync.conf is here, logging { fileline: on to_syslog: yes syslog_facility: local1 syslog_priority: debug debug: on

Re: [Pacemaker] restart of crmd?

2010-03-17 Thread Andrew Beekhof
On Wed, Mar 17, 2010 at 10:36 PM, Alan Jones wrote: > On Wed, Mar 17, 2010 at 1:39 PM, Andrew Beekhof wrote: >> >> On Wed, Mar 17, 2010 at 7:23 PM, Alan Jones >> wrote: >> > Is there any interest among people working with Pacemaker to provide for >> > restarting crmd locally without failover and

[Pacemaker] need info multicast-corosync

2010-03-17 Thread Winkler Norbert
Hello again At first sorry for this simple (stupid?) question in this special forum so please don't lough: I am still learning a lot trying to configure a cluster. I am unterstand the sense of multicastadress but i have still problems to handle it. My nodes are 192.168.1.21 and 192.168.1.22 o

Re: [Pacemaker] restart of crmd?

2010-03-17 Thread Alan Jones
On Wed, Mar 17, 2010 at 1:39 PM, Andrew Beekhof wrote: > On Wed, Mar 17, 2010 at 7:23 PM, Alan Jones > wrote: > > Is there any interest among people working with Pacemaker to provide for > > restarting crmd locally without failover and rediscovering resouce agent > > states through their monitor

Re: [Pacemaker] 10.8 and pingd problem

2010-03-17 Thread Andrew Beekhof
On Wed, Mar 17, 2010 at 9:10 PM, Serge Dubrouski wrote: > Hello - > > I've just installed fresh new Pacemaker 1.0.8 and ran into a problem > with pind. When Corosync/Pacemaker/Pingd start it doesn't initialize > pingd attribute for the nodes so the resources in the cluster stay > down. Then if I c

Re: [Pacemaker] restart of crmd?

2010-03-17 Thread Andrew Beekhof
On Wed, Mar 17, 2010 at 7:23 PM, Alan Jones wrote: > Is there any interest among people working with Pacemaker to provide for > restarting crmd locally without failover and rediscovering resouce agent > states through their monitor scripts? Out of interest, what would that achieve? _

Re: [Pacemaker] Starting a resource before drbd becomes primary

2010-03-17 Thread Michael Schwartzkopff
Am Mittwoch, 17. März 2010 11:39:51 schrieb jimbob palmer: > What is the syntax to start a resource before drbd becomes primary? order ordResDRBD inf: resource:start msDRBD:Promote -- Dr. Michael Schwartzkopff MultiNET Services GmbH Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany Tel: +4

Re: [Pacemaker] node states

2010-03-17 Thread Andrew Beekhof
On Wed, Mar 17, 2010 at 7:53 PM, Matthew Palmer wrote: > On Wed, Mar 17, 2010 at 07:16:16AM -0500, Schaefer, Diane E wrote: >>   We were wondering what the node state of UNCLEAN, with the three >>   variations of online, offline and pending returned in crm_mon mean.  We >>   had the heartbeat serv

[Pacemaker] 10.8 and pingd problem

2010-03-17 Thread Serge Dubrouski
Hello - I've just installed fresh new Pacemaker 1.0.8 and ran into a problem with pind. When Corosync/Pacemaker/Pingd start it doesn't initialize pingd attribute for the nodes so the resources in the cluster stay down. Then if I change configuration (delete/add pingd resource or clone, stop/start

Re: [Pacemaker] Announce: Pacemaker 1.0.8 released

2010-03-17 Thread Andrew Beekhof
On Wed, Mar 17, 2010 at 8:37 PM, Serge Dubrouski wrote: > It doesn't, at least the one for FC10. Ah, true. Though the problem will go away in a month or two when F-13 comes out and Fedora drops support for F-10 :) > On Wed, Mar 17, 2010 at 1:31 PM, Andrew Beekhof wrote: >> On Wed, Mar 17, 2010

Re: [Pacemaker] Announce: Pacemaker 1.0.8 released

2010-03-17 Thread Serge Dubrouski
It doesn't, at least the one for FC10. On Wed, Mar 17, 2010 at 1:31 PM, Andrew Beekhof wrote: > On Wed, Mar 17, 2010 at 8:16 PM, Serge Dubrouski wrote: >> Hello - >> >> It looks like there is a package name conflict for RedHat >> distribution, at least for CentOS. RedHat has its own resource-age

Re: [Pacemaker] Announce: Pacemaker 1.0.8 released

2010-03-17 Thread Andrew Beekhof
On Wed, Mar 17, 2010 at 8:16 PM, Serge Dubrouski wrote: > Hello - > > It looks like there is a package name conflict for RedHat > distribution, at least for CentOS. RedHat has its own resource-agents > package: > > > # yum search resource-agents > == Matched: resource-agents ==

Re: [Pacemaker] Announce: Pacemaker 1.0.8 released

2010-03-17 Thread Serge Dubrouski
Hello - It looks like there is a package name conflict for RedHat distribution, at least for CentOS. RedHat has its own resource-agents package: # yum search resource-agents == Matched: resource-agents === resource-agents.i386 : Reusable cluster resource s

Re: [Pacemaker] node states

2010-03-17 Thread Matthew Palmer
On Wed, Mar 17, 2010 at 07:16:16AM -0500, Schaefer, Diane E wrote: > We were wondering what the node state of UNCLEAN, with the three > variations of online, offline and pending returned in crm_mon mean. We > had the heartbeat service off on one of our nodes and the other node > reported U

[Pacemaker] restart of crmd?

2010-03-17 Thread Alan Jones
Is there any interest among people working with Pacemaker to provide for restarting crmd locally without failover and rediscovering resouce agent states through their monitor scripts? Alan ___ Pacemaker mailing list Pacemaker@oss.clusterlabs.org http://os

[Pacemaker] Announce: Pacemaker 1.0.8 released

2010-03-17 Thread Andrew Beekhof
Pacemaker 1.0.8 was tagged and released last night. You can read the full announcement and changelog at: http://theclusterguy.clusterlabs.org/post/452813842/pacemaker-1-0-8-released Updated packages for rpm-based distros are also now available at the usual location. See the following link for

[Pacemaker] node states

2010-03-17 Thread Schaefer, Diane E
Hi, We were wondering what the node state of UNCLEAN, with the three variations of online, offline and pending returned in crm_mon mean. We had the heartbeat service off on one of our nodes and the other node reported UNCLEAN (online). We seem to get it when the nodes are not communicating.

Re: [Pacemaker] Resource-Monitoring with an "On Fail"-Action

2010-03-17 Thread Tom Tux
Hi Dejan Thanks for your answer. I'm using this cluster with the packages from the HAE (HighAvailability-Extension)-Repository from SLES11. Therefore, is it possible, to upgrade the cluster-glue from source? I think, the better way is to wait for updates in the hae-repository from novell. Or do y

[Pacemaker] Starting a resource before drbd becomes primary

2010-03-17 Thread jimbob palmer
What is the syntax to start a resource before drbd becomes primary? ___ Pacemaker mailing list Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Re: [Pacemaker] Two node ms cluster: poweroff behaviour

2010-03-17 Thread jimbob palmer
Silly me. Thanks. 2010/3/16 Florian Haas : > crm configure property no-quorum-policy=ignore > > Cheers, > Florian > > On 2010-03-16 17:21, jimbob palmer wrote: >> How can I configure a two node master slave cluster to continue >> working when one node is powered off? >> I would like it to keep wor

Re: [Pacemaker] Multi-level ACLs for the CIB

2010-03-17 Thread Yan Gao
Hi Andrew, On 02/23/10 17:23, Yan Gao wrote: > On 02/23/10 04:10, Andrew Beekhof wrote: >> On Mon, Feb 22, 2010 at 8:58 AM, Yan Gao wrote: >>> Hi Andrew, >>> >>> On 02/08/10 17:48, Andrew Beekhof wrote: On Thu, Feb 4, 2010 at 5:24 PM, Yan Gao wrote: >> And put exclusions for things like

Re: [Pacemaker] Resource-Monitoring with an "On Fail"-Action

2010-03-17 Thread Dejan Muhamedagic
Hi, On Wed, Mar 17, 2010 at 10:57:16AM +0100, Tom Tux wrote: > Hi Dominik > > The problem is, that the cluster does not do the monitor-action every > 20s. The last time, when he did the action was at 09:21. And now we > have 10:37: There was a serious bug in some cluster-glue packages. What you'

Re: [Pacemaker] [PATCH] Medium: build: require Net-SNMP 5.3 or later

2010-03-17 Thread Dejan Muhamedagic
Hi, On Wed, Mar 17, 2010 at 09:17:38AM +0100, Florian Haas wrote: > Andrew, > > now that Pacemaker has been on a bi-monthly release schedule for a > while, is there any chance you could consider publishing RCs before the > actual releases, at least for the stable-1.0 branch? Good idea. That woul

Re: [Pacemaker] Resource-Monitoring with an "On Fail"-Action

2010-03-17 Thread Tom Tux
Hi Dominik The problem is, that the cluster does not do the monitor-action every 20s. The last time, when he did the action was at 09:21. And now we have 10:37: MySQL_MonitorAgent_Resource: migration-threshold=3 + (479) stop: last-rc-change='Wed Mar 17 09:21:28 2010' last-run='Wed Mar 17 09:

Re: [Pacemaker] Which Linux to use for cluster

2010-03-17 Thread Michael Schwartzkopff
Am Mittwoch, 17. März 2010 09:01:11 schrieb Andrew Beekhof: > On Tue, Mar 16, 2010 at 10:52 PM, Michael Schwartzkopff > > wrote: > > Am Dienstag, 16. März 2010 20:43:32 schrieb Winkler Norbert: > >> Hallo again Forum > >> It seems that a failed the second time building a pacemaker cluster > >> fi

Re: [Pacemaker] Resource-Monitoring with an "On Fail"-Action

2010-03-17 Thread Dominik Klein
Hi Tom have a look at the logs and see whether the monitor op really returns 99. (grep for the resource-id). If so, I'm not sure what the cluster does with rc=99. As far as I know, rc=4 would be status=failed (unknown actually). Regards Dominik Tom Tux wrote: > Thanks for your hint. > > I've co

Re: [Pacemaker] Resource-Monitoring with an "On Fail"-Action

2010-03-17 Thread Tom Tux
Thanks for your hint. I've configured an lsb-resource like this (with migration-threshold): primitive MySQL_MonitorAgent_Resource lsb:mysql-monitor-agent \ meta target-role="Started" migration-threshold="3" \ op monitor interval="10s" timeout="20s" on-fail="restart" I have now mo

Re: [Pacemaker] Which Linux to use for cluster

2010-03-17 Thread Dominik Klein
Hi Norbert I don't know what you did in 11.2, but I'll try to tell you what I do. I'm mostly still on 11.1 and use the clusterlabs repo. After installing the operating system from scratch, pretty much all I do is following the install page from the wiki http://clusterlabs.org/wiki/Install ie zy

Re: [Pacemaker] [PATCH] Medium: build: require Net-SNMP 5.3 or later

2010-03-17 Thread Florian Haas
Andrew, now that Pacemaker has been on a bi-monthly release schedule for a while, is there any chance you could consider publishing RCs before the actual releases, at least for the stable-1.0 branch? Cheers, Florian On 03/17/2010 09:11 AM, Florian Haas wrote: > # HG changeset patch > # User Flor

[Pacemaker] [PATCH] Medium: build: require Net-SNMP 5.3 or later

2010-03-17 Thread Florian Haas
# HG changeset patch # User Florian Haas # Date 1268813453 -3600 # Branch stable-1.0 # Node ID 6f008f4c9710758972d2065c58c2800ba4694492 # Parent 2a76c6ac04bcccf42b89a08e55bfbd90da2fb49a Medium: build: require Net-SNMP 5.3 or later Changeset ff75cd9e1093 introduced support for Net-SNMP 5.3, but a

Re: [Pacemaker] Which Linux to use for cluster

2010-03-17 Thread Andrew Beekhof
On Tue, Mar 16, 2010 at 10:52 PM, Michael Schwartzkopff wrote: > Am Dienstag, 16. März 2010 20:43:32 schrieb Winkler Norbert: >> Hallo again Forum >> It seems that  a failed the second time building a pacemaker cluster >> first with opensuse 11. 2 onboard software >> second with opensuse 11.2 with

Re: [Pacemaker] Which Linux to use for cluster

2010-03-17 Thread Norbert Winkler
Thank you for this information I have also the old book at home "Clusterbau mit Linux-Ha Version 2" and i'll hope it will work. I also have a good backup (veeam) so i can live with heartbeat cluster also I will try this first and i will go one try pacemaker in virtuell machines that i can see t