[Pacemaker] crm_mon watch fencing option - no notification

2015-03-03 Thread Schaefer, Diane E
Hello, We are running pacemaker 1.1.11-0.7.53 on SLES 11 SP3 and are attempting to get notifications of fencing events for an audit trail we are constructing. We are also hoping to get events for resource starts, stops and failures. We have constructed a script that writes to a special log when

[Pacemaker] hawk session timeout?

2014-12-01 Thread Schaefer, Diane E
Hi, I am running hawk 0.6.1-0.11.1 on SLES SP3. How do I configure HAWK so my web session times out? My users are concerned since it never times out by default. Thanks, Diane Schaefer

[Pacemaker] NFS concurrency

2014-08-05 Thread Schaefer, Diane E
Hi, I'm working with SLES 11 SP3 and pacemaker 1.1.10-0.15.25. I'm looking to define constraints that can allow multiple NFSv4 filesystem/exports to be started concurrently (they belong to the same LVM). I also have multiple access points. My model looks like this: | FS1 -

Re: [Pacemaker] Hawk session ends after start or stop action

2014-03-04 Thread Schaefer, Diane E
Hi Lars, >> I am running pacemaker on SLES 11 SP3 and have applied the update package >> released in December. The hawk level is 0.6.1-0.11.1 and lighttpd is >> 1.4.20-2.52.1. When I log into hawk using Firefox, Google Chrome or IE 9, >> all with the hacluster userid, I can view my clus

[Pacemaker] Hawk session ends after start or stop action

2014-03-03 Thread Schaefer, Diane E
Hi, I am running pacemaker on SLES 11 SP3 and have applied the update package released in December. The hawk level is 0.6.1-0.11.1 and lighttpd is 1.4.20-2.52.1. When I log into hawk using Firefox, Google Chrome or IE 9, all with the hacluster userid, I can view my cluster definition, but I

[Pacemaker] Pacemaker-gui will not clear constraint after a migration after upgrade to pacemaker 1.1.10

2014-01-14 Thread Schaefer, Diane E
Hello, I am running on SUSE Linux and recently upgraded to pacemaker 1.1.10. Here are my RPM levels: pacemaker-mgmt-2.1.2-0.11.4 pacemaker-mgmt-client-2.1.2-0.11.4 pacemaker-1.1.10-0.9.28 When I migrate a resource group, I see the new cli-ban- location constraint appear in the CIB. Crm confi

[Pacemaker] pgsql stays in Disconnect after online node

2013-11-15 Thread Schaefer, Diane E
We are working with the pgsql RA and pacemaker 1.1.9. We are having trouble understanding why our slave resource doesn't come out of DISCONNECT after we online the node. We did the following: 1) Put node running the Master (usrv-tsegp8) in Standby. The old slave became the master and th

[Pacemaker] location syntax for crm shell.

2011-09-14 Thread Schaefer, Diane E
Hi, We are running a two-node cluster using pacemaker 1.1.5-18.1 with heartbeat 3.0.4-41.1. I am confused about the correct syntax to use when adding a location constraint using the crm shell. I would like a resource to always run on a particular node. Here are the results of my experiments:

[Pacemaker] compression with heartbeat doesn't seem to work

2011-08-19 Thread Schaefer, Diane E
Hi, We are running a two-node cluster using pacemaker 1.1.5-18.1 with heartbeat 3.0.4-41.1. We are experiencing what seems like network issues and cannot make heartbeat recover. We are experiencing "message too long" and the systems can no longer sync. Our ha.cf is as follows: autojoin none

[Pacemaker] crm load replace issue

2011-08-10 Thread Schaefer, Diane E
Hi, I have two sets of two-node clusters, one set up for disaster recovery. I am attempting to back up my cluster from the production system and load the definitions into the DR cluster. The systems are named the same. I am using heartbeat 3.0.4 and pacemaker 1.1.5. When I run the crm configu

Re: [Pacemaker] cleaning resources restart dependents

2010-08-06 Thread Schaefer, Diane E
>> Hi, >> >> We had a question as to why the dependent resources are restarted when a >> clean is issued to the parent to clear the fail-count if all were happily >> running? This is with pacemaker 1.0.6 and heartbeat 3.0. >Clean removes the resource's operation history. >The cluster the

[Pacemaker] cleaning resources restart dependents

2010-08-05 Thread Schaefer, Diane E
Hi, We had a question as to why the dependent resources are restarted when a clean is issued to the parent to clear the fail-count if all were happily running? This is with pacemaker 1.0.6 and heartbeat 3.0. Thanks for any information, Diane Schaefer

Re: [Pacemaker] abrupt power failure problem

2010-06-15 Thread Schaefer, Diane E
-Original Message- From: Bernd Schubert [mailto:bs_li...@aakef.fastmail.fm] Sent: Tuesday, June 15, 2010 1:39 PM To: pacemaker@oss.clusterlabs.org Cc: Schaefer, Diane E Subject: Re: [Pacemaker] abrupt power failure

Re: [Pacemaker] Dependent Resources

2010-06-09 Thread Schaefer, Diane E
>> Hi, >> >> I have a parent resource(A) with two others that depend on it (B, C). The >> resources of B and C will not run if A is not running. I would like to >> monitor B and C in addition to A for availability. >> >> >> I defined collocation and order constraints as follows >> >> coloc

[Pacemaker] Dependent resources

2010-06-03 Thread Schaefer, Diane E
Hi, I have a parent resource (A) with two others that depend on it (B, C). Resources B and C will not run if A is not running. I would like to monitor B and C in addition to A for availability. I defined colocation and order constraints as follows: colocation colocation-A-B inf: B A col

[Pacemaker] disabling monitor actions

2010-05-04 Thread Schaefer, Diane E
Hi, We are using Pacemaker 1.0.6 and heartbeat 3.0.2. I would like to temporarily disable certain monitor actions. The pacemaker doc says to use cibadmin -M -X '' And to re-enable it with cibadmin -M -X '' However these actions are not successful and the resource continues to be monitored.

[Pacemaker] crm_resource --un-move option

2010-04-29 Thread Schaefer, Diane E
Hi, I would like to migrate my resources from one node to the other and automatically remove the cli-prefer-expr constraint so it can move back if needed. I have a parent resource with many children. When I issue the "crm_resource -r -M " command, I see the resources stop. However, depe

Re: [Pacemaker] long time to start

2010-04-21 Thread Schaefer, Diane E
>> Hi, >> I have a resource that sometimes can take 10 minutes to start after >> a failure due to log records that need to be sync'd. (my own OCF) >> >> I noticed while the start action was being performed, if other >> resources in my cluster report a "not running", no restart will

Re: [Pacemaker] long time to start

2010-04-19 Thread Schaefer, Diane E
>> Actually this RA waits for the sync to complete. If it takes longer than >> the allotted time-out, Pacemaker SIGTERM/SIGKILLs it. The issue is if it >> can never complete in the allotted time frame... >Then make the timeout longer? I could make the timeout longer, although I'm not sure

Re: [Pacemaker] long time to start

2010-04-19 Thread Schaefer, Diane E
>> Hi, >> >> I have a resource that sometimes can take 10 minutes to start after a >> failure due to log records that need to be sync'd. (my own OCF) I noticed >> while the start action was being performed, if other resources in my cluster >> report a "not running", no restart will be attem

[Pacemaker] long time to start

2010-04-16 Thread Schaefer, Diane E
Hi, I have a resource that sometimes can take 10 minutes to start after a failure due to log records that need to be sync'd. (my own OCF) I noticed while the start action was being performed, if other resources in my cluster report a "not running", no restart will be attempted until my long r

[Pacemaker] node states

2010-03-17 Thread Schaefer, Diane E
Hi, We were wondering what the node state UNCLEAN, with its three variations (online, offline and pending) as reported by crm_mon, means. We had the heartbeat service off on one of our nodes and the other node reported UNCLEAN (online). We seem to get it when the nodes are not communicating.

[Pacemaker] clarification on action timeouts

2010-03-02 Thread Schaefer, Diane E
Hi, If I don't specify an action (stop, start, monitor) timeout, is there a default? Does a value of 0 mean no timeout? Thanks, Diane Schaefer

[Pacemaker] what are valid chars for Type ID?

2010-02-23 Thread Schaefer, Diane E
Hi, I just wanted to confirm that Resource IDs can only be letters, digits and "_". Is there a way to enter any special characters into the ID fields such as "/" and "!"? Thanks, Diane Schaefer

[Pacemaker] displaying failcounts with crm_mon --failcounts

2010-01-27 Thread Schaefer, Diane E
Hi, I am using Pacemaker 1.0.6. I noticed when I use crm_mon --failcounts when I have a resource in the failed state I get the migration threshold and the fail-count Migration summary: * Node qpr2: default_route: migration-threshold=5 fail-count=6 last-failure='Wed Jan 27 14:47:49 2010' *

Re: [Pacemaker] running same resource on each node

2010-01-25 Thread Schaefer, Diane E
On Mon, Jan 25, 2010 at 3:58 PM, Schaefer, Diane E <mailto:diane.schae...@unisys.com> wrote: >> Hi, >> >> I would like to configure a single lsb resource that will run on both >> nodes of my cluster. >Check out: > > http://www.clusterl

[Pacemaker] running same resource on each node

2010-01-25 Thread Schaefer, Diane E
Hi, I would like to configure a single lsb resource that will run on both nodes of my cluster. I am using Pacemaker 1.0.6 and Heartbeat 3.0.2. This resource needs to be always running on each node, but only running once per node. If it stops, I want the cluster to restart it. But only on the

[Pacemaker] automatic reset of failcount?

2010-01-21 Thread Schaefer, Diane E
Hi, We are using Pacemaker 1.0.6 and heartbeat 3.0. We'd like to know if there is a way to configure the failcount for a resource to automatically reset. We have several resources that have a migration-threshold in a two-node cluster. Once the threshold is met, the resource migrates to the o