On Fri, Jul 30, 2010 at 8:38 AM, Thomas Guthmann <tguthm...@iseek.com.au> wrote: > Re, > >> [..] I can provide a hb_report if necessary. > See in attachment a report for the simple config below. Note that I dumbly > erased the conf before doing the report but I paste it below. > > I've done a very simple cluster. 2 nodes running a dummy resource with 2 > cloned pings testing a virtual IP that I activated or deactivated for the > test. > > 11:20AM : cluster is up and running > 11:25AM : shutdown the IP > 11:30AM : force a refresh with attrd_updater (because pingd=1 still) > It doesn't change anything still seen as up... > 11:37AM : change a value in the CIB dampen from 120 to 121 for instance > Now db2 pingd is null but db1 is still 1. crm changes have > been done on db2 - dunno if it's linked. > 11:40AM : start the IP again > 12:00AM : IP is still seen as down... > > primitive dummy ocf:pacemaker:Dummy > primitive ping ocf:pacemaker:ping \ > params host_list="IP.TO.TE.ST" dampen="121" attempts="3" debug="1" > clone CONNECTIVITY ping > location rule-connectivity dummy \ > rule $id="rule-ping" -inf: not_defined pingd or pingd number:lte 0 > property $id="cib-bootstrap-options" \ > dc-version="1.0.9-89bd754939df5150de7cd76835f98fe90851b677" \ > cluster-infrastructure="openais" \ > expected-quorum-votes="2" \ > no-quorum-policy="ignore" \ > pe-warn-series-max="2880" \ > stonith-enabled="false" > rsc_defaults $id="rsc_defaults-options" \ > resource-stickiness="1" > > Thomas
It certainly looks like its working... Jul 30 11:37:50 db1.icare.appnet.iseek.com.au ping[11699]: WARNING: 202.83.64.201 is inactive: PING 202.83.64.201 (202.83.64.201) 56(84) bytes of data.#012#012--- 202.83.64.201 ping statistics ---#0123 packets transmitted, 0 received, +1 errors, 100% packet loss, time 2000ms Followed by Jul 30 11:37:50 db1.icare.appnet.iseek.com.au attrd_updater: [11718]: info: Invoked: attrd_updater -n pingd -v 0 -d 121 Jul 30 11:39:51 db1.icare.appnet.iseek.com.au attrd: [3936]: info: attrd_trigger_update: Sending flush op to all hosts for: pingd (0) Jul 30 11:39:51 db1.icare.appnet.iseek.com.au attrd: [3936]: info: attrd_perform_update: Sent update 65: pingd=0 Alas there is no debug running so I can't say for sure that the call returned, but this makes it pretty likely: Jul 30 11:39:51 db1.icare.appnet.iseek.com.au crmd: [3938]: info: abort_transition_graph: te_update_diff:146 - Triggered transition abort (complete=1, tag=transient_attributes, id=db1.icare.appnet.iseek.com.au, magic=NA, cib=0.183.11) : Transient attribute: update _______________________________________________ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker