On 2011-10-13 12:38, Thomas wrote: > Hello, > > I am using 3-node corosync/pacemaker cluster setup. Repeatedly one of > the nodes refuses to join the cluster. Here is a snippet from the log file: > > Oct 13 12:34:03 sh2 crmd: [2292]: info: crm_timer_popped: Welcomed: 1, > Integrated: 0 > Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State > transition S_INTEGRATION -> S_FINALIZE_JOIN [ input=I_INTEGRATED > cause=C_TIMER_POPPED origin=crm_timer_popped ] > Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_state_transition: Progressed > to state S_FINALIZE_JOIN after C_TIMER_POPPED > Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_state_transition: 1 cluster > nodes failed to respond to the join offer. > Oct 13 12:34:03 sh2 crmd: [2292]: info: ghash_print_node: Welcome > reply not received from: sh2 6 > Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_log: FSA: Input I_ELECTION_DC > from do_dc_join_finalize() received in state S_FINALIZE_JOIN > Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State > transition S_FINALIZE_JOIN -> S_INTEGRATION [ input=I_ELECTION_DC > cause=C_FSA_INTERNAL origin=do_dc_join_finalize ] > Oct 13 12:34:03 sh2 crmd: [2292]: info: do_dc_join_offer_all: join-7: > Waiting on 1 outstanding join acks > > Any idea what I should look after? > > Networking (both rings) seems to work just fine.
"Seems to"? Have you confirmed with "corosync-cfgtool -s" on all nodes? > Versions used are: > corosync 1.2.1-4 & pacemaker 1.0.9.1+hg15626-1 from current version of > debian squeeze. Please upgrade to the versions in squeeze-backports at the earliest convenience. Cheers, Florian -- Need help with Corosync? http://www.hastexo.com/now _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
