Hello,
I am using 3-node corosync/pacemaker cluster setup. Repeatedly one of
the nodes refuses to join the cluster. Here is a snippet from the log file:
Oct 13 12:34:03 sh2 crmd: [2292]: info: crm_timer_popped: Welcomed: 1,
Integrated: 0
Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State
transition S_INTEGRATION -> S_FINALIZE_JOIN [ input=I_INTEGRATED
cause=C_TIMER_POPPED origin=crm_timer_popped ]
Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_state_transition: Progressed
to state S_FINALIZE_JOIN after C_TIMER_POPPED
Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_state_transition: 1 cluster
nodes failed to respond to the join offer.
Oct 13 12:34:03 sh2 crmd: [2292]: info: ghash_print_node: Welcome
reply not received from: sh2 6
Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_log: FSA: Input I_ELECTION_DC
from do_dc_join_finalize() received in state S_FINALIZE_JOIN
Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State
transition S_FINALIZE_JOIN -> S_INTEGRATION [ input=I_ELECTION_DC
cause=C_FSA_INTERNAL origin=do_dc_join_finalize ]
Oct 13 12:34:03 sh2 crmd: [2292]: info: do_dc_join_offer_all: join-7:
Waiting on 1 outstanding join acks
Any idea what I should look after?
Networking (both rings) seems to work just fine. Versions used are:
corosync 1.2.1-4 & pacemaker 1.0.9.1+hg15626-1 from current version of
debian squeeze.
Any hint would be appreciated,
Thomas
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems