I am running a pacemaker/heartbeat cluster on Debian.

Heartbeat is 3.0.4-1 (from wheezy)


from my daemon.log

Apr 17 17:07:07 s1 attrd: [2692]: info: ha_msg_dispatch: Lost connection to
heartbeat service.
Apr 17 17:07:07 s1 stonithd: [2691]: info: ha_msg_dispatch: Lost connection
to heartbeat service.
Apr 17 17:07:07 s1 crmd: [2693]: info: ha_msg_dispatch: Lost connection to
heartbeat service.
Apr 17 17:07:07 s1 cib: [2689]: info: ha_msg_dispatch: Lost connection to
heartbeat service.
Apr 17 17:07:07 s1 ccm: [2688]: ERROR: Lost connection to heartbeat service.
Need to bail out.
Apr 17 17:07:07 s1 cib: [2689]: info: mem_handle_func:IPC broken, ccm is
dead before the client!
Apr 17 17:07:07 s1 cib: [2689]: ERROR: cib_ccm_dispatch: CCM connection
appears to have failed: rc=-1.
Apr 17 17:07:07 s1 cib: [2689]: ERROR: cib_ccm_dispatch: Exiting to recover
from CCM connection failure
Apr 17 17:07:07 s1 attrd: [2692]: info: cib_native_msgready: Lost connection
to the CIB service [2689].
Apr 17 17:07:07 s1 attrd: [2692]: CRIT: cib_native_dispatch: Lost connection
to the CIB service [2689/callback].
Apr 17 17:07:07 s1 attrd: [2692]: CRIT: cib_native_dispatch: Lost connection
to the CIB service [2689/command].
Apr 17 17:07:07 s1 crmd: [2693]: info: mem_handle_func:IPC broken, ccm is
dead before the client!
Apr 17 17:07:07 s1 crmd: [2693]: info: cib_native_msgready: Lost connection
to the CIB service [2689].
Apr 17 17:07:07 s1 crmd: [2693]: CRIT: cib_native_dispatch: Lost connection
to the CIB service [2689/callback].
Apr 17 17:07:07 s1 crmd: [2693]: CRIT: cib_native_dispatch: Lost connection
to the CIB service [2689/command].
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: crmd_cib_connection_destroy:
Connection to the CIB terminated...
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: ccm_dispatch: CCM connection appears
to have failed: rc=-1.
Apr 17 17:07:07 s1 attrd: [2692]: ERROR: attrd_cib_connection_destroy:
Connection to the CIB terminated...
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: do_log: FSA: Input I_ERROR from
crmd_cib_connection_destroy() received in state S_NOT_DC
Apr 17 17:07:07 s1 crmd: [2693]: info: do_state_transition: State transition
S_NOT_DC -> S_RECOVERY [ input=I_ERROR cause=C_FSA_INTERNAL
origin=crmd_cib_connection_destroy ]
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: do_recover: Action A_RECOVER
(0000000001000000) not supported
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: do_log: FSA: Input I_ERROR from
ccm_dispatch() received in state S_RECOVERY
Apr 17 17:07:07 s1 crmd: [2693]: info: do_dc_release: DC role released
Apr 17 17:07:07 s1 crmd: [2693]: info: do_te_control: Transitioner is now
inactive
Apr 17 17:07:07 s1 crmd: [2693]: ERROR: do_log: FSA: Input I_TERMINATE from
do_recover() received in state S_RECOVERY
Apr 17 17:07:07 s1 crmd: [2693]: info: do_state_transition: State transition
S_RECOVERY -> S_TERMINATE [ input=I_TERMINATE cause=C_FSA_INTERNAL
origin=do_recover ]
Apr 17 17:07:07 s1 crmd: [2693]: info: do_shutdown: All subsystems stopped,
continuing


debug log showed some pacemaker stuff:

Apr 17 17:07:07 s1 attrd: [2692]: debug: xmlfromIPC: Peer disconnected
Apr 17 17:07:07 s1 crmd: [2693]: debug: xmlfromIPC: Peer disconnected
Apr 17 17:07:07 s1 crmd: [2693]: debug: s_crmd_fsa: Processing I_ERROR: [
state=S_NOT_DC cause=C_FSA_INTERNAL origin=crmd_cib_connection_destroy ]
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_ERROR
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_DC_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_INTEGRATE_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_FINALIZE_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_RECOVER
Apr 17 17:07:07 s1 crmd: [2693]: debug: s_crmd_fsa: Processing I_ERROR: [
state=S_RECOVERY cause=C_CCM_CALLBACK origin=ccm_dispatch ]
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_ERROR
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_DC_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_DC_RELEASE
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_dc_release: Releasing the role of
DC
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_DC_RELEASED
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_PE_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_TE_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: cib_client_del_notify_callback:
Removing callback for cib_diff_notify events
Apr 17 17:07:07 s1 crmd: [2693]: debug: s_crmd_fsa: Processing I_TERMINATE:
[ state=S_RECOVERY cause=C_FSA_INTERNAL origin=do_recover ]
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_ERROR
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_DC_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_INTEGRATE_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_FINALIZE_TIMER_STOP
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_SHUTDOWN
Apr 17 17:07:07 s1 crmd: [2693]: debug: do_fsa_action: actions:trace:   //
A_LRM_DISCONNECT
Apr 17 17:07:07 s1 crmd: [2693]: debug: verify_stopped: Checking for active
resources before exit



I have the core dump from the heartbeat process.
Where should i send it?


Thanks for any help

Mark P
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to