On Wed, Dec 12, 2012 at 11:51 AM,  <laurent+pacema...@u-picardie.fr> wrote:
>
> Hi,
>
> I've just observed something weird.
> A node is running a stonith resource whose gethosts returns an empty
> node list, yet the output of stonith_admin -l still includes it in the
> device list!
>
> Result of "stonith_admin -l elasticsearch-05" run from
> elasticsearch-06:
>  stonith-xen-peatbull
>  stonith-xen-eddu
> 2 devices found
>
> stonith-xen-peatbull is a correct fencing device
> stonith-xen-eddu is a fencing device with an empty hostlist
>
> Running "my-xen0 gethosts" by hand with the stonith-xen-eddu params
> returns no hosts, yet it exits with 0. (Is it correct to return 0
> with an empty host list?)
>
> Logs:
> Dec 12 01:09:10 elasticsearch-06 stonith-ng[18181]:   notice: stonith_device_register: Added 'stonith-cluster-xen' to the device list (6 active devices)
> Dec 12 01:09:10 elasticsearch-06 attrd[18183]:   notice: attrd_trigger_update: Sending flush op to all hosts for: probe_complete (true)
> Dec 12 01:09:10 elasticsearch-06 attrd[18183]:   notice: attrd_perform_update: Sent update 5: probe_complete=true
> Dec 12 01:09:11 elasticsearch-06 stonith-ng[18181]:   notice: stonith_device_register: Added 'stonith-xen-eddu' to the device list (6 active devices)
> Dec 12 01:09:11 elasticsearch-06 stonith-ng[18181]:   notice: stonith_device_register: Added 'stonith-xen-peatbull' to the device list (6 active devices)
> Dec 12 01:09:12 elasticsearch-06 stonith: [18434]: info: external/my-xen0-ha device OK.
> Dec 12 01:09:12 elasticsearch-06 crmd[18185]:   notice: process_lrm_event: LRM operation stonith-cluster-xen_start_0 (call=61,rc=0, cib-update=27, confirmed=true) ok
> Dec 12 01:09:14 elasticsearch-06 stonith: [18465]: info: external_run_cmd: '/usr/lib/stonith/plugins/external/my-xen0 status' output: elasticsearch-05
> Dec 12 01:09:14 elasticsearch-06 stonith: [18465]: info: external_run_cmd: '/usr/lib/stonith/plugins/external/my-xen0 status' output: elasticsearch-06
> Dec 12 01:09:15 elasticsearch-06 stonith: [18465]: info: external/my-xen0 device OK.
> Dec 12 01:09:15 elasticsearch-06 crmd[18185]:   notice: process_lrm_event: LRM operation stonith-xen-peatbull_start_0 (call=68, rc=0, cib-update=28, confirmed=true) ok
> Dec 12 01:09:15 elasticsearch-06 stonith: [18458]: info: external/my-xen0 device OK.
> Dec 12 01:09:15 elasticsearch-06 crmd[18185]:   notice: process_lrm_event: LRM operation stonith-xen-eddu_start_0 (call=66, rc=0, cib-update=29, confirmed=true) ok
> Dec 12 01:12:34 elasticsearch-06 stonith-ng[18181]:   notice: dynamic_list_search_cb: Disabling port list queries for stonith-xen-kornog (1): (null)
> Dec 12 01:12:34 elasticsearch-06 stonith-ng[18181]:   notice: dynamic_list_search_cb: Disabling port list queries for stonith-xen-nikka (1): (null)
> Dec 12 01:12:34 elasticsearch-06 stonith-ng[18181]:   notice: dynamic_list_search_cb: Disabling port list queries for stonith-xen-yoichi (1): (null)
> Dec 12 01:12:34 elasticsearch-06 stonith: [19301]: CRIT: external_hostlist: 'my-xen0 gethosts' returned an empty hostlist
> Dec 12 01:12:34 elasticsearch-06 stonith: [19301]: ERROR: Could not list hosts for external/my-xen0.
> Dec 12 01:12:37 elasticsearch-06 stonith: [19332]: CRIT: external_hostlist: 'my-xen0 gethosts' returned an empty hostlist
> Dec 12 01:12:37 elasticsearch-06 stonith: [19332]: ERROR: Could not list hosts for external/my-xen0.
> Dec 12 01:12:37 elasticsearch-06 stonith-ng[18181]:   notice: dynamic_list_search_cb: Disabling port list queries for stonith-xen-eddu (1): failed:  255
>
> David, I mentioned a node being wrongly fenced in the "stonith-timeout
> duration 0 is too low" bug. Could it be related?

Doubtful. What does your config look like?
IIRC, these agents want to be told which machines they can fence.
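
For illustration only, here is a minimal sketch of how the "gethosts" branch of an external plugin could treat its host list. It is not the actual my-xen0 code. It assumes the agent is handed its targets in an environment variable named "hostlist" (comma- or whitespace-separated); that name, the gethosts function, and the choice to exit non-zero on an empty list are all assumptions. The logs above do show the stonith layer already reporting an empty hostlist as CRIT, so failing early in the agent would at least make the misconfiguration visible when running it by hand.

```shell
# Hypothetical sketch of a "gethosts" handler for an external STONITH plugin.
# Assumption: targets arrive in the "hostlist" environment variable,
# separated by commas and/or whitespace. The real agent may differ.
gethosts() {
    # Turn commas, spaces and tabs into newlines, then drop empty lines.
    hosts=$(printf '%s\n' "${hostlist:-}" | tr ', \t' '\n\n\n' | sed '/^$/d')

    if [ -z "$hosts" ]; then
        # Nothing to fence: report it and fail instead of returning 0.
        echo "gethosts: no hosts configured" >&2
        return 1
    fi

    # One host name per line on stdout.
    printf '%s\n' "$hosts"
}
```

With hostlist="elasticsearch-05, elasticsearch-06" the function prints the two node names on separate lines; with an empty or unset hostlist it prints nothing on stdout and returns 1. On the Pacemaker side, the fencing resource can also be given a pcmk_host_list so that stonith-ng knows statically which nodes the device is able to fence.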

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org
