Jan Kalcic wrote:
Really interesting. I would have appreciated some more example (they are
always welcome) but still very interesting.

Thanks,
Jan

Dejan Muhamedagic wrote:
Hi,

Trying to make it a bit less mysterious, I wrote something about
fencing and stonith quite a while ago and then forgot to share
the link. Sorry about that.

Here it is:

http://www.clusterlabs.org/mediawiki/images/f/f2/Crm_fencing.pdf

As usual, constructive criticism/suggestions/etc are welcome.
I won't be able to read your impressions for the next two weeks,
but will sure look forward to see them afterwards.

Cheers,

Dejan
_______________________________________________
Linux-HA mailing list
linux...@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems



_______________________________________________
Pacemaker mailing list
Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker


I found this to be informative as well, Dejan - thanks for taking the time to write this. However, I too agree with Jan in that some examples using more recommended non-testing STONITH devices would be great, since SSH, null, and other network-based tests are apparently frowned upon in production environments (based on comments by Andrew and the article here which he referenced: http://theclusterguy.clusterlabs.org/post/113230399/highly-available-data-corruption). For example, I have Raritan 30A PDU's in my cabs, but I didn't see anything in the output of 'stonith -L' except an APC switched rack PDU.


Now I know that a document like this can't be expected to cover every single type of STONITH device in existence, but some instructions on writing custom STONITH plugins might be useful so that folks can write them for their particular STONITH device (PDU or IPMI card or what have you) and contribute back to the community which will in turn help others. I've looked at both the clusterlabs.org and linux-ha.org sites, but didn't see any documentation on rolling your own at either site, and the Novell docs on this topic were GUI-centric which unfortunately aren't as helpful to those of use sticking with the CLI.


The other thing that might be helpful is to know what the goal is in terms of recovering from a STONITH action. If one has a node that STONITH powers off at the PDU outlet because it's lost networking, and then networking is subsequently restored, how are we do get the node back in action?


Thanks and Regards,
Ryan

_______________________________________________
Pacemaker mailing list
Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Reply via email to