On 02/15/2012 11:35 AM, James FLatten wrote:
I have cross-connected the HP iLO3 interfaces and set up stonith on each node.

I have figured out the issue and am posting the answer for anyone who runs into this in the future.

It turns out that the HP iLO3 stays online for about 3 seconds after a complete power failure. That is enough time for Pacemaker/stonithd to see that the server is down and continue as planned.

I had added an attribute, delay="10", to one node in an attempt to keep both nodes from shooting each other during a complete loss of cluster communication.

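Removing the delay lets each node see the status of its partner during the few seconds the iLO3 stays reachable after a complete power failure on one node. With the delay in place, the fence request presumably arrived only after the iLO3 had already gone dark.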
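
To make the timing concrete, here is a minimal Python sketch of the race as I understand it. It is only a toy model, not Pacemaker or fence-agent code: the function name and the constant are hypothetical, and the 3-second figure is just the rough value observed above.

```python
# Toy timing model of the race described above. Not Pacemaker code;
# all names are hypothetical and the numbers are approximate.

# Roughly how long the iLO3 keeps answering after the node loses power
# (taken from the observation in this post, not from HP documentation).
ILO_HOLDUP_S = 3.0


def fence_can_confirm(delay_s, ilo_holdup_s=ILO_HOLDUP_S):
    """Return True if the fence request reaches the iLO while it still answers.

    delay_s is how long the fence operation waits before contacting the iLO,
    e.g. 10 for delay="10" or 0 when no delay is configured.
    """
    return delay_s < ilo_holdup_s


if __name__ == "__main__":
    for delay in (0, 10):
        if fence_can_confirm(delay):
            outcome = "iLO still answers, status can be confirmed"
        else:
            outcome = "iLO already dark, status query fails"
        print(f"delay={delay}s: {outcome}")
```

Under these assumptions, any delay longer than the hold-up time means the surviving node can no longer confirm that its partner is off, which matches what I saw with delay="10".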

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org
