On 28/08/13 13:13, Xiaomin Zhang wrote:
Hi, Gurus:
I've a simple master-slave setup for a mirrored DRBD storage: This
storage is written by a daemon Java application server to produce
transaction data.
node Lhs072gkz \
attributes standby="on"
node Lpplj9jb4
node Lvoim0kaw
primitive drbd1 ocf:linbit:drbd \
params drbd_resource="r0" \
op monitor interval="15s"
ms ms_drbd1 drbd1 \
meta master-max="1" master-node-max="1" clone-max="2"
clone-node-max="1" notify="true" target-role="Started"
location drbd-fence-by-handler-ms_drbd1 ms_drbd1 \
rule $id="drbd-fence-by-handler-rule-ms_drbd1" $role="Master"
-inf: #uname ne
Lpplj9jb4
It seems Split-Brains is very likely to happen when I reboot the slave
machine even the Java application is just writing nothing on the DRBD
storage.
Is this an expected behavior?
And I found some topics about automatically recover from Split-Brain for
DRBD () It just says to put some configurations in DRBD, all things
should work. Is this a good practice?
Thanks.
No, split-brains are not at all expected behaviour, but they happen when
things are not setup properly.
The best thing to do is to avoid a split-brain in the first place, which
is easy to do if you setup (working) stonith/fencing.
If you configure stonith in pacemaker using IPMI (the most common
method) and test it to make sure nodes reboot on failure, you can then
"hook" drbd into pacemaker's fencing. You do this by setting the fence
policy to "resource-and-stonith" and then tell DRBD to use the
"crm-fence-peer.sh" fence handler.
This tells DRBD that, if the peer fails (or vanishes), to block IO and
call a fence. The fence handler is then invoked which calls pacemaker
and says "please fence node X". When pacemaker succeeds, it will tell
the handler which in turn tells DRBD that it's now safe to resume IO.
One of the nodes will be dead so you will avoid the split-brain in the
first place.
If your servers have IPMI, iLO, iDRAC, RSA, etc, you can use the
'fence_ipmilan' fence agent in your pacemaker configuration. If you need
help with this, just say.
Cheers
digimer
--
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without
access to education?
_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org