Hello all,

Quick question here today. Please forgive me if this has been answered, I have 
searched for a couple days and not been able to come up with the answer. I am 
working on a standard 2 node cluster using DRBD and I have my resources in a 
group. All in working well, but my question has to do with what happens when 
there is a problem with an individual service. Consider the following example 
using heartbeat (3.0.1-1) drbd (8.3.6) and pacemaker (1.0.6):

Cluster with one reosurce group which contains these resources in this order: 
   
   -drbd master/slave
   -virtual file system
   -openvpn
   -samba
   -apache webserver
   -cupsd

Problem I am running into is if there is a problem with openvpn in this example 
(VPN goes down and keys are missing so it CANNOT restart without intervention), 
watching the cluster with crm_mon, I see that all the services under openvpn in 
order (samba,apache, cupsd) will all starta "rolling restart". In other words, 
I see openvpn fail, then samba goes down, then apache goes down, then cups goes 
down. Next cups comes up, apache comes up, samba comes up, then openvpn tries 
to start but fails so the progress starts over - smba, apache and cups stop 
then start again. What I end up with is a system where those last 3 services 
which runs fine alone keep coming up then going down again, over and over. Only 
way I can change this is to fix the openvpn issue, then things restart and stay 
restarted.

My question is: is this normal (expected) behavior? If so how do I change this? 
I have tried several on-fail options in the monitors for those services (tried: 
stop, restart, and block) but this doesn't change the behavior. I would like to 
just have the one service stop without affecting the others. Do I need to 
re-think using a resource group?? Any assistance would be greatly appreciated. 
The pacemaker site has a lot of documentation but it's not the clearest 
explainations at times.

-Thanks

Kenneth M DeChick
Linux Systems Administrator
Community Computer Service, Inc.
(315)-255-1751 ext154
http://www.medent.com
k...@medent.com
Registered Linux User #497318
-- -- -- -- -- -- -- -- -- -- --
"You canna change the laws of physics, Captain; I've got to have thirtyminutes! 
"

.
 
This message has been scanned for viruses and dangerous content by MailScanner, 
SpamAssassin &nbsp;&amp; ClamAV. <BR>
 <BR>
This message and any attachments may contain information that is protected by 
law as privileged and confidential, and is transmitted for the sole use <BR>
of the intended recipient(s). If you are not the intended recipient, you are 
hereby notified that any use, dissemination, copying or retention of this 
e-mail <BR>
or the information contained herein is strictly prohibited. If you received 
this e-mail in error, please immediately notify the sender by e-mail, and 
permanently <BR>
delete this e-mail. <BR>

_______________________________________________
Pacemaker mailing list
Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Reply via email to