Re: [Pacemaker] continue starting chain with failed group resources

Patrick H. Tue, 14 Dec 2010 17:22:56 -0800


Sent: Tue Dec 14 2010 11:37:06 GMT-0700 (Mountain Standard Time)
From: Dejan Muhamedagic <deja...@fastmail.fm>
To: The Pacemaker cluster resource manager <pacemaker@oss.clusterlabs.org>

Subject: Re: [Pacemaker] continue starting chain with failed group resources

Hi,

On Mon, Dec 13, 2010 at 10:43:36PM -0700, Patrick H. wrote:

After tinkering with this for a few hours I finally have something working.

colocation co-raid inf: ( md_raid iscsi_1 iscsi_2 iscsi_3 )


This should be a no-op. You'd want something like this, I think:

colocation co-raid inf: md_raid ( iscsi_1 iscsi_2 iscsi_3 )

No, that makes the md_raid service depend on all the iscsi services
being started, which I don't want.

order or-raid 0: ( iscsi_1 iscsi_2 iscsi_3 ) md_raid

Got rid of the group, changed the score on the order to 0, and
changed the grouping of both the colocation and order. This
*appears* to function as intended, but if anyone can point out any
pitfalls I'd appreciate it

-Patrick

Sent: Mon Dec 13 2010 21:12:04 GMT-0700 (Mountain Standard Time)
From: Patrick H. <pacema...@feystorm.net>
To: The Pacemaker cluster resource manager <pacemaker@oss.clusterlabs.org>
Subject: [Pacemaker] continue starting chain with failed group resources

Is there a way to continue down a chain of starting resources once
a previous resource has tried to start, no matter if the try was
successful or not?


No, that's currently not possible to express. I think that you
should take the iSCSI resources out of the cluster and let them
start on boot _before_ the cluster manager. If there are not
enough disks, then the md_raid resource is going to fail.

Can't do that either. When the node that is currently using the iscsi
services fails, they have to be migrated over to another host so it can
assemble them into a raid array. If they're not being managed by
pacemaker, they won't migrate.

I made a few more tweaks to the configuration I posted earlier and it
seems to work pretty well, with only one exception.

colocation co-raid inf: ( md_raid iscsi_1 iscsi_2 iscsi_3 )

order or-raid_start 0: ( iscsi_1:start iscsi_2:start iscsi_3:start ) md_raid:start
order or-raid_stop inf: md_raid:stop ( iscsi_1:stop iscsi_2:stop iscsi_3:stop )

That makes it so that when they start up, they start in order, but it
isn't required that every iscsi start before md_raid, just that they try
to start. Then when they stop, it's mandatory that they stop in that
order, so that no iscsi service will stop while md_raid is still running.
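To make the intended behaviour concrete, here is a minimal Python sketch of what those two order constraints are meant to achieve. It is only a toy model of the behaviour described above, not Pacemaker code or the policy engine's actual algorithm; the function names (start_chain, stop_order, raid5_can_assemble) are made up for illustration, and it assumes md_raid succeeds whenever RAID-5 can still assemble, i.e. at most one member is missing.

```python
from dataclasses import dataclass


@dataclass
class StartResult:
    attempted: list  # resources whose start was attempted, in order
    started: set     # resources actually running afterwards


def raid5_can_assemble(total_members: int, available_members: int) -> bool:
    """RAID-5 needs at least 3 members and tolerates losing exactly one."""
    return total_members >= 3 and available_members >= total_members - 1


def start_chain(iscsi_ok: dict) -> StartResult:
    """Toy model of the advisory (score 0) start ordering.

    iscsi_ok maps a hypothetical resource name to whether its start
    succeeds. Every iSCSI start is attempted first; the md_raid start
    is attempted afterwards no matter how many of them failed.
    """
    attempted, started = [], set()
    for name, ok in iscsi_ok.items():
        attempted.append(name)
        if ok:
            started.add(name)
    # md_raid is tried after all iSCSI attempts, not after all successes.
    attempted.append("md_raid")
    if raid5_can_assemble(len(iscsi_ok), len(started)):
        started.add("md_raid")
    return StartResult(attempted, started)


def stop_order(running: set) -> list:
    """Toy model of the mandatory (inf) stop ordering.

    md_raid is always stopped before any iSCSI resource.
    """
    first = ["md_raid"] if "md_raid" in running else []
    return first + sorted(running - {"md_raid"})
```

Calling start_chain({"iscsi_1": True, "iscsi_2": False, "iscsi_3": True}) models one failed iSCSI start: md_raid is still attempted and, with two of three members present, comes up degraded. With two failed members md_raid is still attempted but does not start, so anything ordered after it with a mandatory constraint would stay down.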

The exception I mentioned is a bug in the policy engine, bug 2435. The
policy engine allows resources within a colocation set to start on
different nodes. So if I were to stop one of the iscsi services and then
start it again, it might start on a different node. Unless this bug gets
fixed soon, I'll probably modify the iscsi script so that all the iscsi
devices are under one resource.

Thanks,

Dejan

I've got 3 iSCSI resources which are in a group, and then an md
raid-5 array as another resource. I have the raid array resource
set to start after the group with a colocation rule, but it will
only start if the whole group comes up. Since this is raid-5, we
can obviously handle some disk failure and start up anyway. So how
do I get it to try to start it up once all the iSCSI resources
have tried to start? Went looking through the docs and didn't find
anything.

Note: there will be other resources in the chain (like mounting
the filesystem) that I don't want to try and start if the raid
array resource didn't start.
------------------------------------------------------------------------

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker

