Re: [Pacemaker] How to migrate a resource group

2009-11-18 Thread Rasto Levrinc
On Thu, November 19, 2009 1:55 am, lepace wrote: > Hi, > I want to migrate a resource group manually, but the command tells me to migrate > a resource, not a resource group. How can I migrate a resource group? How do you migrate it? This command should still work: crm_resource -r $GROUP -H $HOST --migra
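The reply above points at crm_resource with a group id in place of a primitive id. A minimal sketch of that idea follows, assuming the truncated option corresponds to the short flag -M, and using a hypothetical helper name (migrate_group) plus a DRY_RUN switch that is not part of Pacemaker:

```shell
#!/bin/sh
# Hypothetical helper: migrate a whole resource group to a given host.
# Assumption: -M is the short form of the migrate option of crm_resource;
# -r and -H are taken from the command quoted in the reply.
migrate_group() {
    group=$1
    host=$2
    if [ -z "$group" ] || [ -z "$host" ]; then
        echo "usage: migrate_group GROUP HOST" >&2
        return 2
    fi
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Only print the command, so it can be reviewed before running it.
        echo "crm_resource -M -r $group -H $host"
    else
        crm_resource -M -r "$group" -H "$host"
    fi
}
```

With DRY_RUN=1 set, calling migrate_group with a group id and a node name only prints the command line. Migration of this kind generally works by adding a location constraint, so it is worth checking the configuration afterwards.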

[Pacemaker] How to migrate a resource group

2009-11-18 Thread lepace
Hi, I want to migrate a resource group manually, but the command tells me to migrate a resource, not a resource group. How can I migrate a resource group? ___ Pacemaker mailing list Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacem

Re: [Pacemaker] Error starting Apache on 2 nodes cluster

2009-11-18 Thread Luke Bigum
Angie, I can't tell exactly what you've provided. Can you post your CRM configuration (the output of 'crm configure show')? While you're at it, also provide 'crm_verify -LV' and 'crm_mon -fo1'. This looks suspicious though: Nov 19 01:25:08 test2 crmd: [24251]: info: process_lrm_event: LRM o

[Pacemaker] Error starting Apache on 2 nodes cluster

2009-11-18 Thread Angie T. Muhammad
Hello, I'm a pacemaker and openais beginner. I followed the document 'cluster from scratch' and successfully managed to create and monitor the 'ClusterIP' and 'LoadBalancer' resources. But whenever I try to start Apache: # crm configure primitive WebSite ocf:heartbeat:apache params configfile=/etc

Re: [Pacemaker] getting started

2009-11-18 Thread Frank DiMeo
Sorry, the permissions were the problem, I think. It looks like the hacluster user is actually the writer of these files. Thanks for your help, hj. -Frank From: Frank DiMeo [mailto:frank.di...@bigbandnet.com] Sent: Wednesday, November 18, 2009 5:17 PM To

Re: [Pacemaker] getting started

2009-11-18 Thread Frank DiMeo
Well, I set the permissions to 755, but still nothing is stored in /var/lib/heartbeat/crm on a running node. -Frank From: Frank DiMeo [mailto:frank.di...@bigbandnet.com] Sent: Wednesday, November 18, 2009 5:09 PM To: pacemaker@oss.clusterlabs.org Subject: Re:

Re: [Pacemaker] ocf:heartbeat:pingd receives wrong value in $__ACTION

2009-11-18 Thread hj lee
Hi again, I found what is wrong. I should use "monitor" as the operation name in the crm command. I thought it was an id, but realized today that it becomes the operation name directly. Thanks hj On Wed, Nov 18, 2009 at 3:04 PM, hj lee wrote: > Hi, > > Sorry for long delay. This is my pingd configuration. > > primit
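The fix described above amounts to writing the operation type ("monitor") directly after "op" in the crm shell definition. A minimal sketch, using the values quoted elsewhere in this thread; the helper name pingd_primitive is hypothetical, and the parameter is spelled "multiplier" here on the assumption that this is the name the pingd agent expects:

```shell
#!/bin/sh
# Hypothetical helper: print a pingd primitive whose operation is named
# "monitor", as the crm shell expects, instead of a free-form id.
# The gateway address is a placeholder supplied by the caller.
pingd_primitive() {
    gateway=${1:-my-gateway-ip}
    cat <<EOF
primitive pingd-res ocf:heartbeat:pingd \\
    params dampen="5s" multiplier="2" host_list="$gateway" \\
    op monitor interval="1s" timeout="10s" \\
    meta target-role="started"
EOF
}
```

The printed text is meant to be reviewed and then entered at the crm configure prompt. The interval and timeout values are copied from the thread and are not a recommendation.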

Re: [Pacemaker] getting started

2009-11-18 Thread Frank DiMeo
Sorry, I just noticed that the directories under /var/lib/heartbeat/ have permissions of 750...not good. This probably explains the persistence problem. -Frank From: Frank DiMeo [mailto:frank.di...@bigbandnet.com] Sent: Wednesday, November 18, 2009 5:04

Re: [Pacemaker] getting started

2009-11-18 Thread Frank DiMeo
Maybe the permissions and ownership of /var/lib/heartbeat/XXX directories are wrong on my system? On my nodes, the owner of all directories under /var/lib/heartbeat is root/root and the permissions are 755 -Frank From: hj lee [mailto:kerd...@gmail.com]

Re: [Pacemaker] ocf:heartbeat:pingd receives wrong value in $__ACTION

2009-11-18 Thread hj lee
Hi, Sorry for the long delay. This is my pingd configuration. primitive pingd-res ocf:heartbeat:pingd \ params dampen="5s" multiplixer="2" host_list="my-gateway-ip" \ op pingd-monitor-op interval="1s" timeout="10s" \ meta target-role="started" clone pingd-clone pingd-res \

Re: [Pacemaker] getting started

2009-11-18 Thread Frank DiMeo
Again, thanks. 1) When I restart all the nodes in my cluster, I don't see my previous configuration come back, which is why I was wondering about the persistence. The way I restart them is by running /etc/init.d/openais stop on both nodes, then running /etc/init.d/openais start on bot

Re: [Pacemaker] getting started

2009-11-18 Thread Frank DiMeo
Resending without the mangled grammar-- Thanks for your response, some comments: 1) So, if we cannot load a persistent configuration after a restart of all the nodes in a cluster, how does one automatically configure a (rebooted) unattended cluster with a known set of resources? I

Re: [Pacemaker] getting started

2009-11-18 Thread hj lee
On Wed, Nov 18, 2009 at 2:27 PM, Frank DiMeo wrote: > Thanks for your response, some comments: > > > > 1) So, if we cannot load a persistent configuration after a restart > of all the nodes in the cluster, how does one automatically configure a > (rebooted) unattended cluster with a known

Re: [Pacemaker] Starting a cloned resource only on nodes running another resource

2009-11-18 Thread Ashley Winters
Andrew Beekhof wrote, > > On Wed, Nov 18, 2009 at 12:11 AM, Ashley Winters > wrote: > > I want to start a single instance of resource A on any node which is > > running one or more instances of resource B. And, I don't want to stop > > resource A until there are no more instances of resource B on

Re: [Pacemaker] getting started

2009-11-18 Thread Frank DiMeo
Thanks for your response, some comments: 1) So, if we cannot load a persistent configuration after a restart of all the nodes in the cluster, how does one automatically configure a (rebooted) unattended cluster with a known set of resources? Is it expected that a person will reconfig

Re: [Pacemaker] getting started

2009-11-18 Thread hj lee
> 1) Is there a way to start pacemaker and specify an xml file as the startup > configuration? Right now, I start openais/pacemaker, wait for DC election, > then run cibadmin to load the file. That's fine for now, but I want to > automate this eventually. > I don't think it's possible. > 2) Is

[Pacemaker] getting started

2009-11-18 Thread Frank DiMeo
I've been working with pacemaker/openais for a few weeks now and have several "getting started" questions, so I'll put them all into one email: 1) Is there a way to start pacemaker and specify an xml file as the startup configuration? Right now, I start openais/pacemaker, wait for DC election,
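The manual sequence described in question 1 (start the stack, wait for DC election, load the file with cibadmin) could be scripted roughly as below. This is a sketch under stated assumptions, not a tested procedure: the function names, the retry count and the POLL_SECONDS variable are hypothetical, and it assumes that crmadmin -D prints a line containing "Designated Controller is" once a DC has been elected.

```shell
#!/bin/sh
# wait_for_dc [TRIES]: poll until a DC is reported, or give up.
wait_for_dc() {
    tries=${1:-30}
    while [ "$tries" -gt 0 ]; do
        # Assumption: this text appears in crmadmin -D output once a DC exists.
        if crmadmin -D 2>/dev/null | grep -q "Designated Controller is"; then
            return 0
        fi
        tries=$((tries - 1))
        sleep "${POLL_SECONDS:-2}"
    done
    return 1
}

# load_cib FILE [TRIES]: replace the CIB with FILE once a DC is elected.
load_cib() {
    xml_file=$1
    if [ ! -r "$xml_file" ]; then
        echo "cannot read $xml_file" >&2
        return 2
    fi
    if ! wait_for_dc "${2:-30}"; then
        echo "no DC elected, not loading $xml_file" >&2
        return 1
    fi
    cibadmin --replace --xml-file "$xml_file"
}
```

A call such as load_cib followed by the path to the XML file could then run from an init script after openais has started. Whether this is needed at all depends on whether the configuration persists across restarts, which is what the replies in this thread go on to discuss.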

Re: [Pacemaker] **** SPAM **** Re: pacemaker-1.0.6 + corosync 1.1.2 crashing

2009-11-18 Thread Nikola Ciprich
> The cores indicate that the crash occurred while connecting to corosync right? yes > Where did all the cluster packages come from? I've packaged those myself, all are based on clean sources without any additional patches.

Re: [Pacemaker] Starting a cloned resource only on nodes running another resource

2009-11-18 Thread Andrew Beekhof
On Wed, Nov 18, 2009 at 12:11 AM, Ashley Winters wrote: > I want to start a single instance of resource A on any node which is running > one or more instances of resource B. And, I don't want to stop resource A > until there are no more instances of resource B on that node. I'm using > pacemake

Re: [Pacemaker] **** SPAM **** Re: pacemaker-1.0.6 + corosync 1.1.2 crashing

2009-11-18 Thread Nikola Ciprich
I got the same: root 20715 0.0 0.2 175696 4180 ? Ssl 14:46 0:00 /usr/sbin/corosync But I think that your suspicion about user is right, pacemaker process cores are still appearing at /var/lib/heartbeat/cores/hacluster instead of /var/lib/heartbeat/cores/root so I guess corosync

Re: [Pacemaker] **** SPAM **** Re: pacemaker-1.0.6 + corosync 1.1.2 crashing

2009-11-18 Thread Andrew Beekhof
On Wed, Nov 18, 2009 at 3:00 PM, Nikola Ciprich wrote: > I got the same: > root     20715  0.0  0.2 175696  4180 ?        Ssl  14:46   0:00 > /usr/sbin/corosync > > But I think that Your suspicion about user is right, pacemaker process cores > are still appearing at /var/lib/heartbeat/cores/haclu

Re: [Pacemaker] pacemaker-1.0.6 + corosync 1.1.2 crashing

2009-11-18 Thread Andrew Beekhof
On Nov 18, 2009, at 10:32 AM, Nikola Ciprich wrote: ok, but then how do I set it for corosync? just adding aisexec { user: root group: root } The above is what I have. if you run: ps axfu | grep coro do you get something like this: root 29024 0.3 0.1 465000 4348
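The check suggested above (grepping the ps output for corosync and reading the user column) can be wrapped in a small function. This is a sketch only: the name corosync_user is hypothetical, and it uses the POSIX ps -e -o form rather than the "ps axfu" invocation quoted in the message.

```shell
#!/bin/sh
# Hypothetical helper: print the user that the corosync daemon runs as.
# Prints nothing if no corosync process is found.
corosync_user() {
    # Columns: user, then the full command line. Match on the first word
    # of the command line so that "grep corosync" itself is never counted.
    ps -e -o user= -o args= |
        awk '$2 ~ /(^|\/)corosync$/ { print $1; exit }'
}
```

If the aisexec { user: root group: root } setting discussed here is in effect, the function would be expected to print root.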

Re: [Pacemaker] how to correctly identify when a "Resource xxx-n cannot run anywhere"

2009-11-18 Thread Andrew Beekhof
On Wed, Nov 18, 2009 at 8:53 AM, Tim Serong wrote: > On 11/18/2009 at 08:13 AM, Димитър Бойн wrote: >> Hi, >> I would love to be able to identify when a certain resource cannot run >> anywhere in a cluster anymore. >> A valid scenario would be when my cluster loses quorum -I want to leave it >> "f

Re: [Pacemaker] pacemaker-1.0.6 + corosync 1.1.2 crashing

2009-11-18 Thread Andrew Beekhof
On Wed, Nov 18, 2009 at 9:16 AM, Nikola Ciprich wrote: > Hi, > well, I'm a bit confused now :) > I have an identical configuration working on my testing virtual machines, so it > doesn't seem to be a problem there. Furthermore, corosync.conf manpage doesn't > mention any directives for setting user/g

Re: [Pacemaker] pacemaker-1.0.6 + corosync 1.1.2 crashing

2009-11-18 Thread Nikola Ciprich
Hi, well, I'm a bit confused now :) I have an identical configuration working on my testing virtual machines, so it doesn't seem to be a problem there. Furthermore, corosync.conf manpage doesn't mention any directives for setting user/group and "aisexec" directive is not mentioned there either. I tri