Re: [Pacemaker] Unable to stop Multi state resource

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 12:34 PM, Rakesh K wrote: > Rakesh K writes: > > > Hi Andrew > > FSR is a File system replication script which adheres to ocf cluster frame > work, > the script is similar to Mysql ocf script, which is a multi state resource, > where in master  ssh server would be running

Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 12:40 PM, Rakesh K wrote: > Andrew Beekhof writes: > >> >> There is nothing in this config that requires tomcat2 to be stopped. >> >> Perhaps: >>    colocation Tomcat2-with-Tomcat inf: Tomcat1 Tomcat2VIP >> was intended to be: >>    colocation Tomcat2-with-Tomcat inf: Tomc

Re: [Pacemaker] crm : unknown expected votes

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 3:37 PM, wrote: > Hi, > > I created a 2-node cluster using Pacemaker on Fedora 14 (2.6.35.6-45.fc14.x86_64). > > I have two errors that I am not able to resolve. > > Can someone help me resolve these errors? > > 1) It always shows "unknown

Re: [Pacemaker] Question of the syslog output in pacemaker-1.1

2011-04-19 Thread Yuusuke IIDA
Hi, Andrew (2011/04/19 18:13), Andrew Beekhof wrote: On Tue, Apr 19, 2011 at 9:25 AM, Yuusuke IIDA wrote: Hi, Andrew I use corosync-1.3.0 and Pacemaker-1.1.5. The log outputs it via rsyslog. I changed syslog_facility of corosync.conf to local1 and was going to let a designated file output

Re: [Pacemaker] Resources won't start

2011-04-19 Thread Phil Hunt
>>Did it start? No, here is the output, all resources kind of went away. That's what I've been fighting all day. Last updated: Tue Apr 19 13:52:18 2011 Stack: openais Current DC: CentClus2 - partition with quorum Version: 1.0.10-da7075976b5ff0bee71074385f8fd02f296ec8a3 2 Nodes

Re: [Pacemaker] Resources won't start

2011-04-19 Thread mark - pacemaker list
Hi Phil, On Tue, Apr 19, 2011 at 3:36 PM, Phil Hunt wrote: > Hi > I have iscsid running, no iscsi. Good. You don't want the system to auto-connect the iSCSI disks on boot, pacemaker will do that for you. > > > > Here is the crm status: > > Last updated: Tue Apr 19 12:39:03 2011 >

[Pacemaker] Resources won't start

2011-04-19 Thread Phil Hunt
Hi I've been having a lot of problems figuring out a problem. In the enclosed config for a 2 node cluster, letting 2 RHEL5 boxes work as a cluster with a shared iSCSI disk stored on a Windows Storage Server box, the resources will not start. I have iscsid running, no iscsi. I was modifying be

Re: [Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
On 04/19/2011 10:38 AM, Marek Marczykowski wrote: >> in your opinion, is it possible to fix this via the ocf ra or does it >> have to be a separate cronjob? > > I have no idea how to do it in the ra. There is no easy way to see what > binlogs are on the other node. Maybe some tricks storing that inf

[Pacemaker] Resource Agents 1.0.4: HA LVM Patch

2011-04-19 Thread Ulf
Hi, I attached a patch to enhance the LVM agent with the capability to set a tag on the VG (set_hosttag = true). In conjunction with a volume_list filter, this can prevent a VG from being activated on multiple hosts. Unfortunately, active VGs will stay active in case of unclean operation. The tag is always

[Pacemaker] crm : unknown expected votes

2011-04-19 Thread hari.n.tatituri
Hi, I created a 2-node cluster using Pacemaker on Fedora 14 (2.6.35.6-45.fc14.x86_64). I have two errors that I am not able to resolve. Can someone help me resolve these errors? 1) It always shows "unknown expected votes" when I see 'crm status'. 2) In the logfile it s

Re: [Pacemaker] how to get pacemaker:ping recheck before promoting drbd resources on a node

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 11:54 AM, Jelle de Jong wrote: > On 19-04-11 11:31, Andrew Beekhof wrote: >> If the underlying messaging/membership layer goes into spasms - >> there's not much ping can do to help you. What version of corosync >> have you got? Some versions have been better than others. >

Re: [Pacemaker] A question and demand to a resource placement strategy function

2011-04-19 Thread Yan Gao
On 04/18/11 18:17, Yuusuke IIDA wrote: > * When it is not dispersed well > When I caused failures in the resources in the following order, the placement was > uneven and the resources were placed on one node. > > main_rsc3 -> main_rsc2 -> main_rsc1 > > Online: [srv-b1 srv-b2 srv-a1] > Full list of resources: > main_

Re: [Pacemaker] Pacemaker / Postfix startup problem...

2011-04-19 Thread Adam Reiss
I'll get a chance to work on it today. I'll let you know what happens. :) Thanks!! -Original Message- From: Raoul Bhatia [IPAX] [mailto:r.bha...@ipax.at] Sent: Tuesday, April 19, 2011 5:15 AM To: The Pacemaker cluster resource manager Cc: Adam Reiss Subject: Re: [Pacemaker] Pacemake

[Pacemaker] Announce: Hawk (HA Web Konsole) 0.4.0

2011-04-19 Thread Tim Serong
Greetings All, This is to announce version 0.4.0 of Hawk, a web-based GUI for managing and monitoring Pacemaker High-Availability clusters. You can use Hawk 0.4.0 to: - Monitor your cluster, with much the same functionality as crm_mon (displays node and resource status, failed ops). - P

Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread Rakesh K
Andrew Beekhof writes: > > There is nothing in this config that requires tomcat2 to be stopped. > > Perhaps: >colocation Tomcat2-with-Tomcat inf: Tomcat1 Tomcat2VIP > was intended to be: >colocation Tomcat2-with-Tomcat inf: Tomcat2 Tomcat1 > > The only other service active is httpd, wh

Re: [Pacemaker] Unable to stop Multi state resource

2011-04-19 Thread Rakesh K
Rakesh K writes: Hi Andrew FSR is a file system replication script which adheres to the OCF cluster framework. The script is similar to the MySQL OCF script, which is a multi-state resource: on the master an ssh server would be running, and on the slave there are rsync scripts which are used to synchronize th

Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread Andrew Beekhof
There is nothing in this config that requires tomcat2 to be stopped. Perhaps: colocation Tomcat2-with-Tomcat inf: Tomcat1 Tomcat2VIP was intended to be: colocation Tomcat2-with-Tomcat inf: Tomcat2 Tomcat1 The only other service active is httpd, which also has no constraints indicating it sh

[Pacemaker] SBD kills both nodes in a two node cluster.

2011-04-19 Thread Ulf
I've two nodes with shared storage and multipathing. But the SBD device doesn't work as expected. My idea was that in case of a split brain, one node kills the other node and one will survive. But in my case I get a double kill: both nodes will be killed at the same time. I simulated the split b

Re: [Pacemaker] how to get pacemaker:ping recheck before promoting drbd resources on a node

2011-04-19 Thread Jelle de Jong
On 19-04-11 11:31, Andrew Beekhof wrote: > If the underlying messaging/membership layer goes into spasms - > there's not much ping can do to help you. What version of corosync > have you got? Some versions have been better than others. corosync 1.2.1-4 pacemaker 1.0.9.1+hg15626-1 /etc/debian_ver

Re: [Pacemaker] how to get pacemaker:ping recheck before promoting drbd resources on a node

2011-04-19 Thread Andrew Beekhof
On Mon, Apr 18, 2011 at 8:57 PM, Jelle de Jong wrote: > Hello everybody, > > I need to be able to bring down my network interface (network failure > test) and a few seconds later bring it up again, without my drbd cluster > going nuts and creating split brains. > > I was advised to use ocf:pacemaker

Re: [Pacemaker] A question and demand to a resource placement strategy function

2011-04-19 Thread Andrew Beekhof
Yan is our utilization expert, let's see if he can provide some direction here :-) -- Andrew 2011/4/18 Yuusuke IIDA : > Hi, Andrew > > I want to disperse using a resource placement strategy function of > Pacemaker-1.1 > in the fail-over point of the resource in N to N environment. > > After testi

Re: [Pacemaker] Pacemaker / Postfix startup problem...

2011-04-19 Thread Raoul Bhatia [IPAX]
adam, any news on this? if this is not working for you, i've got another idea. but please report the current status first... thanks, raoul On 04/14/2011 08:33 PM, Raoul Bhatia [IPAX] wrote: > hi adam, > > On 14.04.2011 18:10, Adam Reiss wrote: >> Hi Raoul, >> >> We're trying to setup a HA SMTP R

Re: [Pacemaker] Question of the syslog output in pacemaker-1.1

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 9:25 AM, Yuusuke IIDA wrote: > Hi, Andrew > > I use corosync-1.3.0 and Pacemaker-1.1.5. > > The log outputs it via rsyslog. > > I changed syslog_facility of corosync.conf to local1 and was going to let a > designated file output the log of the cluster. > > However, setting

Re: [Pacemaker] Heartbeat over Disk or non IP possible?

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 9:31 AM, Ulf wrote: > Hi, > > So it seems not to be possible to do a heartbeat over disk. > Is it planned to introduce such a feature? > It would be a feature of the underlying communications layer. So you'd have to ask the heartbeat or corosync maintainers - but in both c

Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread Andrew Beekhof
On Tue, Apr 19, 2011 at 10:35 AM, Rakesh K wrote: > Andrew Beekhof writes: > > > Hi Andrew thanks for giving reply. > > The version of pacemaker i am using is pacemaker-1.0.9.1 Ok. Could be a bug. Can you attach the output of cibadmin -Ql when the cluster is in the state you describe? _

Re: [Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
On 04/19/2011 10:38 AM, Marek Marczykowski wrote: > On 04/19/11 10:29, Raoul Bhatia [IPAX] wrote: >> On 04/19/2011 10:20 AM, Marek Marczykowski wrote: >>> On 04/19/11 10:01, Raoul Bhatia [IPAX] wrote: what i can currently think of: 1. run a cronjob which periodically analyzes the bin

Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread Rakesh K
Andrew Beekhof writes: Hi Andrew thanks for giving reply. The version of pacemaker i am using is pacemaker-1.0.9.1 Regards Rakesh ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Pro

Re: [Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
On 04/19/2011 10:20 AM, Marek Marczykowski wrote: > On 04/19/11 10:01, Raoul Bhatia [IPAX] wrote: >> what i can currently think of: >> >> 1. run a cronjob which periodically analyzes the binlogs and will update >> the node's log-file and log-pos attributes if there are empty binlogs; >> (that's the

Re: [Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
On 04/19/2011 10:01 AM, Raoul Bhatia [IPAX] wrote: > the failover worked and wdb02 is up and running. > upon rejoin, wdb01 wanted to start syncing from mysql-bin.15, > position 24386 (as saved in the cib). > > this fails with error "Last_IO_Errno: 1236" and the message: >> Last_IO_Error: Got f

[Pacemaker] mysql m/s failover: 'Could not find first log file name in binary log index file'

2011-04-19 Thread Raoul Bhatia [IPAX]
hi, i'm starting a new thread to address a specific "Could not find first log file name in binary log index file" error upon failover. background: i currently have a two node mysql m/s setup. expire_logs_days (was) set to 7 days last failover happened > 7 days ago (therefore, binlogs have been pu

Re: [Pacemaker] Heartbeat over Disk or non IP possible?

2011-04-19 Thread Ulf
Hi, So it seems not to be possible to do a heartbeat over disk. Is it planned to introduce such a feature? Cheers, Ulf >On Sat, Apr 16, 2011 at 12:23 PM, Ulf wrote: > >Hi, > >is there a way to implement a heartbeat over disk? Or any other non IP > medium? > >I think the SFEX agent

[Pacemaker] Question of the syslog output in pacemaker-1.1

2011-04-19 Thread Yuusuke IIDA
Hi, Andrew I use corosync-1.3.0 and Pacemaker-1.1.5. The log is output via rsyslog. I changed syslog_facility in corosync.conf to local1, intending to have the cluster log written to a designated file. However, the setting was not applied to the processes forked by pacemakerd. fac

Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread Andrew Beekhof
What version of pacemaker? On Tue, Apr 19, 2011 at 9:10 AM, rakesh k wrote: > Hi All > > I had configured Heartbeat, Pacemaker on my two VMs > > Cluster with two nodes, two nodes are running CentOS as the operating system. > > Cluster configured with 8 resources and defined order, taking referenc

[Pacemaker] Ordering set of resources, problem in ordering chain of resources

2011-04-19 Thread rakesh k
Hi All I had configured Heartbeat, Pacemaker on my two VMs. Cluster with two nodes, two nodes are running *CentOS* as the operating system. Cluster configured with 8 resources and defined order, taking reference pacemaker explained please find the order and co-location constraints, taken from ci