Re: [Pacemaker] pacemaker fails to start drbd using ocf:linbit:drbd

2010-07-01 Thread martin . braun
Hi Bart, just some more thoughts: Are you sure that drbd was really stopped? Does this error also happen after a clean restart (without drbd starting at runlevel), i.e. "lsmod | grep drbd" without results? How long does it take if you set up drbd (attach,syncer,connect,primary) manually? Wh

Re: [Pacemaker] pacemaker fails to start drbd using ocf:linbit:drbd

2010-07-01 Thread martin . braun
Hi Bart, my guess is that you forgot the start-delay attribute for the monitor operations, which is why you see the time-out error message. Here is an example: op monitor interval="20" role="Slave" timeout="20" start-delay="1m" \ op monitor interval="10" role="Master" timeo
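For context, the monitor operations quoted above would normally sit inside a primitive that is wrapped in a master/slave resource. The fragment below is only a sketch in crm shell syntax: the names resDRBD and msDRBD and the DRBD resource name r0 are hypothetical, and the timeout and start-delay values on the Master monitor are assumptions, since the original message is cut off at that point.

```
# Hypothetical names: resDRBD, msDRBD, r0.
# The Slave monitor line is taken from the message; the Master
# timeout and start-delay values are assumed for illustration.
primitive resDRBD ocf:linbit:drbd \
    params drbd_resource="r0" \
    op monitor interval="20" role="Slave" timeout="20" start-delay="1m" \
    op monitor interval="10" role="Master" timeout="20" start-delay="1m"
ms msDRBD resDRBD \
    meta master-max="1" master-node-max="1" clone-max="2" \
         clone-node-max="1" notify="true"
```

The two monitor operations use different intervals so that the cluster treats them as distinct operations for the two roles.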

Re: [Pacemaker] One or a few dedicated installations

2010-06-22 Thread martin . braun
Hi Martin, > I don't have a big cluster but I do have a few (5-15) active/passive > clusters with each having 2 nodes. I have a similar setup running: one 6 node cluster with 3 active/passive pairs and it is working fine. When you add more nodes, the configuration will get a bit long and unclear but

Re: [Pacemaker] VMWare Guest as Cluster Nodes - how to do stonith

2010-05-21 Thread martin . braun
Hi Sebastian, > Do you have any experiences with the libvirt? It sounds very > promising to me. Normally a simple soap call to reboot the guest > would be enough to have kind of stonith. I have some experience with libvirt, but not yet with the ESX hypervisor driver. However with that driver c

Re: [Pacemaker] VMWare Guest as Cluster Nodes - how to do stonith

2010-05-20 Thread martin . braun
Hi Sebastian, > guests. Does anybody have experiences or advises on how to configure > stonith for vmware guests or maybe there is another solution to > build a reliable virtual cluster. See this thread: http://www.gossamer-threads.com/lists/linuxha/pacemaker/62850?search_string=Pacemaker%20in

Re: [Pacemaker] How SuSEfirewall2 affects on openais startup?

2010-05-12 Thread martin . braun
Hi Aleksey, > > Don't know much about SuSEfirewall2, but can't see how when it > > starts should influence the cluster, as long as it lets the > > traffic we need through. > > And what to do? First of all check the logs (/var/log/messages and /var/log/firewall). When you can't see anything suspic

Re: [Pacemaker] 2 web and 2 db servers both with drbd. Questions concerning the set-up

2010-05-05 Thread martin . braun
Hi Oliver, > > I almost have a set-up which looks like this: > > > > > > Master sideSlave side > > > > -virtual IP-- > > | | > > web1drbd/pacemaker--web2 > > | | > > internal internal >

Re: [Pacemaker] Failover constraint problem

2010-04-19 Thread martin . braun
Hi Sandor, > 1. If I migrate apache-group resorce to another node then nfs_client > won't release the /mnt mount point (I know according to this config it > should not). Referring to the time-out message below, is it possible that stopping the nfs-client takes more than 20 sec? Perhaps you shou

[Pacemaker] Howto handle opt-in clusters WAS: Re: resource monitor operations on wrong nodes

2010-04-16 Thread martin . braun
Hi, > > I have a non-symmetric cluster (symmetric-cluster="false") with four > > nodes. > We still check _every_ node to be sure the resources aren't already > running there. OK, that is reasonable - but I have trouble with the logic of the messages: they are listed as failed actions, however if

[Pacemaker] resource monitor operations on wrong nodes

2010-04-09 Thread martin . braun
Hi, I have a non-symmetric cluster (symmetric-cluster="false") with four nodes. On two nodes I have allowed a resource group: location grpFS-pref1 grpFS 200: wdf-ux-0040 location grpFS-pref2 grpFS 200: wdf-ux-0041 grpFS is configured as: group grpFS resFS resVIP resAPP the other nodes are n

Re: [Pacemaker] disable failover when doing orderly reboot

2010-04-01 Thread martin . braun
Hi Gerry, > Stop all resources running on the node going to be shut down. > That's what you want in the end, isn't it? That should also work: set the second node to standby and do the reboot on the primary - when primary is up again you'll set the secondary back online. Best, Martin "Gerry K
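The standby approach described above can be sketched as two crm shell calls, one before and one after the reboot. This is a minimal sketch, not a tested procedure: the function names and the node name in the usage note are hypothetical, and DRY_RUN defaults to 1 so the commands are only printed unless it is explicitly set to 0.

```shell
#!/bin/sh
# Sketch only. DRY_RUN=1 (the default) prints the commands instead of
# executing them; set DRY_RUN=0 on a real cluster node.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Before rebooting the primary: put the peer into standby so that it
# does not take over the resources. (Hypothetical function name.)
freeze_peer() {
    run crm node standby "$1"
}

# After the primary is up again: bring the peer back online.
release_peer() {
    run crm node online "$1"
}
```

Usage would look like `freeze_peer node2`, then the reboot of the primary, then `release_peer node2`, where node2 stands in for the secondary node's name.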

Re: [Pacemaker] DRBD Management Console 0.7.0

2010-03-25 Thread martin . braun
Hi Rasto, I played around with the MC and it is really a promising integrative approach for managing a DRBD and Pacemaker Cluster. For now it is really nice for demonstration purposes like detaching the primary with failover. What I am missing is a resource cleanup (crm resource cleanup ), is t

[Pacemaker] Feedback: Website Updates

2010-03-24 Thread martin . braun
Hi, first of all: the successful redesign of the clusterlabs website looks very nice and the splash page is a really good idea! However I found some small teething troubles: * "Explore" tab: - The link "Site updates" is not working yet. - I would also suggest adding a direct link to the wiki (Ho

[Pacemaker] Design: 8 vs 4x2 nodes Cluster

2010-03-18 Thread martin . braun
Hi there, I want to realize a rather complex setup, so I have a couple of questions: The cluster (as a shared nothing variant) should provide: * 4 services (=server) depending on each other. * 3 of them can only be realized as active/passive failover, synched with DRBD (M/S) * The servers ru

[Pacemaker] pacemaker-mgmt RPM for centos?

2010-02-10 Thread martin . braun
Hi all, I am setting up a pacemaker (1.0.7) cluster (corosync) with centos-5.4 - I used the repo: http://www.clusterlabs.org/rpm/epel-5/i386/. However I can't find the RPMs for the Python GUI. Is there an RPM build for epel-5 or do I have to compile from source? If yes - which version of pacema

Re: [Pacemaker] The best way to take a CIB backup and rooback to a defined state

2009-10-06 Thread martin . braun
Hi, > - cibadmin -C --xml-file backupfile.xml > > This is a way, but maybe not the best? I found this a convenient way: - crm configure show > backup_config.xml - crash!! - cibadmin -E --force - edit configuration to delete last-lrm-refresh etc. - crm configure < backup_config.xml It's just
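The sequence listed above can be wrapped in two small shell functions. This is a sketch under assumptions: the function names are hypothetical, the commands are the ones quoted in the message, and DRY_RUN defaults to 1 so that nothing is erased unless it is explicitly set to 0. Note that `crm configure show` writes crm shell syntax rather than XML, despite the .xml file name used in the message.

```shell
#!/bin/sh
# Sketch only. DRY_RUN=1 (the default) prints the commands instead of
# running them, because the restore step erases the whole CIB.
DRY_RUN="${DRY_RUN:-1}"
BACKUP="${BACKUP:-backup_config.xml}"   # file name taken from the message

# Save the current configuration. (Hypothetical function name.)
cib_backup() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "crm configure show > $BACKUP"
    else
        crm configure show > "$BACKUP"
    fi
}

# Roll back to the saved state. The message also edits the backup file
# by hand first (removing last-lrm-refresh etc.); that manual step is
# not automated here. (Hypothetical function name.)
cib_restore() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "cibadmin -E --force"
        echo "crm configure < $BACKUP"
    else
        cibadmin -E --force          # erase the current CIB
        crm configure < "$BACKUP"    # replay the saved configuration
    fi
}
```

Running `cib_backup` before a risky change and `cib_restore` afterwards mirrors the steps in the message; with the default DRY_RUN=1 both only print what they would do.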

[Pacemaker] OCF resource agent: template for python?

2009-08-14 Thread martin . braun
Hi List, we would like to use a python script as an OCF resource agent script. Do you know any free templates for this? Are there any disadvantages to consider when not using bash as a script language for OCF? Thanks in advance, Martin

Re: [Pacemaker] monitoring events exported through web interface?

2009-06-24 Thread martin . braun
Hi Dimitar, > > I need to publish certain resource states in my cluster through > > a web interface which can be accessed remotely by load > > balancers and other health-checking software. Is there a > > standard way to do this? The "standard way" should be SNMP: nearly every health-checking sof

Re: [Pacemaker] API for programmatically controlling a heartbeat3cluster?

2009-06-22 Thread martin . braun
Hi Thomas, > what about the following cli tools: > > - crm > - crm_resource > - crm_standby > - crm_mon > - crm_* I know that I can do anything with these tools - but I thought about something more "direct", like SOAP, RPC etc. How is the Python-GUI doing all this - is the mgmtd just a wrapper

[Pacemaker] API for programmatically controlling a heartbeat 3 cluster?

2009-06-18 Thread martin . braun
Hi All, I want to be able to configure the CIB of a heartbeat/pacemaker cluster programmatically (Java). I don't want to reinvent the PythonGUI - I just want to be able to change some basic settings like the Virtual-IP Address (i.e. changing resource's attributes), set a node to standby or g