After following the wiki example for sharing an IP address (http://clusterlabs.org/wiki/Example_configurations), I'm able to manually fail over the resource with crm using the following statement (my nodes are ha1 and ha2):

        crm resource migrate failover-ip ha2


However, if I halt the box that currently owns the floating IP, or otherwise abruptly kill networking on it, the failover never happens automatically. I followed the example exactly, and the resource was initially created with:


        primitive failover-ip ocf:heartbeat:IPaddr params ip=192.168.7.250 op monitor interval=10


...so I'm not quite sure what the issue is. The messaging layer seems to work, since crm status shows ha2 as UNCLEAN (offline), but the resource allocation layer seems to be failing, probably somewhere in the CRM...?


There is no firewall between these nodes, so I haven't run tcpdump to check whether the messages are getting through, but I can't imagine that's the issue here. This is what things look like after the simulated problem:


r...@ha1:~# crm status


============
Last updated: Thu May 28 16:31:20 2009
Current DC: ha1 (ha1)
Version: 1.0.2-c02b459053bfa44d509a2a0e0247b291d93662b7
2 Nodes configured.
1 Resources configured.
============

Node: ha1 (ha1): online
Node: ha2 (ha2): UNCLEAN (offline)

r...@ha1:~# ifconfig
eth0      Link encap:Ethernet  HWaddr 00:0c:29:cd:78:4e
          inet addr:192.168.7.134  Bcast:192.168.7.255  Mask:255.255.255.0
          inet6 addr: fe80::20c:29ff:fecd:784e/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:7212 errors:0 dropped:0 overruns:0 frame:0
          TX packets:12373 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:919781 (898.2 KB)  TX bytes:1489819 (1.4 MB)
          Base address:0x2000 Memory:d8920000-d8940000

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:624 errors:0 dropped:0 overruns:0 frame:0
          TX packets:624 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:61572 (60.1 KB)  TX bytes:61572 (60.1 KB)


r...@ha1:~# crm_resource -L
failover-ip     (ocf::heartbeat:IPaddr) Started


As you can see, nothing has happened: the resource is reported as Started, yet 192.168.7.250 has not been brought up on ha1. Hopefully someone can spot my mistake before I do. Thanks in advance for any help.
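
In case it helps anyone reproduce the check, here is a minimal sketch of how the per-node state could be pulled out of the crm status text shown above. It is only an illustration: the function names node_states and unclean_nodes are made up, and the line format is assumed to match the "Node: name (id): state" lines that version 1.0.2 printed for me, which may differ in other releases.

```python
import re

# Assumed line format, copied from the crm status output in this post:
#   Node: ha2 (ha2): UNCLEAN (offline)
NODE_LINE = re.compile(r"^Node:\s+(\S+)\s+\([^)]*\):\s+(.+?)\s*$")


def node_states(status_text):
    """Return {node_name: state_string} for every 'Node:' line found."""
    states = {}
    for line in status_text.splitlines():
        match = NODE_LINE.match(line)
        if match:
            states[match.group(1)] = match.group(2)
    return states


def unclean_nodes(status_text):
    """Return the sorted names of nodes whose state mentions UNCLEAN."""
    states = node_states(status_text)
    return sorted(name for name, state in states.items() if "UNCLEAN" in state)


# The two node lines from the status output above (hypothetical helper input).
SAMPLE = """\
Node: ha1 (ha1): online
Node: ha2 (ha2): UNCLEAN (offline)
"""

if __name__ == "__main__":
    print(node_states(SAMPLE))
    print(unclean_nodes(SAMPLE))
```

Feeding it the two node lines from my output flags ha2 as the only unclean node, which matches what I see by eye.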


-Ryan

_______________________________________________
Pacemaker mailing list
Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
