Hi there,
I'm new to cluster services. I'm following the IBM guide
(http://www.ibm.com/developerworks/linux/library/l-linux-ha/index.html)
with different IP addresses, and I don't have hb_gui, only the CLI.
I'm running CentOS 5.4. Here is my ha.cf file:
# Log to syslog as facility "daemon"
use_logd on
logfacility daemon
# List our cluster members (the realservers)
node lvs1
node lvs2
node lvs3
# Send one heartbeat every 3 seconds
keepalive 3
# Warn when heartbeats are late
warntime 5
# Declare nodes dead after 10 seconds
deadtime 10
# Keep resources on their "preferred" hosts - needed for active/active
#auto_failback on
# The cluster nodes communicate on their heartbeat lan (192.168.1.*) interfaces
ucast eth0 192.168.1.21
ucast eth0 192.168.1.22
ucast eth0 192.168.1.23
# Failover on network failures
# Make the default gateway on the public interface a node to ping
# (-m) -> For every connected node, add <integer> to the value set
# in the CIB, * Default=1
# (-d) -> How long to wait for no further changes to occur before
# updating the CIB with a changed attribute
# (-a) -> Name of the node attribute to set, * Default=pingd
respawn hacluster /usr/lib/heartbeat/pingd -m 100 -d 5s
# Ping our router to monitor ethernet connectivity
ping 192.168.121
#Enable version 2 functionality supporting clusters with > 2 nodes
crm yes
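As a sanity check on the addresses in the ha.cf above, here is a minimal Python sketch that tests each one against a strict four-octet IPv4 form. The helper name `strict_ipv4_problems` is hypothetical, and heartbeat itself may accept shorter forms that this check rejects, so a hit is only something to look at, not proof of a misconfiguration.

```python
import ipaddress

# Address-bearing directives copied from the ha.cf above.
HA_CF_LINES = [
    "ucast eth0 192.168.1.21",
    "ucast eth0 192.168.1.22",
    "ucast eth0 192.168.1.23",
    "ping 192.168.121",
]


def strict_ipv4_problems(lines):
    """Return (directive, address) pairs whose address is not a four-octet IPv4."""
    problems = []
    for line in lines:
        fields = line.split()
        address = fields[-1]  # in both directives the address is the last field
        try:
            ipaddress.IPv4Address(address)
        except ipaddress.AddressValueError:
            problems.append((fields[0], address))
    return problems


for directive, address in strict_ipv4_problems(HA_CF_LINES):
    print(f"{directive}: '{address}' is not a four-octet IPv4 address")
```

With the lines above, the only entry reported is the `ping` directive, whose address is written with three octets.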
And here is my haresources file:
lvs1 192.168.1.2/24/eth0/192.168.1.255
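For reference, the haresources line above can be read as node, then IP/prefix/interface/broadcast. The Python sketch below splits it that way and checks that the broadcast address matches the prefix. The field order is my understanding of the haresources shorthand, and the names `IPaddrSpec` and `parse_haresources_ip` are hypothetical, not part of heartbeat.

```python
import ipaddress
from typing import NamedTuple, Optional


class IPaddrSpec(NamedTuple):
    node: str
    ip: str
    prefix: Optional[str]
    interface: Optional[str]
    broadcast: Optional[str]


def parse_haresources_ip(line):
    """Split '<node> <ip>/<prefix>/<interface>/<broadcast>' into its fields.

    Assumes the trailing fields are optional and appear in this order.
    """
    node, spec = line.split(None, 1)
    parts = spec.strip().split("/")
    parts += [None] * (4 - len(parts))  # pad any omitted trailing fields
    ip, prefix, interface, broadcast = parts[:4]
    return IPaddrSpec(node, ip, prefix, interface, broadcast)


def expected_broadcast(spec):
    """Broadcast address implied by the IP and prefix length."""
    network = ipaddress.ip_interface(f"{spec.ip}/{spec.prefix}").network
    return str(network.broadcast_address)


spec = parse_haresources_ip("lvs1 192.168.1.2/24/eth0/192.168.1.255")
print(spec)
print("broadcast consistent:", expected_broadcast(spec) == spec.broadcast)
```

For this line the broadcast address agrees with the /24 prefix, so the address fields themselves look internally consistent.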
Now, after generating the CIB.xml file and starting the heartbeat
service, the problem is that the resource can NOT start. I can't tell
why, because the error message gives so little detail.
# /usr/sbin/crm_resource --locate -r IPaddr_192_168_1_2
resource IPaddr_192_168_1_2 is NOT running
and:
# /usr/sbin/crm_mon
Defaulting to one-shot mode
You need to have curses available at compile time to enable console mode
============
Last updated: Tue Jun 14 10:59:52 2011
Current DC: lvs1 (0dd98014-6a88-4b58-a00d-a1fd8a18700e)
3 Nodes configured.
1 Resources configured.
============
Node: lvs3 (5aa44f75-9cef-4cf0-b83a-498835ccbd36): online
Node: lvs2 (6b789c73-8744-49e6-b3bb-4d99053450bd): online
Node: lvs1 (0dd98014-6a88-4b58-a00d-a1fd8a18700e): online
Failed actions:
IPaddr_192_168_1_2_start_0 (node=lvs3, call=3, rc=1): Error
IPaddr_192_168_1_2_start_0 (node=lvs1, call=3, rc=1): Error
IPaddr_192_168_1_2_start_0 (node=lvs2, call=3, rc=1): Error
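To make the failed actions above easier to read, here is a small Python sketch that parses those three lines and maps the rc value to its OCF return-code name. The regular expression is written only against the lines shown here, and `summarize_failed_actions` is a hypothetical helper, not a heartbeat tool.

```python
import re

# Standard OCF resource agent return codes.
OCF_RC_NAMES = {
    0: "OCF_SUCCESS",
    1: "OCF_ERR_GENERIC",
    2: "OCF_ERR_ARGS",
    3: "OCF_ERR_UNIMPLEMENTED",
    4: "OCF_ERR_PERM",
    5: "OCF_ERR_INSTALLED",
    6: "OCF_ERR_CONFIGURED",
    7: "OCF_NOT_RUNNING",
}

# Failed actions copied from the crm_mon output above.
FAILED_ACTIONS = """\
IPaddr_192_168_1_2_start_0 (node=lvs3, call=3, rc=1): Error
IPaddr_192_168_1_2_start_0 (node=lvs1, call=3, rc=1): Error
IPaddr_192_168_1_2_start_0 (node=lvs2, call=3, rc=1): Error
"""

LINE_RE = re.compile(
    r"^(?P<op>\S+) \(node=(?P<node>[^,\s]+), call=(?P<call>\d+), "
    r"rc=(?P<rc>\d+)\): (?P<status>.+)$"
)


def summarize_failed_actions(text):
    """Return (node, operation, rc name) for each failed-action line."""
    rows = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            rc = int(match["rc"])
            rows.append((match["node"], match["op"], OCF_RC_NAMES.get(rc, "unknown")))
    return rows


for node, op, rc_name in summarize_failed_actions(FAILED_ACTIONS):
    print(f"{node}: {op} -> {rc_name}")
```

All three nodes report rc=1, the generic OCF error, so the output above does not say what actually went wrong. Any further detail would have to come from the syslog messages (logfacility daemon in the ha.cf above).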
After hours of googling, I decided to post here for help.
Please give me some ideas for resolving whatever is preventing the
resource from starting.
Thank you very much,
--
Best regards,
Hai Nam, Nguyen
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems