Thanks,
I found some info about this issue at http://www.novell.com (Document ID:
7001469): ocfs2 on SLES10 NTS sanity check (OCFS2 HEARTBEAT)
quote:
"
...
If these checks are done, mode is set to "user" and everything seems to be
alright, but there are still problems with ocfs2 then the next step should
be to check the heartbeat settings.
If mode is "kernel" but there are still problems with ocfs2 then the next
step should be to contact NTS.
...
"
So, Novell recommends user "user" hertbeat mode instead of "kernel".
I'll try to use "hearbeat" + "heartbeat-stonith" packages from SLES
repository.
13.11.2009 23:45 пользователь Sunil Mushran <sunil.mush...@oracle.com>
написал:
Ping Novell Support. The base code works fine. This could be a bug in
sles10 sp2.
voino...@gmail.com wrote:
Hi
I have:
SLES 10 SP2 (2.6.16.60-0.21-smp)
ocfs2-tools-1.4.0-0.3
ocfs2console-1.4.0-0.3
and I can't change "heartbeat dead threshold" value.
Content of /etc/sysconfig/o2cb:
# O2CB_ENABLED: 'true' means to load the driver on boot.
O2CB_ENABLED=true
# O2CB_BOOTCLUSTER: If not empty, the name of a cluster to start.
O2CB_BOOTCLUSTER=ocfs2
# O2CB_HEARTBEAT_THRESHOLD: Iterations before a node is considered dead.
O2CB_HEARTBEAT_THRESHOLD=151
# O2CB_IDLE_TIMEOUT_MS: Time in ms before a network connection is
considered dead.
O2CB_IDLE_TIMEOUT_MS=10000
# O2CB_KEEPALIVE_DELAY_MS: Max time in ms before a keepalive packet is
sent
O2CB_KEEPALIVE_DELAY_MS=5000
# O2CB_RECONNECT_DELAY_MS: Min time in ms between connection attempts
O2CB_RECONNECT_DELAY_MS=2000
# O2CB_HEARTBEAT_MODE: Whether to use the native "kernel" or the "user"
# driven heartbeat (for example, for integration with heartbeat 2.0.x)
O2CB_HEARTBEAT_MODE="kernel"
so, O2CB_HEARTBEAT_THRESHOLD is 151.
Then I have restart service o2cb or even reboot server and...:
# service o2cb status
Module "configfs": Loaded
Filesystem "configfs": Mounted
Module "ocfs2_nodemanager": Loaded
Module "ocfs2_dlm": Loaded
Module "ocfs2_dlmfs": Loaded
Filesystem "ocfs2_dlmfs": Mounted
Checking O2CB cluster ocfs2: Online
Heartbeat dead threshold = 31
Network idle timeout: 10000
Network keepalive delay: 5000
Network reconnect delay: 2000
Checking O2CB heartbeat: Not active
so, Heartbeat dead threshold is 31!!!
ok. I'll try use configure...
# service o2cb configure
Load O2CB driver on boot (y/n) [y]:
Cluster to start on boot (Enter "none" to clear) [ocfs2]:
Specify heartbeat dead threshold (>=7) [151]: 151
Specify network idle timeout in ms (>=5000) [10000]:
Specify network keepalive delay in ms (>=1000) [5000]:
Specify network reconnect delay in ms (>=2000) [2000]:
Use user-space driven heartbeat? (y/n) [n]:
Writing O2CB configuration: OK
O2CB cluster ocfs2 already online
so, we see that "Specify heartbeat dead threshold" get my current value
(151) from /etc/sysconfig/o2cb.
but again we have:
# service o2cb status
...
Heartbeat dead threshold = 31
...
trying check file from configfs:
# cat /sys/kernel/config/cluster/ocfs2/heartbeat/dead_threshold
31
# echo 151 > /sys/kernel/config/cluster/ocfs2/heartbeat/dead_threshold
# cat /sys/kernel/config/cluster/ocfs2/heartbeat/dead_threshold
31
# ls -l /sys/kernel/config/cluster/ocfs2/heartbeat/dead_threshold
-rw-r--r-- 1 root root 4096 2009-11-13 17:42
/sys/kernel/config/cluster/ocfs2/heartbeat/dead_threshold
what's a problem?
------------------------------------------------------------------------
_______________________________________________
Ocfs2-users mailing list
Ocfs2-users@oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users
_______________________________________________
Ocfs2-users mailing list
Ocfs2-users@oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users