On 10 December 2010 04:36, Preeti Jain <[email protected]> wrote:
> Hello list,
> I am testing a network failure case by removing the NIC cable on one node, and
> I am getting unwanted results: the whole cluster is disturbed, and the resource
> appears to move between different nodes until it stabilizes on one node. This
> also results in failback.
> For example, if I remove the NIC cable from node 1, failover happens, and it
> takes some time for the resource to move to node 2. But when I plug the cable
> back in on node 1, a kind of split brain happens, and the resource takes some
> time to stabilize on node 1. This failback is not desired; the resource should
> stay on node 2.
> Every node reports the other cluster nodes as returning after a partition.
>
> Part of the log file on node 1 after plugging the NIC cable back in:
> heartbeat[2521]: 2010/12/08_16:50:02 CRIT: Cluster node Node2 returning after partition.
> heartbeat[2521]: 2010/12/08_16:50:02 info: For information on cluster partitions, See URL: http://linux-ha.org/SplitBrain
> heartbeat[2521]: 2010/12/08_16:50:02 WARN: Deadtime value may be too small.
> heartbeat[2521]: 2010/12/08_16:50:02 info: See FAQ for information on tuning deadtime.
> heartbeat[2521]: 2010/12/08_16:50:02 info: URL: http://linux-ha.org/FAQ#heavy_load
> heartbeat[2521]: 2010/12/08_16:50:02 info: Link Node2:eth0 up.
> heartbeat[2521]: 2010/12/08_16:50:02 WARN: Late heartbeat: Node Node2: interval 781870 ms
> heartbeat[2521]: 2010/12/08_16:50:02 info: Status update for node Node2: status active
> heartbeat[2521]: 2010/12/08_16:50:03 info: Link Node3:eth0 up.
> heartbeat[2521]: 2010/12/08_16:50:03 info: Link Node4:eth0 up.
> heartbeat[2521]: 2010/12/08_16:50:03 CRIT: Cluster node Node4 returning after partition.
> heartbeat[2521]: 2010/12/08_16:50:03 info: For information on cluster partitions, See URL: http://linux-ha.org/SplitBrain
> heartbeat[2521]: 2010/12/08_16:50:03 WARN: Deadtime value may be too small.
> heartbeat[2521]: 2010/12/08_16:50:03 info: See FAQ for information on tuning deadtime.
> heartbeat[2521]: 2010/12/08_16:50:03 info: URL: http://linux-ha.org/FAQ#heavy_load
> heartbeat[2521]: 2010/12/08_16:50:03 WARN: Late heartbeat: node Node4: interval 782200 ms
> heartbeat[2521]: 2010/12/08_16:50:03 info: Status update for node Node4: status active
> heartbeat[2521]: 2010/12/08_16:50:03 info: Link Node5:eth0 up.
> heartbeat[2521]: 2010/12/08_16:50:04 CRIT: Cluster node Node2 returning after partition.
> heartbeat[2521]: 2010/12/08_16:50:04 info: For information on cluster partitions, See URL: http://linux-ha.org/SplitBrain
> heartbeat[2521]: 2010/12/08_16:50:04 WARN: Deadtime value may be too small.
> heartbeat[2521]: 2010/12/08_16:50:04 info: See FAQ for information on tuning deadtime.
> heartbeat[2521]: 2010/12/08_16:50:04 info: URL: http://linux-ha.org/FAQ#heavy_load
> heartbeat[2521]: 2010/12/08_16:50:04 WARN: Late heartbeat: node Node2: interval 784380 ms
> heartbeat[2521]: 2010/12/08_16:50:04 info: Status update for node Node2: status active
> heartbeat[2521]: 2010/12/08_16:50:04 CRIT: Cluster node Node5 returning after partition.
> heartbeat[2521]: 2010/12/08_16:50:04 info: For information on cluster partitions, See URL: http://linux-ha.org/SplitBrain
> heartbeat[2521]: 2010/12/08_16:50:04 WARN: Deadtime value may be too small.
> heartbeat[2521]: 2010/12/08_16:50:04 info: See FAQ for information on tuning deadtime.
> heartbeat[2521]: 2010/12/08_16:50:04 info: URL: http://linux-ha.org/FAQ#heavy_load
> heartbeat[2521]: 2010/12/08_16:50:04 WARN: Late heartbeat: node Node5: interval 784390 ms
> heartbeat[2521]: 2010/12/08_16:50:04 info: Status update for node Node5: status active
>
>
>
> Is there any solution for this problem?
>
> Regards
> Preeti
>
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>

You haven't given much detail about your configuration, but in general,
using a fencing device helps to avoid this kind of problem.
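
When comparing what the log reports against your configured deadtime, it can
help to pull out the "Late heartbeat" intervals first. Below is a minimal
Python sketch, assuming the log line format shown in your message with the
"> " quote markers already stripped. The function name late_heartbeats and the
regular expression are my own illustration, not part of Heartbeat.

```python
import re

# Matches warnings such as:
#   ... WARN: Late heartbeat: Node Node2: interval 781870 ms
# The log uses both "Node" and "node", so the match is case-insensitive.
LATE_RE = re.compile(
    r"WARN: Late heartbeat: node (?P<node>[^\s:]+): interval (?P<ms>\d+) ms",
    re.IGNORECASE,
)


def late_heartbeats(log_text):
    """Return a list of (node, interval_ms) for each 'Late heartbeat' warning."""
    # Mail clients wrap long log lines, so collapse all whitespace runs
    # into single spaces before matching.
    flat = " ".join(log_text.split())
    return [(m.group("node"), int(m.group("ms"))) for m in LATE_RE.finditer(flat)]


# Two warnings copied from the quoted log.
sample = """\
heartbeat[2521]: 2010/12/08_16:50:02 WARN: Late heartbeat: Node Node2: interval 781870 ms
heartbeat[2521]: 2010/12/08_16:50:03 WARN: Late heartbeat: node Node4: interval 782200 ms
"""

for node, ms in late_heartbeats(sample):
    print(f"{node}: silent for about {ms / 60000:.1f} minutes")
```

For the lines quoted above this gives roughly 13 minutes per peer, which
presumably corresponds to how long the cable was unplugged.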
Pavlos
