>>> Lars Marowsky-Bree <[email protected]> schrieb am 05.02.2014 um 12:36 in
Nachricht
<[email protected]>:
> On 2014-02-05T12:24:00, Ulrich Windl <[email protected]>
wrote:
> 
>> I had a problem where "O2CB stop" fenced the node that was shut down:
>> I had updated the kernel, and then rebooted. As part of shutdown, the 
> cluster stack was stopped. In turn, the "O2CB" resource was stopped.
>> Unfortunately this caused an error like (SLES11 SP3):
>> 
>> ---
>> modprobe: FATAL: Could not load /lib/modules/3.0.101-0.8-xen/modules.dep:
No 
> such file or directory
>> o2cb(prm_O2CB)[19908]: ERROR: Unable to unload module: ocfs2
>> ---
>> 
>> This in turn caused a node fence, which ruined the clean reboot.
>> 
>> So why is the RA messing with the kernel module on stop?
> 
> Because customers complained about the new module not being picked up if
> they upgrade ocfs2-kmp and restarted the cluster stack on a node. It's
> incredibly hard to please everyone, alas ...

I think the proper way would be this:
Stop your OCFS2 resources, rmmod the module, [modprobe the module to re-insert
the new version], start your OCFS2 resources.

I guess the kernel update is more common than the "just the ocfs2-kmp update"

> 
> The right way to update a cluster node is anyway this one:
> 
> 1. Stop the cluster stack
> 2. Update/upgrade/reboot as needed
> 3. Restart the cluster stack
> 
> This would avoid this error too. Or keeping multiple kernel versions in
> parallel (which also helps if a kernel update no longer boots for some
> reason). Removing the running kernel package is usually not a great
> idea; I prefer to remove them after having successfully rebooted only,
> because you *never* know if you may have to reload a module.

There's another way: (Like HP-UX learned to do it): Defer changes to the
running kernel until shutdown/reboot.

> 
> 
> Regards,
>     Lars
> 
> -- 
> Architect Storage/HA
> SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer,

> HRB 21284 (AG Nürnberg)
> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected] 
> http://lists.linux-ha.org/mailman/listinfo/linux-ha 
> See also: http://linux-ha.org/ReportingProblems 


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to