Re: [lopsa-tech] Need ideas/suggestions for bringing several VMs back online after an outage

Jonathan Tue, 29 Oct 2013 18:45:34 -0700

On Tue, Oct 29, 2013 at 8:27 PM, Mathew Snyder<mathew.sny...@gmail.com <mailto:mathew.sny...@gmail.com>> wrote:
    I'm looking at information for the onerror=panic option. What
    happens when I cause a kernel panic besides the system essentially
    becoming inoperable? Does it automatically force a fsck on the
    next reboot? So far, everything I've seen indicates that it simply
    creates a crash dump. That really isn't all that useful in this
    situation as we know what causes the problem.
On 30/10/2013 00:36, Brandon Allbery wrote:
fsck and normal shutdown both set a flag in the superblock indicatingthat the filesystem is clean; if this flag is not set then fsck isforced on reboot. Although, also important here, forcing a panic keepsthe system from pointlessly trying to continue and behaving weirdly ifthe disks vanish out from under it.

As Brandon says, a panic stops the machine dead in its tracks, andcauses it to reboot, with an fsck on the way up. I've seen cases wherea SAN gets heavily loaded (e.g. a boot storm) where running VMs will seeSCSI timeouts and force file systems read-only, but ONLY on the activefile systems. For example, /var may become read-only whilst / remainswriteable. This gets ugly. I've found systems which happily answertheir Nagios probes, but some volume is essentially off-line. Once afile system is read-only you are not going to be able to write anypending data to it, so you might as well crash. If the SAN was justshort-term overloaded, the system will likely come straight back up. Ifthe SAN is unavailable, the system will be unable to boot, but a downedhost is easier to spot than one with a random volume in read-only mode.


Jonathan.

_______________________________________________
Tech mailing list
Tech@lists.lopsa.org
https://lists.lopsa.org/cgi-bin/mailman/listinfo/tech
This list provided by the League of Professional System Administrators
 http://lopsa.org/

Re: [lopsa-tech] Need ideas/suggestions for bringing several VMs back online after an outage

Reply via email to