Dick Davies wrote:
On 15/11/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:

>I suppose it depends how 'catastrophic' the failture is, but if it's
>very low level,
>booting another root probabyl won't help, and if it's too high level, how will
>you detect it (i.e. you've booted the kernel, but it is buggy).

If it panics (but not too early) or fails to come up properly?

Detecting 'come up properly' sounds hard
(as in 'turing test hard') to me.

It isn't as hard as that. In fact I don't think it is actually that hard at all in the current Solaris architecture.

SMF helps a huge amount in this area because it has lots of failure detection and milestone/dependency concepts built in.

I think we first need to define what state "up" actually is. Is it the kernel booted ? Is it the root file system mounted ? Is it we reached milestone all ? Is it we reached milestone all with no services in maintenance ? Is it no services in maintenance that weren't on the last boot ?

--
Darren J Moffat
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to