One of the big oops moments many years ago was more a lack of
knowledge than an oops at the keyboard.

I had an AIX file server with 4 disk contollers in it.  A couple
of disks on each controller.

The typical / /usr ... stuff was on disk 0 controller 0
along with some basic swap space.

disk 0 on controller 1 had additional swap space plus some 
file system stuff.

disk 0 on controllers 2 and 3 were made into 1 big filesystem
disk 1 on controllers 2 and 3 were made into another filesystem

All other disks on the system had additional filesystems on them.

System crashed one day.  Found out later the problem was that
contoller 1 developed a problem and disappered.

I rebooted the system and it does the normal filesystem checks.
Since the system had lots of disk and it crashed, it was going
to take some time for the fsck to complete.  Off to the fridge
to grab a cold dew.  Being used to BSD4.2 and SunOS systems, if
the system can't fsck everything it stops.  It should be
safe til I get back....  As Gomer Pyle would say, "Suprise!
Suprise! Suprise!".  Not so with the AIX of the day.    The
system just skips the the failed disks and goes into multi
user mode.

The result is the system now takes that nice big partition on
the "new" controller 1 as swap and therefore poof goes the 
filesystem.  Thank the automated tape monkeys that completed
the backups of the system just a couple hours before the
controller went byebye.

This got reported to IBM as a bug as well and fixed in a later
release of the system.

-_Gene
_______________________________________________
Discuss mailing list
Discuss@lists.lopsa.org
https://lists.lopsa.org/cgi-bin/mailman/listinfo/discuss
This list provided by the League of Professional System Administrators
 http://lopsa.org/

Reply via email to