I keep having an odd problem. The system will lockup completely, be unreachable from other hosts on my LAN, and require a power cycle to get going again... Ya, I know, not a nice thing to do to your disks.
Anyhow, I regularly fail the normal boot fsck on my /usr partition and need to run it manually. The manual fsck finds and deletes some of the old working directories ("w-") in my ports tree. The system is using an LSI MegaRAID SATA 150-6 with six 160GB drives. They are configure into a single mirrored+striped logical volume. Yes, I've got backups via dump(8) saved to a PATA disk (wd0), so I'm not too worried about the data. The wd0 disk is in a removable carrier and is normally not installed. Yes, there are multiple backups on multiple PATA disks. On the main logical disk (sd0) partition sizes were kept sane (100GB) considering the amount of RAM I have (1GB). # df -h Filesystem Size Used Avail Capacity Mounted on /dev/sd0a 1006M 53.6M 902M 6% / /dev/sd0i 98.4G 21.3G 72.2G 23% /arc /dev/sd0h 98.4G 10.7G 82.8G 11% /home /dev/sd0d 3.9G 18.0K 3.7G 0% /tmp /dev/sd0g 39.4G 18.5G 18.9G 49% /usr /dev/sd0e 3.9G 74.2M 3.7G 2% /var /dev/wd0a 230G 94.6G 124G 43% /mnt My *guess* is one of the raid disks is slowly failing but I'm not seeing any errors in /var/log/messages What's a good way to go about trouble shooting this situation and hopefully isolating which disk is failing (assuming my guess is correct). dmesg below Thanks, JCR dmesg.boot OpenBSD 4.2-stable (GENERIC.MP) #0: Fri Feb 29 10:58:12 PST 2008 [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC.MP cpu0: Intel(R) Xeon(TM) CPU 2.00GHz ("GenuineIntel" 686-class) 2 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM real mem = 1072754688 (1023MB) avail mem = 1029603328 (981MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 11/01/04, BIOS32 rev. 0 @ 0xffe90, SMBIOS rev. 2.3 @ 0xf0450 (114 entries) bios0: vendor Dell Computer Corporation version "A11" date 11/01/2004 bios0: Dell Computer Corporation Precision WorkStation 530 MT apm0 at bios0: Power Management spec V1.2 apm0: AC on, battery charge unknown apm0: flags 30102 dobusy 0 doidle 1 pcibios0 at bios0: rev 2.1 @ 0xf0000/0x10000 pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfba00/224 (12 entries) pcibios0: PCI Interrupt Router at 000:31:0 ("Intel 82801BA LPC" rev 0x00) pcibios0: PCI bus #4 is the last bus bios0: ROM list: 0xc0000/0x8800 0xc8800/0x2800 0xcb000/0x1800 mainbus0: Intel MP Specification (Version 1.4) cpu0 at mainbus0: apid 0 (boot processor) cpu0: apic clock running at 99 MHz cpu1 at mainbus0: apid 1 (application processor) cpu1: Intel(R) Xeon(TM) CPU 2.00GHz ("GenuineIntel" 686-class) 2 GHz cpu1: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM mainbus0: bus 0 is type PCI mainbus0: bus 1 is type PCI mainbus0: bus 2 is type PCI mainbus0: bus 3 is type PCI mainbus0: bus 4 is type PCI mainbus0: bus 5 is type ISA ioapic0 at mainbus0: apid 2 pa 0xfec00000, version 20, 24 pins ioapic0: misconfigured as apic 0, remapped to apid 2 pci0 at mainbus0 bus 0: configuration mode 1 (no bios) pchb0 at pci0 dev 0 function 0 "Intel 82860 Host" rev 0x04: rng active, 8000000Kb/sec ppb0 at pci0 dev 1 function 0 "Intel 82850/82860 AGP" rev 0x04 pci1 at ppb0 bus 1 vga1 at pci1 dev 0 function 0 "Matrox MGA G400/G450 AGP" rev 0x82 wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) wsdisplay0: screen 1-5 added (80x25, vt100 emulation) ppb1 at pci0 dev 2 function 0 "Intel 82860 PCI-PCI" rev 0x04 pci2 at ppb1 bus 2 ppb2 at pci2 dev 31 function 0 "Intel 82806AA" rev 0x03 pci3 at ppb2 bus 3 "Intel 82806AA APIC" rev 0x01 at pci3 dev 0 function 0 not configured ami0 at pci3 dev 13 function 0 "Symbios Logic MegaRAID" rev 0x01: apic 2 int 21 (irq 11) ami0: LSI 523, 32b, FW 713R, BIOS vG121, 64MB RAM ami0: 1 channels, 0 FC loops, 1 logical drives scsibus0 at ami0: 40 targets sd0 at scsibus0 targ 0 lun 0: <AMI, Host drive #00, > SCSI2 0/direct fixed sd0: 457749MB, 58354 cyl, 255 head, 63 sec, 512 bytes/sec, 937469952 sec total scsibus1 at ami0: 16 targets ppb3 at pci0 dev 30 function 0 "Intel 82801BA AGP" rev 0x04 pci4 at ppb3 bus 4 xl0 at pci4 dev 11 function 0 "3Com 3c905C 100Base-TX" rev 0x78: apic 2 int 23 (irq 10), address 00:06:5b:87:ad:bd exphy0 at xl0 phy 24: 3Com internal media interface "TI TSB12LV26 FireWire" rev 0x00 at pci4 dev 12 function 0 not configured em0 at pci4 dev 14 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 18 (irq 11), address 00:02:b3:96:13:11 ichpcib0 at pci0 dev 31 function 0 "Intel 82801BA LPC" rev 0x04: 24-bit timer at 3579545Hz pciide0 at pci0 dev 31 function 1 "Intel 82801BA IDE" rev 0x04: DMA, channel 0 wired to compatibility, channel 1 wired to compatibility wd0 at pciide0 channel 0 drive 0: <Max 6Y250L6> wd0: 16-sector PIO, LBA48, 239372MB, 490234752 sectors wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5 atapiscsi0 at pciide0 channel 1 drive 0 scsibus2 at atapiscsi0: 2 targets cd0 at scsibus2 targ 0 lun 0: <LITE-ON, LTR-48246S, SUS5> SCSI0 5/cdrom removable atapiscsi1 at pciide0 channel 1 drive 1 scsibus3 at atapiscsi1: 2 targets cd1 at scsibus3 targ 0 lun 0: <CREATIVE, DVD-ROM DVD2240E, 1.3A> SCSI0 5/cdrom removable cd0(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2 cd1(pciide0:1:1): using PIO mode 4, DMA mode 2 uhci0 at pci0 dev 31 function 2 "Intel 82801BA USB" rev 0x04: apic 2 int 19 (irq 11) ichiic0 at pci0 dev 31 function 3 "Intel 82801BA SMBus" rev 0x04: apic 2 int 17 (irq 11) iic0 at ichiic0 admtemp0 at iic0 addr 0x18: adm1023 uhci1 at pci0 dev 31 function 4 "Intel 82801BA USB" rev 0x04: apic 2 int 23 (irq 10) auich0 at pci0 dev 31 function 5 "Intel 82801BA AC97" rev 0x04: apic 2 int 17 (irq 11), ICH2 AC97 ac97: codec id 0x41445360 (Analog Devices AD1885) ac97: codec features headphone, Analog Devices Phat Stereo audio0 at auich0 isa0 at ichpcib0 isadma0 at isa0 pckbc0 at isa0 port 0x60/5 pckbd0 at pckbc0 (kbd slot) pckbc0: using irq 1 for kbd slot wskbd0 at pckbd0: console keyboard, using wsdisplay0 pmsi0 at pckbc0 (aux slot) pckbc0: using irq 12 for aux slot wsmouse0 at pmsi0 mux 0 pcppi0 at isa0 port 0x61 midi0 at pcppi0: <PC speaker> spkr0 at pcppi0 lpt0 at isa0 port 0x378/4 irq 7 npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16 pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo pccom1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo fdc0 at isa0 port 0x3f0/6 irq 6 drq 2 fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec usb0 at uhci0: USB revision 1.0 uhub0 at usb0: Intel UHCI root hub, rev 1.00/1.00, addr 1 usb1 at uhci1: USB revision 1.0 uhub1 at usb1: Intel UHCI root hub, rev 1.00/1.00, addr 1 pctr: user-level cycle counter enabled mtrr: Pentium Pro MTRR support apm0: disconnected dkcsum: sd0 matches BIOS drive 0x80 dkcsum: wd0 matches BIOS drive 0x81 root on sd0a swap on sd0b dump on sd0b