I keep having an odd problem. The system will lock up completely, become 
unreachable from other hosts on my LAN, and require a power cycle to 
get going again... Yeah, I know, not a nice thing to do to your disks.

Anyhow, I regularly fail the normal boot fsck on my /usr partition and 
need to run it manually. The manual fsck finds and deletes some of the 
old working directories ("w-") in my ports tree.

The system is using an LSI MegaRAID SATA 150-6 with six 160GB drives. 
They are configured into a single mirrored+striped logical volume.

Yes, I've got backups via dump(8) saved to a PATA disk (wd0), so I'm not 
too worried about the data. The wd0 disk is in a removable carrier and 
is normally not installed. Yes, there are multiple backups on multiple 
PATA disks.

On the main logical disk (sd0) partition sizes were kept sane (100GB) 
considering the amount of RAM I have (1GB).

# df -h
Filesystem     Size    Used   Avail Capacity  Mounted on
/dev/sd0a     1006M   53.6M    902M     6%    /
/dev/sd0i     98.4G   21.3G   72.2G    23%    /arc
/dev/sd0h     98.4G   10.7G   82.8G    11%    /home
/dev/sd0d      3.9G   18.0K    3.7G     0%    /tmp
/dev/sd0g     39.4G   18.5G   18.9G    49%    /usr
/dev/sd0e      3.9G   74.2M    3.7G     2%    /var
/dev/wd0a      230G   94.6G    124G    43%    /mnt

My *guess* is that one of the RAID disks is slowly failing, but I'm not 
seeing any errors in /var/log/messages.
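
One way to double-check that nothing relevant is hiding in the log is to 
filter it for lines that mention the RAID controller or the logical disk. 
The following is only a minimal Python sketch: the device names (ami0, sd0, 
scsibus0/1) come from the dmesg, while the function name `suspicious_lines` 
and the keyword list are assumptions made up for illustration, not the 
message format of any particular driver.

```python
import re

# Device names taken from the dmesg in this post.
DEVICE_RE = re.compile(r"\b(ami0|sd0|scsibus[01])\b")

# Assumed keyword list; adjust to whatever the driver actually logs.
TROUBLE_RE = re.compile(r"error|timeout|fail|degraded|offline|retry",
                        re.IGNORECASE)


def suspicious_lines(lines):
    """Return lines naming a RAID-related device together with a trouble keyword."""
    return [
        line.rstrip("\n")
        for line in lines
        if DEVICE_RE.search(line) and TROUBLE_RE.search(line)
    ]
```

It can be fed any iterable of lines, for example 
`suspicious_lines(open("/var/log/messages", errors="replace"))`. An empty 
result only means that none of the assumed keywords showed up, not that the 
disks are healthy.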

What's a good way to go about troubleshooting this situation and 
hopefully isolating which disk is failing (assuming my guess is 
correct)? The dmesg is below.

Thanks,
JCR




dmesg.boot
OpenBSD 4.2-stable (GENERIC.MP) #0: Fri Feb 29 10:58:12 PST 2008
    [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC.MP
cpu0: Intel(R) Xeon(TM) CPU 2.00GHz ("GenuineIntel" 686-class) 2 GHz
cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM
real mem  = 1072754688 (1023MB)
avail mem = 1029603328 (981MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 11/01/04, BIOS32 rev. 0 @ 0xffe90, SMBIOS rev. 2.3 @ 0xf0450 (114 entries)
bios0: vendor Dell Computer Corporation version "A11" date 11/01/2004
bios0: Dell Computer Corporation Precision WorkStation 530 MT
apm0 at bios0: Power Management spec V1.2
apm0: AC on, battery charge unknown
apm0: flags 30102 dobusy 0 doidle 1
pcibios0 at bios0: rev 2.1 @ 0xf0000/0x10000
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfba00/224 (12 entries)
pcibios0: PCI Interrupt Router at 000:31:0 ("Intel 82801BA LPC" rev 0x00)
pcibios0: PCI bus #4 is the last bus
bios0: ROM list: 0xc0000/0x8800 0xc8800/0x2800 0xcb000/0x1800
mainbus0: Intel MP Specification (Version 1.4)
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: apic clock running at 99 MHz
cpu1 at mainbus0: apid 1 (application processor)
cpu1: Intel(R) Xeon(TM) CPU 2.00GHz ("GenuineIntel" 686-class) 2 GHz
cpu1: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM
mainbus0: bus 0 is type PCI   
mainbus0: bus 1 is type PCI   
mainbus0: bus 2 is type PCI   
mainbus0: bus 3 is type PCI   
mainbus0: bus 4 is type PCI   
mainbus0: bus 5 is type ISA   
ioapic0 at mainbus0: apid 2 pa 0xfec00000, version 20, 24 pins
ioapic0: misconfigured as apic 0, remapped to apid 2
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel 82860 Host" rev 0x04: rng active, 8000000Kb/sec
ppb0 at pci0 dev 1 function 0 "Intel 82850/82860 AGP" rev 0x04
pci1 at ppb0 bus 1
vga1 at pci1 dev 0 function 0 "Matrox MGA G400/G450 AGP" rev 0x82
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
ppb1 at pci0 dev 2 function 0 "Intel 82860 PCI-PCI" rev 0x04
pci2 at ppb1 bus 2
ppb2 at pci2 dev 31 function 0 "Intel 82806AA" rev 0x03
pci3 at ppb2 bus 3
"Intel 82806AA APIC" rev 0x01 at pci3 dev 0 function 0 not configured
ami0 at pci3 dev 13 function 0 "Symbios Logic MegaRAID" rev 0x01: apic 2 int 21 (irq 11)
ami0: LSI 523, 32b, FW 713R, BIOS vG121, 64MB RAM
ami0: 1 channels, 0 FC loops, 1 logical drives
scsibus0 at ami0: 40 targets
sd0 at scsibus0 targ 0 lun 0: <AMI, Host drive #00, > SCSI2 0/direct fixed
sd0: 457749MB, 58354 cyl, 255 head, 63 sec, 512 bytes/sec, 937469952 sec total
scsibus1 at ami0: 16 targets
ppb3 at pci0 dev 30 function 0 "Intel 82801BA AGP" rev 0x04
pci4 at ppb3 bus 4
xl0 at pci4 dev 11 function 0 "3Com 3c905C 100Base-TX" rev 0x78: apic 2 int 23 (irq 10), address 00:06:5b:87:ad:bd
exphy0 at xl0 phy 24: 3Com internal media interface
"TI TSB12LV26 FireWire" rev 0x00 at pci4 dev 12 function 0 not configured
em0 at pci4 dev 14 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 18 (irq 11), address 00:02:b3:96:13:11
ichpcib0 at pci0 dev 31 function 0 "Intel 82801BA LPC" rev 0x04: 24-bit timer at 3579545Hz
pciide0 at pci0 dev 31 function 1 "Intel 82801BA IDE" rev 0x04: DMA, channel 0 wired to compatibility, channel 1 wired to compatibility
wd0 at pciide0 channel 0 drive 0: <Max 6Y250L6>
wd0: 16-sector PIO, LBA48, 239372MB, 490234752 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5
atapiscsi0 at pciide0 channel 1 drive 0
scsibus2 at atapiscsi0: 2 targets
cd0 at scsibus2 targ 0 lun 0: <LITE-ON, LTR-48246S, SUS5> SCSI0 5/cdrom removable
atapiscsi1 at pciide0 channel 1 drive 1
scsibus3 at atapiscsi1: 2 targets
cd1 at scsibus3 targ 0 lun 0: <CREATIVE, DVD-ROM DVD2240E, 1.3A> SCSI0 5/cdrom removable
cd0(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
cd1(pciide0:1:1): using PIO mode 4, DMA mode 2
uhci0 at pci0 dev 31 function 2 "Intel 82801BA USB" rev 0x04: apic 2 int 19 (irq 11)
ichiic0 at pci0 dev 31 function 3 "Intel 82801BA SMBus" rev 0x04: apic 2 int 17 (irq 11)
iic0 at ichiic0
admtemp0 at iic0 addr 0x18: adm1023
uhci1 at pci0 dev 31 function 4 "Intel 82801BA USB" rev 0x04: apic 2 int 23 (irq 10)
auich0 at pci0 dev 31 function 5 "Intel 82801BA AC97" rev 0x04: apic 2 int 17 (irq 11), ICH2 AC97
ac97: codec id 0x41445360 (Analog Devices AD1885)
ac97: codec features headphone, Analog Devices Phat Stereo
audio0 at auich0
isa0 at ichpcib0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pmsi0 at pckbc0 (aux slot)
pckbc0: using irq 12 for aux slot
wsmouse0 at pmsi0 mux 0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
spkr0 at pcppi0
lpt0 at isa0 port 0x378/4 irq 7
npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
pccom1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
usb0 at uhci0: USB revision 1.0
uhub0 at usb0: Intel UHCI root hub, rev 1.00/1.00, addr 1
usb1 at uhci1: USB revision 1.0
uhub1 at usb1: Intel UHCI root hub, rev 1.00/1.00, addr 1
pctr: user-level cycle counter enabled
mtrr: Pentium Pro MTRR support
apm0: disconnected
dkcsum: sd0 matches BIOS drive 0x80
dkcsum: wd0 matches BIOS drive 0x81
root on sd0a swap on sd0b dump on sd0b
