Hi,

Ok, those rl1: watchdog timeouts didn't set off any alarm bells for me because I thought I'd seen them before; however, a quick grep of the logs (which date back to May 25) shows no other watchdog timeout matches.
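
For anyone who wants to repeat that check on their own logs, here is a minimal Python sketch of the same search. It assumes syslog-style kernel lines like the ones quoted further down in this message; the function name and the sample text are purely illustrative and not part of any FreeBSD tool.

```python
import re
from collections import Counter

# Matches syslog kernel lines such as
# "Aug  5 11:15:05 piglet kernel: rl1: watchdog timeout"
WATCHDOG_RE = re.compile(r"kernel: (\w+): watchdog timeout")


def count_watchdog_timeouts(log_text):
    """Return a Counter mapping device name to its number of watchdog timeouts."""
    counts = Counter()
    for line in log_text.splitlines():
        match = WATCHDOG_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# Sample lines copied from the log excerpt in this message.
SAMPLE_LOG = """\
Aug  5 11:15:05 piglet kernel: rl1: watchdog timeout
Aug  5 11:15:05 piglet kernel: ad6: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=218885455
Aug  5 11:15:10 piglet kernel: rl1: watchdog timeout
Aug  5 11:15:31 piglet kernel: ad6: FAILURE - device detached
"""

if __name__ == "__main__":
    for device, hits in count_watchdog_timeouts(SAMPLE_LOG).items():
        print(device, hits)
```

Feeding it the full contents of the log files instead of the sample would give a per-device count over the whole logged period.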

To try and avoid being incomplete again, I'll just attach the full dmesg below.

Jeremy Chadwick wrote:
On Wed, Aug 06, 2008 at 11:37:16AM +0200, Sebastiaan van Erk wrote:
Yes, good thing you pointed this out, I hadn't seen those yet:

Aug  5 11:15:05 piglet kernel: rl1: watchdog timeout
Aug  5 11:15:05 piglet kernel: ad6: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=218885455
Aug  5 11:15:05 piglet kernel: ad4: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=218885455
Aug  5 11:15:10 piglet kernel: rl1: watchdog timeout
Aug  5 11:15:31 piglet kernel: rl1: watchdog timeout
Aug  5 11:15:31 piglet kernel: ad6: FAILURE - device detached
Aug  5 11:15:31 piglet kernel: subdisk6: detached
Aug  5 11:15:31 piglet kernel: ad6: detached
Aug  5 11:15:31 piglet kernel: rl1: watchdog timeout
Aug  5 11:15:31 piglet kernel: rl1: watchdog timeout
Aug  5 11:15:31 piglet kernel: ad4: FAILURE - device detached
Aug  5 11:15:31 piglet kernel: subdisk4: detached
Aug  5 11:15:31 piglet kernel: ad4: detached
Aug  5 11:15:31 piglet kernel: GEOM_MIRROR: Device gm1: provider ad6 disconnected.
Aug  5 11:15:31 piglet kernel: GEOM_MIRROR: Device gm1: provider ad4 disconnected.
Aug  5 11:15:31 piglet kernel: GEOM_MIRROR: Device gm1: provider mirror/gm1 destroyed.
Aug  5 11:15:31 piglet kernel: GEOM_MIRROR: Device gm1 destroyed.
Aug  5 11:15:31 piglet kernel: g_vfs_done():mirror/gm1s1e[WRITE(offset=111376236544, length=16384)] error = 6

Kudos to Andrey for asking a simple yet incredibly beneficial question.

You have a much greater problem here, and it doesn't look specific to
your disks.  It looks as if an interrupt is stalled or locked.  I'm
willing to bet your rl1 Realtek NIC and your ATA controller (associated
with disks ad4 and ad6) use the same IRQ.  vmstat -i output should help
clear that up, or dmesg output.
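
As a rough illustration of checking the dmesg side of this, the Python sketch below groups probe lines by the IRQ they report and lists any IRQ claimed by more than one device. It assumes each probe message sits on a single line (wrapped lines would need joining first); the function names and regular expression are hypothetical helpers, and the sample lines are copied from the dmesg attached to this message.

```python
import re
from collections import defaultdict

# Matches probe lines of the form "name: <description> ... irq N ...".
PROBE_RE = re.compile(r"^(\w+): <[^>]*>.*\birq (\d+)\b")


def devices_by_irq(dmesg_text):
    """Map each IRQ number to the devices that report it, in order of appearance."""
    irqs = defaultdict(list)
    for line in dmesg_text.splitlines():
        match = PROBE_RE.match(line)
        if match:
            irqs[int(match.group(2))].append(match.group(1))
    return dict(irqs)


def shared_irqs(dmesg_text):
    """Return only the IRQs that more than one device reports."""
    return {irq: devs
            for irq, devs in devices_by_irq(dmesg_text).items()
            if len(devs) > 1}


# Probe lines copied (unwrapped) from the attached dmesg.
SAMPLE_DMESG = """\
rl0: <RealTek 8139 10/100BaseTX> port 0xd000-0xd0ff mem 0xf6084000-0xf60840ff irq 16 at device 8.0 on pci0
rl1: <RealTek 8139 10/100BaseTX> port 0xd100-0xd1ff mem 0xf6080000-0xf60800ff irq 17 at device 9.0 on pci0
atapci0: <SiI SiI 3512 SATA150 controller> port 0xd200-0xd207,0xd300-0xd303,0xd400-0xd407,0xd500-0xd503,0xd600-0xd60f mem 0xf6081000-0xf60811ff irq 18 at device 10.0 on pci0
uhci0: <VIA 83C572 USB controller> port 0xde00-0xde1f irq 21 at device 16.0 on pci0
uhci1: <VIA 83C572 USB controller> port 0xdf00-0xdf1f irq 21 at device 16.1 on pci0
"""

if __name__ == "__main__":
    for irq, devs in sorted(shared_irqs(SAMPLE_DMESG).items()):
        print("irq", irq, "shared by", ", ".join(devs))
```

On these sample lines, rl1 reports irq 17 and atapci0 reports irq 18, and only the two uhci controllers report the same IRQ. This reads only what the probe messages say at boot; vmstat -i remains the place to look for the live interrupt counts.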

I'll tell you that there have been some watchdog timeout fixes committed
to rl(4) in recent months, depending upon what specific model and
revision of Realtek NIC you have.  No offence intended, but Realtek is
definitely the worst of the bunch.  I'm willing to bet it's an on-board
NIC too.  :-)

Actually, I have 3 NICs in my PC, all of them in use. The machine is the server/router for my home network: the onboard vr0 NIC is connected to my ADSL modem, the rl0 NIC to my internal wired LAN, and the rl1 NIC to my wireless router. My internal wired LAN is firewalled from the wireless side, since I don't really trust wireless security ;-)

I'm CC'ing PYUN Yong-Hyeon here, as he presently maintains/works on the
rl(4) driver, and might be able to help determine if the Realtek NIC is
what's causing all of this, or if the ATA chipset (is this the VIA?  We
don't know yet) is causing it first.

Finally, what motherboard brand and model is this, and what BIOS
revision or version?

I attached the output of dmidecode (and dmesg); hopefully that contains all you need to know.

BTW: I did a reply all, but I'm not sure if that is the "right" policy here. If I'm bothering anybody with this and they prefer to only see the mail on the list, then please let me know!

Regards and thanks for all the help,
Sebastiaan
Copyright (c) 1992-2008 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 6.3-PRERELEASE #20: Wed Jan  2 19:48:49 CET 2008
    [EMAIL PROTECTED]:/usr/obj/usr/src/sys/PIGLET
MPTable: <OEM00000 PROD00000000>
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Sempron(tm) Processor 2600+ (1599.83-MHz 686-class CPU)
  Origin = "AuthenticAMD"  Id = 0x20fc2  Stepping = 2
  Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2>
  Features2=0x1<SSE3>
  AMD Features=0xe2500800<SYSCALL,NX,MMX+,FFXSR,LM,3DNow!+,3DNow!>
  AMD Features2=0x1<LAHF>
real memory  = 1056964608 (1008 MB)
avail memory = 1020919808 (973 MB)
ioapic0: Assuming intbase of 0
ioapic0 <Version 0.3> irqs 0-23 on motherboard
kbd1 at kbdmux0
ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
hptrr: HPT RocketRAID controller driver v1.1 (Jan  2 2008 19:48:29)
cpu0 on motherboard
pcib0: <MPTable Host-PCI bridge> pcibus 0 on motherboard
pci0: <PCI bus> on pcib0
agp0: <VIA 8380 host to PCI bridge> mem 0xe8000000-0xefffffff at device 0.0 on pci0
pcib1: <MPTable PCI-PCI bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib1
pci1: <display, VGA> at device 0.0 (no driver attached)
rl0: <RealTek 8139 10/100BaseTX> port 0xd000-0xd0ff mem 0xf6084000-0xf60840ff irq 16 at device 8.0 on pci0
miibus0: <MII bus> on rl0
rlphy0: <RealTek internal media interface> on miibus0
rlphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
rl0: Ethernet address: 00:50:fc:57:a2:4b
rl1: <RealTek 8139 10/100BaseTX> port 0xd100-0xd1ff mem 0xf6080000-0xf60800ff irq 17 at device 9.0 on pci0
miibus1: <MII bus> on rl1
rlphy1: <RealTek internal media interface> on miibus1
rlphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
rl1: Ethernet address: 00:50:fc:44:23:0e
atapci0: <SiI SiI 3512 SATA150 controller> port 0xd200-0xd207,0xd300-0xd303,0xd400-0xd407,0xd500-0xd503,0xd600-0xd60f mem 0xf6081000-0xf60811ff irq 18 at device 10.0 on pci0
ata2: <ATA channel 0> on atapci0
ata3: <ATA channel 1> on atapci0
atapci1: <VIA 6420 SATA150 controller> port 0xd700-0xd707,0xd800-0xd803,0xd900-0xd907,0xda00-0xda03,0xdb00-0xdb0f,0xdc00-0xdcff irq 20 at device 15.0 on pci0
ata4: <ATA channel 0> on atapci1
ata5: <ATA channel 1> on atapci1
atapci2: <VIA 8237 UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xdd00-0xdd0f at device 15.1 on pci0
ata0: <ATA channel 0> on atapci2
ata1: <ATA channel 1> on atapci2
uhci0: <VIA 83C572 USB controller> port 0xde00-0xde1f irq 21 at device 16.0 on pci0
uhci0: [GIANT-LOCKED]
usb0: <VIA 83C572 USB controller> on uhci0
usb0: USB revision 1.0
uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: <VIA 83C572 USB controller> port 0xdf00-0xdf1f irq 21 at device 16.1 on pci0
uhci1: [GIANT-LOCKED]
usb1: <VIA 83C572 USB controller> on uhci1
usb1: USB revision 1.0
uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2: <VIA 83C572 USB controller> port 0xe000-0xe01f irq 21 at device 16.2 on pci0
uhci2: [GIANT-LOCKED]
usb2: <VIA 83C572 USB controller> on uhci2
usb2: USB revision 1.0
uhub2: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhci3: <VIA 83C572 USB controller> port 0xe100-0xe11f irq 21 at device 16.3 on pci0
uhci3: [GIANT-LOCKED]
usb3: <VIA 83C572 USB controller> on uhci3
usb3: USB revision 1.0
uhub3: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
ehci0: <VIA VT6202 USB 2.0 controller> mem 0xf6082000-0xf60820ff irq 21 at device 16.4 on pci0
ehci0: [GIANT-LOCKED]
usb4: EHCI version 1.0
usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3
usb4: <VIA VT6202 USB 2.0 controller> on ehci0
usb4: USB revision 2.0
uhub4: VIA EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub4: 8 ports with 8 removable, self powered
isab0: <PCI-ISA bridge> at device 17.0 on pci0
isa0: <ISA bus> on isab0
pcm0: <VIA VT8237> port 0xe200-0xe2ff irq 22 at device 17.5 on pci0
pcm0: <VIA Technologies VIA1617A AC97 Codec>
pcm0: <VIA DXS Enabled: DXS 4 / SGD 0 / REC 1>
vr0: <VIA VT6102 Rhine II 10/100BaseTX> port 0xe300-0xe3ff mem 0xf6083000-0xf60830ff irq 23 at device 18.0 on pci0
vr0: Quirks: 0x0
miibus2: <MII bus> on vr0
ukphy0: <Generic IEEE 802.3u media interface> on miibus2
ukphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
vr0: Ethernet address: 00:13:d3:16:75:97
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xcc000-0xd3fff,0xd4000-0xd87ff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
fdc0: <Enhanced floppy controller> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
unknown: <PNP0303> can't assign resources (port)
unknown: <PNP0c01> can't assign resources (memory)
unknown: <PNP0c02> can't assign resources (memory)
unknown: <PNP0501> can't assign resources (port)
unknown: <PNP0700> can't assign resources (port)
unknown: <PNP0400> can't assign resources (port)
unknown: <PNP0501> can't assign resources (port)
Timecounter "TSC" frequency 1599832423 Hz quality 800
Timecounters tick every 1.000 msec
hptrr: no controller detected.
ad0: 286188MB <Maxtor 6L300R0 BAH41G10> at ata0-master UDMA133
ad1: 239372MB <Maxtor 6L250R0 BAH41G10> at ata0-slave UDMA133
acd0: DMA limited to UDMA33, device found non-ATA66 cable
acd0: DVDR <LITE-ON DVDRW SHW-1635S/YS0N> at ata1-master UDMA33
ad4: 953869MB <SAMSUNG HD103UJ 1AA01112> at ata2-master SATA150
ad6: 953869MB <SAMSUNG HD103UJ 1AA01112> at ata3-master SATA150
ad8: 239372MB <Maxtor 6L250S0 BANC1G10> at ata4-master SATA150
ad10: 239372MB <Maxtor 6L250S0 BANC1G10> at ata5-master SATA150
GEOM_MIRROR: Device gm1 created (id=560014299).
GEOM_MIRROR: Device gm1: provider ad4 detected.
GEOM_MIRROR: Device gm1: provider ad6 detected.
GEOM_MIRROR: Device gm1: provider ad6 activated.
GEOM_MIRROR: Device gm1: provider mirror/gm1 launched.
GEOM_MIRROR: Device gm1: rebuilding provider ad4.
GEOM_MIRROR: Device gm0 created (id=1464068017).
GEOM_MIRROR: Device gm0: provider ad8 detected.
GEOM_MIRROR: Device gm0: provider ad10 detected.
GEOM_MIRROR: Device gm0: provider ad10 activated.
GEOM_MIRROR: Device gm0: provider ad8 activated.
GEOM_MIRROR: Device gm0: provider mirror/gm0 launched.
Trying to mount root from ufs:/dev/ad0s1a
WARNING: / was not properly dismounted
WARNING: /tmp was not properly dismounted
WARNING: /usr was not properly dismounted
WARNING: /var was not properly dismounted
/var: mount pending error: blocks 4 files 1
WARNING: /shared was not properly dismounted
WARNING: /mirror was not properly dismounted
WARNING: R/W mount of /data denied.  Filesystem is not clean - run fsck
rl0: link state changed to UP
rl1: link state changed to UP
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
WARNING: R/W mount of /data denied.  Filesystem is not clean - run fsck
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
ukbd0: Logitech HID compliant keyboard, rev 1.10/1.80, addr 2, iclass 3/1
kbd2 at ukbd0
uhid0: Logitech HID compliant keyboard, rev 1.10/1.80, addr 2, iclass 3/1
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
arplookup 169.254.8.225 failed: host is not on local network
ad4: FAILURE - SMART status=51<READY,DSC,ERROR> error=4<ABORTED>
ad6: FAILURE - SMART status=51<READY,DSC,ERROR> error=4<ABORTED>
ad4: FAILURE - SMART status=51<READY,DSC,ERROR> error=4<ABORTED>
ad4: FAILURE - SMART status=51<READY,DSC,ERROR> error=4<ABORTED>
ad6: FAILURE - SMART status=51<READY,DSC,ERROR> error=4<ABORTED>
GEOM_MIRROR: Device gm1: rebuilding provider ad4 finished.
GEOM_MIRROR: Device gm1: provider ad4 activated.
vr0: link state changed to DOWN
vr0: link state changed to UP
vr0: link state changed to DOWN
vr0: link state changed to UP
