Re: MegaRAID 'Bad Slot' Kernel message and crash.

2005-01-14 Thread Scott Long
Tony Byrne wrote: Hello Doug, Friday, January 14, 2005, 1:35:16 AM, you wrote: DW> scottl has been able to reproduce this on a U320 controller he has. I only DW> have U160 equipment and can't get the txn rate up high enough to reproduce DW> the issue. The driver needs KTR instrumentation so we can

Re[2]: MegaRAID 'Bad Slot' Kernel message and crash.

2005-01-14 Thread Tony Byrne
Hello Doug, DW> scottl has been able to reproduce this on a U320 controller he has. Scott, We want to take this box into the lab as soon as we have a replacement for it, but we could do with knowing the best way to generate the problem on demand. So far the amr driver only seems to wedge for us

Re[2]: MegaRAID 'Bad Slot' Kernel message and crash.

2005-01-14 Thread Tony Byrne
Hello Doug, Friday, January 14, 2005, 1:35:16 AM, you wrote: DW> scottl has been able to reproduce this on a U320 controller he has. I only DW> have U160 equipment and can't get the txn rate up high enough to reproduce DW> the issue. The driver needs KTR instrumentation so we can see where the D

Re: MegaRAID 'Bad Slot' Kernel message and crash.

2005-01-13 Thread Doug White
On Tue, 11 Jan 2005, Tony Byrne wrote: > Basically, after some amount of uptime the kernel will emit a "amr0: > Bad slot x completed" message and pretty soon after this the box goes into a > partially unresponsive state forcing us to reboot it. So far the only > thing triggering the problem is th

Re: MegaRAID 'Bad Slot' Kernel message and crash.

2005-01-12 Thread Doug Ambrisko
Tony Byrne writes: | Basically, after some amount of uptime the kernel will emit a "amr0: | Bad slot x completed" message and pretty soon after this the box goes into a | partially unresponsive state forcing us to reboot it. So far the only | thing triggering the problem is the nightly jobs, where

MegaRAID 'Bad Slot' Kernel message and crash.

2005-01-11 Thread Tony Byrne
Folks, I kicked off a thread just before the holidays regarding some problems we are having with an Intel SRCU42X RAID controller in a dual processor production server originally under 5.3-STABLE and now under 4.10-STABLE. The thread ran out of steam, with no resolution to the problem, but I'm ho

Re[2]: MegaRAID 'Bad Slot' Kernel message and crash.

2004-12-29 Thread Tony Byrne
Hello Scott, Wednesday, December 29, 2004, 2:46:04 PM, you wrote: SL> I've been seeing this problem recently too. I believe that there is SL> some sort of timing bug/race in the driver, but I haven't been able to SL> figure it out yet. It also seems to be related to panic from the block SL> lay

Re: MegaRAID 'Bad Slot' Kernel message and crash.

2004-12-29 Thread Scott Long
Tony Byrne wrote: Folks, We have a 4.10-STABLE production server which has an Intel SRCU42X RAID controller installed: amr0: mem 0xfe58-0xfe5f,0xfbef-0xfbef irq 22 at device 0.0 on pci4 amr0: Firmware 411M, BIOS H404, 128MB RAM The server crashed yesterday in the small hours of

Re: MegaRAID 'Bad Slot' Kernel message and crash.

2004-12-29 Thread Frode Nordahl
On Dec 29, 2004, at 12:18, Tony Byrne wrote: The amr driver man page says that this message is indicative of a firmware or hardware problem with the controller, but we are not convinced. We experienced the same message and lockups daily during stress testing of the box under FreeBSD 5.3 and this

MegaRAID 'Bad Slot' Kernel message and crash.

2004-12-29 Thread Tony Byrne
Folks, We have a 4.10-STABLE production server which has an Intel SRCU42X RAID controller installed: amr0: mem 0xfe58-0xfe5f,0xfbef-0xfbef irq 22 at device 0.0 on pci4 amr0: Firmware 411M, BIOS H404, 128MB RAM The server crashed yesterday in the small hours of the morning and