Re: aic94xx: failing on high load (another data point)

2008-02-20 Thread James Bottomley
On Wed, 2008-02-20 at 17:54 +0800, Keith Hopkins wrote: > On 02/20/2008 11:48 AM, James Bottomley wrote: > > On Tue, 2008-02-19 at 10:22 -0600, James Bottomley wrote: > >> I'll see if I can come up with patches to fix this ... or at least > >> mitigate the problems it causes. > > > > Darrick's wor

Re: aic94xx: failing on high load (another data point)

2008-02-20 Thread Keith Hopkins
On 02/20/2008 11:48 AM, James Bottomley wrote: > On Tue, 2008-02-19 at 10:22 -0600, James Bottomley wrote: >> I'll see if I can come up with patches to fix this ... or at least >> mitigate the problems it causes. > > Darrick's working on the ascb sequencer use after free problem. > > I looked int

Re: aic94xx: failing on high load (another data point)

2008-02-19 Thread James Bottomley
On Tue, 2008-02-19 at 10:22 -0600, James Bottomley wrote: > I'll see if I can come up with patches to fix this ... or at least > mitigate the problems it causes. Darrick's working on the ascb sequencer use after free problem. I looked into some of the error handling in libsas, and apparently that

Re: aic94xx: failing on high load (another data point)

2008-02-19 Thread James Bottomley
On Mon, 2008-02-18 at 22:26 +0800, Keith Hopkins wrote: > Well, that made life interesting > but didn't seem to fix anything. > > The behavior is about the same as before, but with more verbose > errors. I failed one member of the raid and had it rebuild as a > test...which hangs for a whil

Re: aic94xx: failing on high load (another data point)

2008-02-18 Thread James Bottomley
On Mon, 2008-02-18 at 22:26 +0800, Keith Hopkins wrote: > Well, that made life interesting > but didn't seem to fix anything. > > The behavior is about the same as before, but with more verbose > errors. I failed one member of the raid and had it rebuild as a > test...which hangs for a whil

Re: aic94xx: failing on high load (another data point)

2008-02-18 Thread Keith Hopkins
On 02/15/2008 11:28 PM, James Bottomley wrote: > On Fri, 2008-02-15 at 00:11 +0800, Keith Hopkins wrote: >> On 01/31/2008 03:29 AM, Darrick J. Wong wrote: >>> On Wed, Jan 30, 2008 at 06:59:34PM +0800, Keith Hopkins wrote: V28. My controller functions well with a single drive (low-medium load)

Re: aic94xx: failing on high load (another data point)

2008-02-15 Thread Keith Hopkins
On 02/15/2008 11:28 PM, James Bottomley wrote: > If you're willing, could you try this patch which may correct the > problem? It's sort of like falling off a cliff: if you never go near > the edge (i.e. you upgrade the drive fw) you never fall off; > alternatively, it would be nice if you could he

Re: aic94xx: failing on high load (another data point)

2008-02-15 Thread James Bottomley
On Fri, 2008-02-15 at 00:11 +0800, Keith Hopkins wrote: > On 01/31/2008 03:29 AM, Darrick J. Wong wrote: > > On Wed, Jan 30, 2008 at 06:59:34PM +0800, Keith Hopkins wrote: > >> V28. My controller functions well with a single drive (low-medium load). > >> Unfortunately, all attempts to get the mi

Re: aic94xx: failing on high load (another data point)

2008-02-14 Thread Keith Hopkins
On 01/31/2008 03:29 AM, Darrick J. Wong wrote: > On Wed, Jan 30, 2008 at 06:59:34PM +0800, Keith Hopkins wrote: >> V28. My controller functions well with a single drive (low-medium load). >> Unfortunately, all attempts to get the mirrors in sync fail and usually hang >> the whole box. > > Adap

Re: aic94xx: failing on high load (another data point)

2008-01-30 Thread Darrick J. Wong
On Wed, Jan 30, 2008 at 06:59:34PM +0800, Keith Hopkins wrote: > > V28. My controller functions well with a single drive (low-medium load). > Unfortunately, all attempts to get the mirrors in sync fail and usually hang > the whole box. Adaptec posted a V30 sequencer on their website; does tha

Re: aic94xx: failing on high load (another data point)

2008-01-30 Thread Keith Hopkins
On 01/30/2008 05:14 PM, Jan Sembera wrote: > > We tried firmware versions V28, V30, and even V32 that is, as > far as I know, not yet available on adaptec website. All of them were > unfortunately displaying exactly the same behaviour :-(. Did you get your > SAS controller working? And if so

Re: aic94xx: failing on high load (another data point)

2008-01-30 Thread Keith Hopkins
> We've tried new adaptec firmware shipped with SLES and we got > ourselves new error string that appears just above error messages that you > have seen before and that were attached to the original message: > kernel: aic94xx: escb_tasklet_complete: REQ_TASK_ABORT, reason=0x6 > kernel: aic94xx