Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Patrick Mansfield
On Fri, Apr 22, 2005 at 12:52:56AM +0200, Lars Marowsky-Bree wrote: > On 2005-04-21T15:13:16, Patrick Mansfield <[EMAIL PROTECTED]> wrote: > > > > The most recent udm patchset has a patch by Jens Axboe and myself to > > > pass up sense data / error codes in the bio so the dm mpath module can > > >

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Lars Marowsky-Bree
On 2005-04-21T15:13:16, Patrick Mansfield <[EMAIL PROTECTED]> wrote: > > The most recent udm patchset has a patch by Jens Axboe and myself to > > pass up sense data / error codes in the bio so the dm mpath module can > > deal with it. > But the scmd->result is not passed back. Bear with me and

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Lars Marowsky-Bree
On 2005-04-21T23:33:57, Andreas Herrmann <[EMAIL PROTECTED]> wrote: > Well, there are various situations when all paths to the ESS are > "temporarily unavailable". In some cases TASK_SET_FULL/BUSY is > reported as it should be. Not sure whether this sense data is decoded and handled correctly in

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Lars Marowsky-Bree
On 2005-04-21T18:01:04, "goggin, edward" <[EMAIL PROTECTED]> wrote: > > If we can't differentiate in the kernel where we have the IO error > > details available, then how would user-space? You're not solving the > > problem ;-) > Maybe not completely, but at least an inquiry of page 83 will not tr

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Patrick Mansfield
On Thu, Apr 21, 2005 at 09:54:35PM +0200, Lars Marowsky-Bree wrote: > > We need a patch like Mike Christie had, this: > > > > http://marc.theaimsgroup.com/?l=linux-kernel&m=107961883914541&w=2 > > > > The scsi core should decode the sense data and pass up the result, then dm > > need not decode

RE: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread goggin, edward
> -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf Of Lars > Marowsky-Bree > Sent: Thursday, April 21, 2005 5:50 PM > To: device-mapper development; Andreas Herrmann > Cc: Linux SCSI > Subject: Re: [dm-devel] Re: fastfail opera

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Lars Marowsky-Bree
On 2005-04-21T17:31:46, "goggin, edward" <[EMAIL PROTECTED]> wrote: > > No. Basically every time out error creates a "dunno why" error right > > now - could be the storage system itself, could be the network in > > between. > > > I was really thinking of the code where the sense key/asc/ascq makes

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread David S. Miller
Please don't add "linux-scsi-owner" to the CC: list like that. That goes to the list administrator (currently me), not the linux-scsi mailing list. There seems to be a rather prominent influx of people sending posts to the *-owner address lately, I wonder why as nothing has materially changed in

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Andreas Herrmann
Lars Marowsky-Bree <[EMAIL PROTECTED]> 21.04.2005 21:54 > On 2005-04-21T09:42:05, Patrick Mansfield <[EMAIL PROTECTED]> wrote: > > On Tue, Apr 19, 2005 at 07:19:53PM +0200, Andreas Herrmann wrote: > > > > We need a patch like Mike Christie had, this: > > > > http://marc.th

RE: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread goggin, edward
> -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf Of Lars > Marowsky-Bree > Sent: Thursday, April 21, 2005 5:19 PM > To: device-mapper development; Andreas Herrmann > Cc: Linux SCSI > Subject: Re: [dm-devel] Re: fastfail opera

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Lars Marowsky-Bree
On 2005-04-21T17:02:44, "goggin, edward" <[EMAIL PROTECTED]> wrote: > Depending on the "queue_if_no_path" feature has the current undesirable > side-effect of requiring intervention of the user space multipath components > to reinstate at least one of the paths to a useable state in the multipath

RE: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread goggin, edward
On Thursday, April 21, 2005 3:55 PM, Lars Marowsky-Bree wrote: > Together with the "queue_if_no_path" feature flag for dm-mpath that > should do what you need to handle this (arguably broken) array > behaviour: It'll queue until the error goes away and > multipathd retests > and reactivates the p

Re: [dm-devel] Re: fastfail operation and retries

2005-04-21 Thread Lars Marowsky-Bree
On 2005-04-21T09:42:05, Patrick Mansfield <[EMAIL PROTECTED]> wrote: > On Tue, Apr 19, 2005 at 07:19:53PM +0200, Andreas Herrmann wrote: > > Hi, > > > > I have question(s) regarding the fastfail operation of the SCSI stack. > > > > Performing multipath-tests with an IBM ESS I encountered problem

Re: fastfail operation and retries

2005-04-21 Thread Patrick Mansfield
On Tue, Apr 19, 2005 at 07:19:53PM +0200, Andreas Herrmann wrote: > Hi, > > I have question(s) regarding the fastfail operation of the SCSI stack. > > Performing multipath-tests with an IBM ESS I encountered problems. > During certain operations on an ESS (quiesce/resume and such) requests > on a

Re: fastfail operation and retries

2005-04-20 Thread Andreas Herrmann
??? <[EMAIL PROTECTED]> wrote: 20.04.2005 03:17 > what multipath are you using? Software, or hardware, > or both? We are using udm with evms (Linux on zSeries). Hardware setup is: - switched fabric FC-SAN, - 4 paths to each FC-LUN on the ESS 800 All 4 paths are "failing fast" du

fastfail operation and retries

2005-04-19 Thread Andreas Herrmann
Hi, I have question(s) regarding the fastfail operation of the SCSI stack. Performing multipath-tests with an IBM ESS I encountered problems. During certain operations on an ESS (quiesce/resume and such) requests on all paths fail temporarily with an data underrun (resid is set in the FCP-respons