Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-05-01 Thread Andy Farkas
On Fri, Apr 30, 2010 at 4:42 AM, Pete French wrote: > > I've copied in the original poster of the problem to see how he is > doing, but as far as I am concerned the problem has gone away. Certainly > the things I was doing before to triger it no longer do so. Of course > in the normal state of thi

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Alexander Motin
Scott Long wrote: > On Apr 29, 2010, at 10:56 PM, Alexander Motin wrote: >> Scott Long wrote: >>> On Apr 29, 2010, at 7:47 AM, Robert Noland wrote: Scott Long wrote: > On Apr 29, 2010, at 2:50 AM, Pete French wrote: >>> Thanks. First step successful - I can steadily reproduce problem o

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Scott Long
On Apr 29, 2010, at 10:56 PM, Alexander Motin wrote: > Scott Long wrote: >> On Apr 29, 2010, at 7:47 AM, Robert Noland wrote: >>> >>> Scott Long wrote: On Apr 29, 2010, at 2:50 AM, Pete French wrote: >> Thanks. First step successful - I can steadily reproduce problem on >> CURRENT. ra

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Alexander Motin
Scott Long wrote: > On Apr 29, 2010, at 7:47 AM, Robert Noland wrote: >> >> Scott Long wrote: >>> On Apr 29, 2010, at 2:50 AM, Pete French wrote: > Thanks. First step successful - I can steadily reproduce problem on > CURRENT. raidtest with 200 I/O streams over gmirror of two disks on same

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Scott Long
On Apr 29, 2010, at 7:47 AM, Robert Noland wrote: > > > Scott Long wrote: >> On Apr 29, 2010, at 2:50 AM, Pete French wrote: Thanks. First step successful - I can steadily reproduce problem on CURRENT. raidtest with 200 I/O streams over gmirror of two disks on same channel triggers

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Pete French
> I'm glad to hear it. But gmirror rebuild itself may be not enough for > test. It uses very few requests same time. You should manage "Queue > full" state, so you should make at least 150 concurrent write requests > to the mirror running same time. Am going to hammer it for a bit with a number of

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Alexander Motin
Pete French wrote: > ...and my other test amchine just completed a gmirror rebuild as well, with no > problems. So intially it does look very much like it > is fixed. Thanks Alexander! IIf I have any mmore problems I will > let you know I'm glad to hear it. But gmirror rebuild itself may be not en

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Pete French
...and my other test amchine just completed a gmirror rebuild as well, with no problems. So intially it does look very much like it is fixed. Thanks Alexander! IIf I have any mmore problems I will let you know -pete. ___ freebsd-stable@freebsd.org mailin

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Pete French
> Seems like I've found the reason. Attached patch fixes problem for me. Inetersting - one of my machines has ginished a gmirror resync. The first time I tried this it did lock up, but with media rea errors (which may be genuine on these old drives). But this tiime it has finished, and without the

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Pete French
> Seems like I've found the reason. Attached patch fixes problem for me. Thanks, am trying this now -pete. ___ freebsd-stable@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-stable To unsubscribe, send any mail to "freebs

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Alexander Motin
Alexander Motin wrote: > Pete French wrote: >>> I have some 29160N locally and I'll try to reproduce this. >> I would suggest you try gmirror across two drives - that is how >> both myself and the original poster first noticed the issue. > > Thanks. First step successful - I can steadily reproduce

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Robert Noland
Scott Long wrote: On Apr 29, 2010, at 2:50 AM, Pete French wrote: Thanks. First step successful - I can steadily reproduce problem on CURRENT. raidtest with 200 I/O streams over gmirror of two disks on same channel triggers issue in seconds. Any I/O on channel dying after both disks report "Q

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Scott Long
On Apr 29, 2010, at 2:50 AM, Pete French wrote: >> Thanks. First step successful - I can steadily reproduce problem on >> CURRENT. raidtest with 200 I/O streams over gmirror of two disks on same >> channel triggers issue in seconds. Any I/O on channel dying after both >> disks report "Queue full"

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Pete French
> Thanks. First step successful - I can steadily reproduce problem on > CURRENT. raidtest with 200 I/O streams over gmirror of two disks on same > channel triggers issue in seconds. Any I/O on channel dying after both > disks report "Queue full" error same time. The rest of system works > fine. If

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-29 Thread Alexander Motin
Pete French wrote: >> Thanks. First step successful - I can steadily reproduce problem on >> CURRENT. raidtest with 200 I/O streams over gmirror of two disks on same >> channel triggers issue in seconds. Any I/O on channel dying after both >> disks report "Queue full" error same time. The rest of s

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-28 Thread Alexander Motin
Pete French wrote: >> I have some 29160N locally and I'll try to reproduce this. > > I would suggest you try gmirror across two drives - that is how > both myself and the original poster first noticed the issue. Thanks. First step successful - I can steadily reproduce problem on CURRENT. raidtest

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-28 Thread Pete French
> I have some 29160N locally and I'll try to reproduce this. I would suggest you try gmirror across two drives - that is how both myself and the original poster first noticed the issue. cheers, -pete. ___ freebsd-stable@freebsd.org mailing list http:/

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-28 Thread Alexander Motin
Andy Farkas wrote: > RELENG_8 csup'd with date=2010.02.14.00.00 works perfectly for days. > > RELENG_8 csup'd with date=2010.02.15.00.00 dead-locks the disk I/O > subsystem. Network still operational but anything needing disk hangs. > Power-cycle required. > > kernel config is GENERIC with KDB, D

Re: MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-27 Thread Pete French
> RELENG_8 csup'd with date=2010.02.14.00.00 works perfectly for days. > > RELENG_8 csup'd with date=2010.02.15.00.00 dead-locks the disk I/O > subsystem. Network still operational but anything needing disk hangs. > Power-cycle required. An aditional point (and thanks to Andy for doing all the wor

MFC of "Large set of CAM improvements" breaks I/O to Adaptec 29160 SCSI controller

2010-04-27 Thread Andy Farkas
Hi, firstly: RELENG_8 csup'd with date=2010.02.14.00.00 works perfectly for days. RELENG_8 csup'd with date=2010.02.15.00.00 dead-locks the disk I/O subsystem. Network still operational but anything needing disk hangs. Power-cycle required. kernel config is GENERIC with KDB, DDB and BREAK_TO_DEB