Re: mdadm's raid1 will not eliminate abnormal disk after 5 seconds under IO pressure

2019-01-30 Thread Xiao Ni
Yes, the failfast is used to fix the problem you described. It can't remove the active disk until all pending I/O finish without failfast. If there is no pending I/O, it can be removed immediately. Thanks Xiao On 01/30/2019 10:14 PM, 李春 wrote: I have read the description of the failfast f

[Bug 202425] 3w-9xxx: 3ware 9650SE-2LP RAID controller not working on AMD Ryzen system

2019-01-30 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=202425 --- Comment #3 from Bart Van Assche (bvanass...@acm.org) --- On 1/30/19 7:42 PM, bugzilla-dae...@bugzilla.kernel.org wrote: > https://bugzilla.kernel.org/show_bug.cgi?id=202425 > > --- Comment #2 from robert.smit...@protonmail.com --- > (In reply

[Bug 202425] 3w-9xxx: 3ware 9650SE-2LP RAID controller not working on AMD Ryzen system

2019-01-30 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=202425 --- Comment #2 from robert.smit...@protonmail.com --- (In reply to Bart Van Assche from comment #1) > Can you bisect this issue to identify the commit that introduced this > regression? I found it. The commit is: 60db3a4d8cc9073cf56264785197ba75e

Re: [PATCH v2 1/7] Introduce the bidi_supported flag in the host template structure

2019-01-30 Thread Martin K. Petersen
Bart, > It would help if you could explain why you are so strongly opposed > against keeping BIDI support in the kernel. My preference is to remove it because there is simply no good use case for it. The fact that this peculiar command was once defined and has since been obsoleted doesn't impl

Re: [PATCH v2 7/7] scsi: libsas: fix issue of swapping two sas disks

2019-01-30 Thread Jason Yan
On 2019/1/31 1:53, John Garry wrote: On 30/01/2019 08:24, Jason Yan wrote: The work flow of revalidation now is scanning expander phy by the sequence of the phy and check if the phy have changed. This will leads to an issue of swapping two sas disks on one expander. Assume we have two sas di

Re: [PATCH v2 1/7] Introduce the bidi_supported flag in the host template structure

2019-01-30 Thread Martin K. Petersen
Doug, > The T10 members that I have spoken to, have expressed surprise that > Linux kernel developers are contemplating removing SCSI bidi support. > True, XDWRITEREAD and bidi friends have been removed from draft SBC-4 > but they are still used by RAID vendors and there is no move to deprecate

Re: [PATCH v2 6/7] scsi: libsas: reset the phy address if discover failed

2019-01-30 Thread Jason Yan
On 2019/1/31 1:36, John Garry wrote: On 30/01/2019 08:24, Jason Yan wrote: When we failed to discover the device, the phy address is still kept in ex_phy. So when the next time we revalidate this phy the address and device type is the same, it will be considered as flutter and will not be dis

Re: [PATCH v2 4/7] scsi: libsas: split the replacement of sas disks in two steps

2019-01-30 Thread Jason Yan
On 2019/1/31 1:22, John Garry wrote: On 30/01/2019 08:24, Jason Yan wrote: Now if a new device replaced a old device, the sas address will change. Hmmm... not if it's a SATA disk, which would have some same invented SAS address. Yes, it's only for a SAS disk. We unregister the old devi

Re: [PATCH v2 3/7] scsi: libsas: optimize the debug print of the revalidate process

2019-01-30 Thread Jason Yan
On 2019/1/31 0:41, John Garry wrote: On 30/01/2019 08:24, Jason Yan wrote: sas_rediscover() returns error code if discover failed for a expander phy. And sas_ex_revalidate_domain() only returns the last phy's error code. So when sas_revalidate_domain() prints the return value of the discover

Re: [PATCH v2 2/7] scsi: libsas: only clear phy->in_shutdown after shutdown event done

2019-01-30 Thread Jason Yan
On 2019/1/31 0:26, John Garry wrote: On 30/01/2019 08:24, Jason Yan wrote: When the event queue is full of phy up and down events and reached the threshold, we will queue a shutdown-event, and set phy->in_shutdown so that we will not queue a shutdown-event again. But before the shutdown-event

Re: [PATCH v2 1/7] scsi: libsas: reset the negotiated_linkrate when phy is down

2019-01-30 Thread Jason Yan
On 2019/1/30 21:08, John Garry wrote: On 30/01/2019 08:24, Jason Yan wrote: If the device is unplugged or disconnected, the negotiated_linkrate still can be seen from the userspace by sysfs. This makes people confused and leaks information of the device last used. So let's reset the negotiate

[PATCH] scsi: aic94xx: fix module loading

2019-01-30 Thread James Bottomley
The aic94xx driver is currently failing to load with errors like sysfs: cannot create duplicate filename '/devices/pci:00/:00:03.0/:02:00.3/:07:02.0/revision' Because the PCI code had recently added a file named 'revision' to every PCI device. Fix this by renaming the aic94xx re

[PATCH v2] RDMA/srp: Rework SCSI device reset handling

2019-01-30 Thread Bart Van Assche
Since .scsi_done() must only be called after scsi_queue_rq() has finished, make sure that the SRP initiator driver does not call .scsi_done() while scsi_queue_rq() is in progress. Although invoking sg_reset -d while I/O is in progress works fine with kernel v4.20 and before, that is not the case wi

Re: [PATCH] RDMA/srp: Increase max_segment_size

2019-01-30 Thread Jason Gunthorpe
On Tue, Jan 22, 2019 at 10:25:20AM -0800, Bart Van Assche wrote: > The default behavior of the SCSI core is to set the block layer request > queue parameter max_segment_size to 64 KB. That means that elements of > scatterlists are limited to 64 KB. Since RDMA adapters support larger > sizes, increa

Re: [PATCH] RDMA/srp: Increase max_segment_size

2019-01-30 Thread Bart Van Assche
On Wed, 2019-01-30 at 17:50 -0500, Douglas Gilbert wrote: > On 2019-01-30 3:23 p.m., Bart Van Assche wrote: > > On Tue, 2019-01-22 at 15:47 -0500, Douglas Gilbert wrote: > > > On 2019-01-22 1:25 p.m., Bart Van Assche wrote: > > > > The default behavior of the SCSI core is to set the block layer req

Re: [PATCH] RDMA/srp: Increase max_segment_size

2019-01-30 Thread Douglas Gilbert
On 2019-01-30 3:23 p.m., Bart Van Assche wrote: On Tue, 2019-01-22 at 15:47 -0500, Douglas Gilbert wrote: On 2019-01-22 1:25 p.m., Bart Van Assche wrote: The default behavior of the SCSI core is to set the block layer request queue parameter max_segment_size to 64 KB. That means that elements o

Re: [PATCH] RDMA/srp: Increase max_segment_size

2019-01-30 Thread Bart Van Assche
On Tue, 2019-01-22 at 15:47 -0500, Douglas Gilbert wrote: > On 2019-01-22 1:25 p.m., Bart Van Assche wrote: > > The default behavior of the SCSI core is to set the block layer request > > queue parameter max_segment_size to 64 KB. That means that elements of > > scatterlists are limited to 64 KB. S

Re: [PATCH] RDMA/srp: Increase max_segment_size

2019-01-30 Thread Bart Van Assche
On Wed, 2019-01-30 at 12:21 -0700, Jason Gunthorpe wrote: > On Wed, Jan 30, 2019 at 10:59:34AM -0800, Bart Van Assche wrote: > > On Tue, 2019-01-22 at 10:25 -0800, Bart Van Assche wrote: > > > The default behavior of the SCSI core is to set the block layer request > > > queue parameter max_segment_

Re: [PATCH] RDMA/srp: Increase max_segment_size

2019-01-30 Thread Jason Gunthorpe
On Wed, Jan 30, 2019 at 10:59:34AM -0800, Bart Van Assche wrote: > On Tue, 2019-01-22 at 10:25 -0800, Bart Van Assche wrote: > > The default behavior of the SCSI core is to set the block layer request > > queue parameter max_segment_size to 64 KB. That means that elements of > > scatterlists are li

Re: [PATCH for-next] scsi: ufs: Update dwc driver maintainer to Pedro Sousa

2019-01-30 Thread Joao Pinto
Hello Bart, First of all thanks for the feedback. On 1/30/2019 5:54 PM, Bart Van Assche wrote: > On Wed, 2019-01-30 at 18:48 +0100, Joao Pinto wrote: >> Currently I am managing the Synopsys drivers & tools team (full-time) and >> so I am passing the DWC UFS driver maintenance to Pedro Sousa. >> >

Re: [PATCH] RDMA/srp: Increase max_segment_size

2019-01-30 Thread Bart Van Assche
On Tue, 2019-01-22 at 10:25 -0800, Bart Van Assche wrote: > The default behavior of the SCSI core is to set the block layer request > queue parameter max_segment_size to 64 KB. That means that elements of > scatterlists are limited to 64 KB. Since RDMA adapters support larger > sizes, increase max_

Re: [PATCH v2 7/7] scsi: libsas: fix issue of swapping two sas disks

2019-01-30 Thread John Garry
On 30/01/2019 08:24, Jason Yan wrote: The work flow of revalidation now is scanning expander phy by the sequence of the phy and check if the phy have changed. This will leads to an issue of swapping two sas disks on one expander. Assume we have two sas disks, connected with expander phy10 and ph

Re: [PATCH for-next] scsi: ufs: Update dwc driver maintainer to Pedro Sousa

2019-01-30 Thread Bart Van Assche
On Wed, 2019-01-30 at 18:48 +0100, Joao Pinto wrote: > Currently I am managing the Synopsys drivers & tools team (full-time) and > so I am passing the DWC UFS driver maintenance to Pedro Sousa. > > Signed-off-by: Joao Pinto > Cc: Pedro Sousa > Cc: Marc Gonzalez > Cc: Alex Lemberg > --- > MAIN

[PATCH] qla2xxx: Add new FC-NVMe enable BIT to enable FC-NVMe feature

2019-01-30 Thread Himanshu Madhani
From: Giridhar Malavali This patch adds new BIT detection to enable FC-NVMe feature in the driver. Signed-off-by: Giridhar Malavali Signed-off-by: Himanshu Madhani --- Hi Martin, This patch adds additional bit to enable FC-NVMe in the driver. Please apply this patch to 5.1/scsi-queue at you

[PATCH for-next] scsi: ufs: Update dwc driver maintainer to Pedro Sousa

2019-01-30 Thread Joao Pinto
Currently I am managing the Synopsys drivers & tools team (full-time) and so I am passing the DWC UFS driver maintenance to Pedro Sousa. Signed-off-by: Joao Pinto Cc: Pedro Sousa Cc: Marc Gonzalez Cc: Alex Lemberg --- MAINTAINERS | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --g

Re: [PATCH v2 6/7] scsi: libsas: reset the phy address if discover failed

2019-01-30 Thread John Garry
On 30/01/2019 08:24, Jason Yan wrote: When we failed to discover the device, the phy address is still kept in ex_phy. So when the next time we revalidate this phy the address and device type is the same, it will be considered as flutter and will not be discovered again. So the device will not be

Re: [PATCH v2 4/7] scsi: libsas: split the replacement of sas disks in two steps

2019-01-30 Thread John Garry
On 30/01/2019 08:24, Jason Yan wrote: Now if a new device replaced a old device, the sas address will change. Hmmm... not if it's a SATA disk, which would have some same invented SAS address. We unregister the old device and discover the new device in one revalidation process. But after we

Re: [PATCH v2 3/7] scsi: libsas: optimize the debug print of the revalidate process

2019-01-30 Thread John Garry
On 30/01/2019 08:24, Jason Yan wrote: sas_rediscover() returns error code if discover failed for a expander phy. And sas_ex_revalidate_domain() only returns the last phy's error code. So when sas_revalidate_domain() prints the return value of the discover process, we do not know if the revalidati

Re: [PATCH v2 1/7] Introduce the bidi_supported flag in the host template structure

2019-01-30 Thread Bart Van Assche
On Tue, 2019-01-29 at 09:35 +0100, Christoph Hellwig wrote: > I disagree with investing further effort into BIDI support. It is > dead for all practical purposes in standards and implementation, > and the fact that we found all these bugs in it just further confirms > that. The only answer to tha

Re: [PATCH v2 2/7] scsi: libsas: only clear phy->in_shutdown after shutdown event done

2019-01-30 Thread John Garry
On 30/01/2019 08:24, Jason Yan wrote: When the event queue is full of phy up and down events and reached the threshold, we will queue a shutdown-event, and set phy->in_shutdown so that we will not queue a shutdown-event again. But before the shutdown-event can be executed, every phy-down event wi

[Bug 201609] sysfs duplicate filename on driver loading Adaptec AIC-9410W

2019-01-30 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=201609 --- Comment #10 from Emil Velikov (emil.l.veli...@gmail.com) --- (In reply to James.Bottomley from comment #9) > On Wed, 2019-01-30 at 11:43 +, bugzilla-dae...@bugzilla.kernel.org > wrote: > > https://bugzilla.kernel.org/show_bug.cgi?id=201609

[Bug 201609] sysfs duplicate filename on driver loading Adaptec AIC-9410W

2019-01-30 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=201609 --- Comment #9 from james.bottom...@hansenpartnership.com --- On Wed, 2019-01-30 at 11:43 +, bugzilla-dae...@bugzilla.kernel.org wrote: > https://bugzilla.kernel.org/show_bug.cgi?id=201609 > > Emil Velikov (emil.l.veli...@gmail.com) changed:

Re: [Bug 201609] sysfs duplicate filename on driver loading Adaptec AIC-9410W

2019-01-30 Thread James Bottomley
On Wed, 2019-01-30 at 11:43 +, bugzilla-dae...@bugzilla.kernel.org wrote: > https://bugzilla.kernel.org/show_bug.cgi?id=201609 > > Emil Velikov (emil.l.veli...@gmail.com) changed: > >What|Removed |Added >

[Bug 201609] sysfs duplicate filename on driver loading Adaptec AIC-9410W

2019-01-30 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=201609 --- Comment #8 from james.bottom...@hansenpartnership.com --- On Wed, 2019-01-30 at 14:21 +, bugzilla-dae...@bugzilla.kernel.org wrote: > https://bugzilla.kernel.org/show_bug.cgi?id=201609 > > Bjorn Helgaas (bhelg...@google.com) changed: > >

Re: [Bug 201609] sysfs duplicate filename on driver loading Adaptec AIC-9410W

2019-01-30 Thread James Bottomley
On Wed, 2019-01-30 at 14:21 +, bugzilla-dae...@bugzilla.kernel.org wrote: > https://bugzilla.kernel.org/show_bug.cgi?id=201609 > > Bjorn Helgaas (bhelg...@google.com) changed: > >What|Removed |Added >

RE: [PATCH v4 1/3] scsi: ufs-bsg: Change the calling convention for write descriptor

2019-01-30 Thread Bean Huo (beanhuo)
Hi, Avri > >When we had a write descriptor query upiu, we appended the descriptor right >after the bsg request. This was fine as the bsg driver allows to allocate >whatever >buffer we needed in its job request. > >Still, the proper way to deliver payload, however small (we only write config >des

[Bug 201609] sysfs duplicate filename on driver loading Adaptec AIC-9410W

2019-01-30 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=201609 Bjorn Helgaas (bhelg...@google.com) changed: What|Removed |Added CC||bhelg...@google.com

Re: mdadm's raid1 will not eliminate abnormal disk after 5 seconds under IO pressure

2019-01-30 Thread 李春
I have read the description of the failfast feature. According to the phenomenon, it may be not the problem of failfast. Because when there are no io pressure, after stop the disk export on the storage node, the disk will be automatically eliminate from the md disk. However, if there is continuou

Re: mdadm's raid1 will not eliminate abnormal disk after 5 seconds under IO pressure

2019-01-30 Thread 李春
Ok, thanks. So we can look forward this bug will fix in rhel 6.11. If my system is base on rhel 6.7, is there any way to solve it? Xiao Ni 于2019年1月30日周三 下午5:15写道: > > > > On 01/30/2019 03:25 PM, Jack Wang wrote: > > 李春 于2019年1月30日周三 上午7:08写道: > >> # Description of problem: > >> We loaded a disk

[Bug 201609] sysfs duplicate filename on driver loading Adaptec AIC-9410W

2019-01-30 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=201609 --- Comment #6 from Emil Velikov (emil.l.veli...@gmail.com) --- I stand corrected: The aic94xx revision file exposes a string (which look like hexadecimal number) based on the revision number. Regardless, the ideas proposed earlier are still appl

Re: [PATCH v2 1/7] scsi: libsas: reset the negotiated_linkrate when phy is down

2019-01-30 Thread John Garry
On 30/01/2019 08:24, Jason Yan wrote: If the device is unplugged or disconnected, the negotiated_linkrate still can be seen from the userspace by sysfs. This makes people confused and leaks information of the device last used. So let's reset the negotiated_linkrate after the phy is down. Signed-

[PATCH] scsi: cxlflash: Prevent deadlock when adapter probe fails

2019-01-30 Thread Vaibhav Jain
Presently when an error is encountered during probe of the cxlflash adapter, a deadlock is seen with cpu thread stuck inside cxlflash_remove(). Below is the trace of the deadlock as logged by khungtaskd: cxlflash 0006:00:00.0: cxlflash_probe: init_afu failed rc=-16 INFO: task kworker/80:1:890 bl

Re: [PATCH v4 21/26] lpfc: Rework locking on SCSI io completion

2019-01-30 Thread Hannes Reinecke
On 1/28/19 8:14 PM, James Smart wrote: A scsi host lock is taken on every io completion to check whether the abort handler is waiting on the io completion. This is an expensive lock to take on all completion when rarely in an abort condition. Replace scsi host lock with command-specific lock. Sy

[Bug 201609] sysfs duplicate filename on driver loading Adaptec AIC-9410W

2019-01-30 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=201609 Emil Velikov (emil.l.veli...@gmail.com) changed: What|Removed |Added CC||emil.l.veli...@g

RE: [PATCH v1 1/1] scsi: ufs: Print uic error history in time order

2019-01-30 Thread Bean Huo (beanhuo)
Hi, Stanley I tested it on my own platform. This is very useful and thanks. > >Now uic errors are printed out of time order. > >Simply make it more readable by printing logs in time order, and printing "No >record" if history is empty. > >Signed-off-by: Stanley Chu >--- Reviewed-by: Bean Huo Te

[PATCH -next] scsi: libfc: Fix potential NULL pointer dereference

2019-01-30 Thread YueHaibing
There is a potential NULL pointer dereference in case fc_rport_create() fails and returns NULL. Fixes: 2580064b5ec6 ("scsi: libfc: Replace ->rport_create callback with function call") Signed-off-by: YueHaibing --- drivers/scsi/libfc/fc_lport.c | 4 1 file changed, 4 insertions(+) diff --g

Re: mdadm's raid1 will not eliminate abnormal disk after 5 seconds under IO pressure

2019-01-30 Thread Xiao Ni
On 01/30/2019 03:25 PM, Jack Wang wrote: 李春 于2019年1月30日周三 上午7:08写道: # Description of problem: We loaded a disk from two network of storage node via iscsi, merged into a disk through multipath, and made a raid1 with local disk by mdadm. However, when the storage machine of iscsi disk reboote

[PATCH v2 6/7] scsi: libsas: reset the phy address if discover failed

2019-01-30 Thread Jason Yan
When we failed to discover the device, the phy address is still kept in ex_phy. So when the next time we revalidate this phy the address and device type is the same, it will be considered as flutter and will not be discovered again. So the device will not be brought up. Fix this by reset the phy a

[PATCH v2 3/7] scsi: libsas: optimize the debug print of the revalidate process

2019-01-30 Thread Jason Yan
sas_rediscover() returns error code if discover failed for a expander phy. And sas_ex_revalidate_domain() only returns the last phy's error code. So when sas_revalidate_domain() prints the return value of the discover process, we do not know if the revalidation for every phy is successful or not. W

[PATCH v2 2/7] scsi: libsas: only clear phy->in_shutdown after shutdown event done

2019-01-30 Thread Jason Yan
When the event queue is full of phy up and down events and reached the threshold, we will queue a shutdown-event, and set phy->in_shutdown so that we will not queue a shutdown-event again. But before the shutdown-event can be executed, every phy-down event will clear phy->in_shutdown and a new shut

[PATCH v2 1/7] scsi: libsas: reset the negotiated_linkrate when phy is down

2019-01-30 Thread Jason Yan
If the device is unplugged or disconnected, the negotiated_linkrate still can be seen from the userspace by sysfs. This makes people confused and leaks information of the device last used. So let's reset the negotiated_linkrate after the phy is down. Signed-off-by: Jason Yan CC: John Garry CC: J

[PATCH v2 7/7] scsi: libsas: fix issue of swapping two sas disks

2019-01-30 Thread Jason Yan
The work flow of revalidation now is scanning expander phy by the sequence of the phy and check if the phy have changed. This will leads to an issue of swapping two sas disks on one expander. Assume we have two sas disks, connected with expander phy10 and phy11: phy10: 5000cca04eb1001d port-0:0:

[PATCH v2 4/7] scsi: libsas: split the replacement of sas disks in two steps

2019-01-30 Thread Jason Yan
Now if a new device replaced a old device, the sas address will change. We unregister the old device and discover the new device in one revalidation process. But after we deferred the sas_port_delete(), the sas port is not deleted when we registering the new port and device. The sas port cannot be

[PATCH v2 0/7] libsas: fix issue of swapping or replacing disks

2019-01-30 Thread Jason Yan
The work flow of revalidation now is scanning expander phy by the sequence of the phy and check if the phy have changed. This will leads to some issues of swapping disks or replacing a disk with a new one. This patchset addresses the issues above by these main changes: 1. Let the revalidation firs

[PATCH v2 5/7] scsi: libsas: check if the same device when flutter

2019-01-30 Thread Jason Yan
The ata device do not have a real sas address. If a ata device is replaced with another one, the sas address is the same. Now libsas treat this scenario as flutter and do not delete the old one and discover the new one. This will cause the data read from or write to the wrong device. And also when