Hi Meelis,
Can you try patch just posted here.
https://patchwork.kernel.org/patch/10816115/
It works on both sparcs - no warnings, normal bootup as before.
There is no cable attached so I do not know if it really works but that's the
same state as before.
--
Meelis Roos
#=4 fw=8.07.00 (9496).
[ 40.421151] qla2xxx [:04:00.0]-8038:3: Cable is unplugged...
[ 40.837159] qla2xxx [:83:00.0]-8038:4: Cable is unplugged...
--
Meelis Roos
host#=4 fw=8.07.00 (9496).
[ 69.884173] qla2xxx [:06:01.0]-8038:3: Cable is unplugged...
[ 71.068227] qla2xxx [0001:08:02.0]-8038:4: Cable is unplugged...
--
Meelis Roos
et
# CONFIG_RUNTIME_TESTING_MENU is not set
# CONFIG_MEMTEST is not set
# CONFIG_BUG_ON_DATA_CORRUPTION is not set
# CONFIG_SAMPLES is not set
CONFIG_HAVE_ARCH_KGDB=y
# CONFIG_KGDB is not set
CONFIG_UBSAN=y
CONFIG_UBSAN_ALIGNMENT=y
# CONFIG_TEST_UBSAN is not set
CONFIG_TRACE_IRQFLAGS_SUPPORT=y
# CONFIG_DEBUG_DCFLUSH is not set
--
Meelis Roos
O hang caused by irq vector automatic affinity
Works on HP DL360G6 with integrated smartarray, with no visible
regressions.
--
Meelis Roos (mr...@linux.ee)
> Hi Meelis,
>
> This issue should already be addressed by a very recent commit:
>
> 6a2cf8d3663e13e1 scsi: qla2xxx: Fix crashes in qla2x00_probe_one on probe
> failure
What tree is that commit in?
--
Meelis Roos (mr...@linux.ee)
; + qla2x00_mem_free(ha);
> +
> + qla2x00_free_queues(ha);
> +
>
> are unnecessary. These routines are already called by qla2x00_free_device
> just above
> in qla2x00_remove_one.
No, that was the point of my changes - they must not be called from
qla2x00_free_device or they will be d
also zero
the req and rsp pointers after freeing them once in the error handler of
qla2x00_probe_one().
This fixes memory corruption and further crashes in unrelated code when qla2200
init fails for some reason.
Signed-off-by: Meelis Roos
---
drivers/scsi/qla2xxx/qla_os.c | 11 ++-
1
Fix an obvious copy-paste error in freeing QLAFX00 response queue - the code
checked for rsp->ring but freed rsp->ring_fx00.
Signed-off-by: Meelis Roos
---
drivers/scsi/qla2xxx/qla_os.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/drivers/scsi/qla2xxx/qla_os.c b/d
do.
Note: the debug line identifier is the same 0x1198 for both cases. Maybe it
needs to be renumbered in the new case?
Signed-off-by: Meelis Roos
---
drivers/scsi/qla2xxx/qla_mbx.c | 18 +-
1 file changed, 13 insertions(+), 5 deletions(-)
diff --git a/drivers/scsi/qla2xxx
pointer but
freeing another.
--
Meelis Roos (mr...@linux.ee)
submit them separately). After that I can observe firmware
being loaded and verified but qla2x00_init_firmware fails.
detailed debug trace with qla2xxx.ql2xextended_error_logging=0x7fff
is available at http://kodu.ut.ee/~mroos/qla2200-sparc64-trace.txt
How can I debug it further?
--
Meelis
> This happens on a HP DL360 G6 with Smart Array 410i.
>
> Will try to bisect.
>
> IO completion timeout could be because of some IRQ toubles?
Reverting 84676c1f21e8ff54befe985f4f14dc1edc10046b fixes it for me (as
suggested by Laurence Oberman).
--
Meelis Roos (mr...@linux.ee)
--
Meelis Roos (mr...@linux.ee)
m aac_probe_one() so PCI layer assumes there is
driver attached, and tries to shut it down later.
Fix it by returning error from aac_probe_one() when card-specific init
function fails.
This fixes reboot on my HP NetRAID-4M with dead battery.
Signed-off-by: Meelis Roos
diff --git a/drivers/sc
_reg_2xxx __iomem *reg = &ha->iobase->isp;
/* Read all mbox registers? */
- mboxes = (1 << ha->mbx_count) - 1;
+ mboxes = (1ULL << ha->mbx_count) - 1;
if (!ha->mcp)
ql_dbg(ql_dbg_async, vha, 0x5001, "MBX pointer ERROR.\n");
else
--
Meelis Roos (mr...@linux.ee)
> Hello again.
And again...
>
> > > On Sep 18, 2017, at 3:49 AM, Meelis Roos wrote:
> > >
> > > Hello, I decided to widen the coverage of my kernel testbed and put some
> > > FC cards into servers. This one is a PCI-X QLA2340 in HP Proliant DL
map+0x99/0x120
? do_init_module+0x1a/0x245
do_init_module+0x83/0x245
load_module+0x2764/0x34a0
? kernel_read_file+0x150/0x320
SyS_finit_module+0x82/0xa0
do_fast_syscall_32+0xba/0x340
Signed-off-by: Meelis Roos
diff --git a/drivers/scsi/aacraid/sa.c b/drivers/scsi/aacraid/sa.c
in
; if (now.tv_nsec > NSEC_PER_SEC / 2)
> ++now.tv_sec;
>
> but I don't see why we add in half a second here. Any ideas?
I did not try to understand the details but I can confirm that this
patch makes the warnings go away.
--
Meelis Roos (mr...@linux.ee)
12.293094] kthread+0x13d/0x1f0
[ 12.293172] ? aac_send_hosttime+0xf0/0xf0 [aacraid]
[ 12.293231] ? __kthread_create_worker+0x110/0x110
[ 12.293289] ret_from_fork+0x19/0x24
[ 12.293345]
========
--
Meelis Roos (mr...@linux.ee)
Hello again.
> > On Sep 18, 2017, at 3:49 AM, Meelis Roos wrote:
> >
> > Hello, I decided to widen the coverage of my kernel testbed and put some
> > FC cards into servers. This one is a PCI-X QLA2340 in HP Proliant DL 380
> > G4 (first 64-bit generation of Pro
> On 08/19/2017 10:41 PM, Meelis Roos wrote:
> > Hello, I just tried Linux with the latest kernel (4.13-rc5+git) on a HP
> > DL360 G6 with HP branded ISP2432 HBA. The driver mentions unsupported
> > model of the card:
> >
> > [3.868589] scsi host1: qla2xxx
devinfo->target_mask = (0x01 << devinfo->target_offset);
+ } else {
+ devinfo->target_mask = 0;
+ }
}
void
--
Meelis Roos (mr...@linux.ee)
SS_STORAGE_SCSI)
@@ -249,10 +251,12 @@ ahc_linux_pci_dev_probe(struct pci_dev *pdev, const
struct pci_device_id *ent)
return (-ENODEV);
}
}
+ ahc_set_unit(ahc, ahc_linux_unit++);
ahc->dev_softc = pci;
error = ahc_pci_config(ahc, entry);
if (error != 0) {
ahc_free(ahc);
+ ahc_linux_unit--;
return (-error);
}
--
Meelis Roos (mr...@linux.ee)
[4.489976] qla2xxx [:06:02.0]-00fc:4: ISP2312: PCI-X (100 MHz) @
:06:02.0 hdma+ host#=4 fw=3.03.28 IPX.
--
Meelis Roos (mr...@linux.ee)
Just went and changed kernel conf to HPSA instead of old CCISS but got a
compilation failure:
drivers/scsi/scsi_transport_sas.o: In function `sas_bsg_initialize':
scsi_transport_sas.c:(.text+0x12fd): undefined reference to `bsg_setup_queue'
scsi_transport_sas.c:(.text+0x13b2): undefined reference
> +Smart Array P400i
> +Smart Array P600
> +Smart Array P700m
> +Smart Array P800
> +.fi
> .SS Configuration details
> To configure HP Smart Array controllers,
> use the HP Array Configuration Utility (either
>
--
Meelis Roos (mr...@ut.ee) http://www.cs.ut.ee/~mroos/
2432
SSVID/SSDID (0x103C,0x7041).
Is there some information I can provide to include this card in fully
supported list?
--
Meelis Roos (mr...@linux.ee)
> You should be able to suppress the "can't get device id' messages with:
Yes, these messages are gone and it still works.
--
Meelis Roos (mr...@linux.ee)
or host 0:C0:T0:L0
Direct-Access MAN3735MC
hpsa0: hpsa_update_device_info: can't get device id for host 0:C0:T-1:L-1
Direct-Access MAP3735NC
report luns requested format 2, got 0
--
Meelis Roos (mr...@linux.ee)
be worth trying hpsa driver instead of cciss,
with a longer term goal to to move users of cciss over to hpsa if
possible. Now that I have tested it, it seems not all older cards are
supported in hpsa - it's more than ID-s and interrupt masks.
--
Meelis Roos (mr...@linux.ee)
nd it but no sda is
detected and no bootup.
What next?
And, for readability, we should use something like "Using unsupported
board ID", not plain "unsupported board ID" - the last one leaves
assumption that it will not work, although it should.
--
Meelis Roos (mr...@linux.ee)
ll no go:
[5.199125] hpsa :00:04.0: unrecognized board ID: 0x40800e11, ignoring.
[5.282517] hpsa :00:04.0: Board ID not found
Added specific PCI ID and subdevice ID quad and I still get the same
messages and the adapter is ignored.
What am I doing wrong?
--
Meelis Roos (mr...@linux.ee)
nd of June, 5.4.0 appeared masked. Unmasking kgcc64
5.4.0 and building it made my pariscs work again, with 4.7-rc6 running
fine on all of them.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to major
0xc900035d2000.
[4.167084] qla2xxx [:07:00.0]-0034:3: MSI-X: Unsupported ISP 2432
SSVID/SSDID (0x103C,0x7041).
Why does the driver need to know subsystem ID-s at all?
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the
0x71/0x150
[4.260755] [] entry_SYSCALL64_slow_path+0x25/0x25
[4.260933]
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of
by this change:
Out of interest, how do other parisc users get the new compiler?
What distro are you using?
My pariscs are gentoo and still only the old versions are available on
hppa.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe l
[ef111e80] [c0069378] worker_thread+0x74/0x704
[2.650764] [ef111ef0] [c00731fc] kthread+0xd8/0x134
[2.657326] [ef111f40] [c0019394] ret_from_kernel_thread+0x5c/0x64
[2.663877]
--
Meelis Roos (mr...@linux
] [] ? rest_init+0x60/0x60
[1.732026]
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majord...@vger.kernel.org
More majo
;] filemap_fault+0x6f0/0x838
[<404067b8>] ext4_filemap_fault+0x58/0x90
[<403193f8>] __do_fault+0x78/0x180
[<40320504>] handle_mm_fault+0x134c/0x1ec0
CPU: 2 PID: 1 Comm: init Tainted: GW 4.6.0 #85
Backtrace:
[<000040216b58>] sho
+0xb6/0x1d0
[4.900284] [] ? sysenter_past_esp+0x40/0x6a
[4.900284]
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message
lk_mq_register_disk+0xe0/0x1a0
Caller[0063f880]: blk_register_queue+0xa0/0x120
Caller[0064dbfc]: add_disk+0x33c/0x480
Caller[006f3bd0]: loop_add+0x190/0x280
Caller[00a8c5b0]: loop_init+0x160/0x1b0
Caller[00426ea4]: do_one_initcall+0xe4/0x1e0
Caller[00
rred
error trap.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
t on this one, I will send it to you:
Revert change that breaks QLA2XXX on big-endian systems,
__constant_cpu_to_le16() is still needed.
Signed-off-by: Meelis Roos
diff --git a/drivers/scsi/qla2xxx/qla_fw.h b/drivers/scsi/qla2xxx/qla_fw.h
index 42bb357..88d3143 100644
--- a/drivers/scsi/qla2xxx/q
fe03e8a0 l2: 008992d8 l3:
0014
l4: 0122e950 l5: 00a88000 l6: 0002 l7:
017af400
i0: 0dff01af0558 i1: 00a36a90 i2: i3:
00a364d0
i4: 009a5f90 i5: f800fd4d3148 i6: f800fc96b001 i7:
00
h less likely to occur!
> >>
> >> How about this one?
> >
> > It make the machine work.
>
> Thanks for testing!
>
What's the status of this fix? It is still not applied on yesterdays
3.19.0-rc6-00105-gc59c961 git...
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
subsystem
[ 381.051395] EXT4-fs (sda1): mounted filesystem without journal. Opts: (null)
[ 381.894633] systemd-journald[70]: Received request to flush runtime journal
from PID 1
[ 387.067316] eth0: Link is up using internal transceiver at 100Mb/s, Full
Duplex.
--
Meelis Roos (mr...@linux.ee)
--
To unsub
> On 2014-11-14 15:59, Meelis Roos wrote:
> > > > > The second oops is in blk_mq_map_queue() which is a trivial
> > > > > two level cpu lookup. I wonder if there's something odd about
> > > > > cpu numbers on these big old sparc syste
> >> The second oops is in blk_mq_map_queue() which is a trivial
> >> two level cpu lookup. I wonder if there's something odd about
> >> cpu numbers on these big old sparc systems?
> >
> > CPU numbers are sparse - they are determined by hardware slot number and
> > some models only fill every ot
_realfault_common+0x10/0x20
[ 160.141167] [0046db08] kthread_data+0x8/0x20
[ 160.201591] [00791914] switch_to_pc+0x198/0x3a4
[ 160.265150] [00454bd4] do_exit+0x5f4/0x9e0
[ 160.323485] [00428744] die_if_kernel+0x184/0x300
[ 160.388088] [007901d4] unhandled_fault+0x84/
> On Mon, Nov 03, 2014 at 11:32:14PM +0200, Meelis Roos wrote:
> > Yes. I took the same 3.18.0-rc1-00422-g2cc9188-dirty kernel that had
> > just this patch reverted, it started the controller fine, detected disk,
> > mounted root, started multiple tasks and then some time af
gt; > 08:00:20:b6:ee:e2
> > > [ 255.235445] sd 1:0:4:0: [sdb] 35843670 512-byte logical blocks: (18.3
> > > GB/17.0 GiB)
> > > [ 255.237573] eth1: Quattro HME slot 0 (SBUS) 10/100baseT Ethernet
> > > 08:00:20:93:79:94
> > > [ 255.239977] eth2:
Adding 787176k swap on /dev/sda4. Priority:-1 extents:1
across:787176k
[ 267.775195] EXT4-fs (sda2): re-mounted. Opts: (null)
[ 268.309702] EXT4-fs (sda2): re-mounted. Opts: errors=remount-ro
[ 272.239364] EXT4-fs (sda1): mounting ext3 file system using the ext4
subsystem
[ 272.321644] EX
c9188-dirty kernel that had
just this patch reverted, it started the controller fine, detected disk,
mounted root, started multiple tasks and then some time after startin
exim it just hangs. This is consisten with what I saw during bisection.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe fr
.
Works fine on both DL380G3 and the other server with MPT and IDE CD.
--
Meelis Roos
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
This may not be the only problem - when bisecting, I also came to commits
that got past this step but hang after about 165 seconds of uptime while
running userspace startup scripts. But let that be another issue at the
moment.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list
perations(struct Scsi_Host
> *shost)
> if (scsi_device_online(sdev) && sdev->locked)
> scsi_eh_lock_door(sdev);
> }
> +#endif
>
> /*
>* next free up anything directly waiting upon the host. this
>
--
Meelis Roos (m
.17 release.
>
> So just to be fully clear, you never enabled scsi-mq on 3.17? To do
> that, you would have had to add a scsi_mod.use_blk_mq=1 boot parameter.
> The scsi-mq kconfig option did not show up until after 3.17 release.
Re-tested DL380G3 with 3.17 and manual scsi_mod.use_blk_mq=
.17 release.
>
> So just to be fully clear, you never enabled scsi-mq on 3.17? To do
> that, you would have had to add a scsi_mod.use_blk_mq=1 boot parameter.
> The scsi-mq kconfig option did not show up until after 3.17 release.
Yes, I never enabled it via command line, only noticed i
red in Kconfig as a new option. But I am not sure whan the
option appeared, before or after 3.17 release.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
> On 2014-10-29 05:46, Meelis Roos wrote:
> > > I tried 3.18-rc2 with blk-mq default on on HP ProLiant DL380 G3 (with HP
> > > CCISS RAID controller). It fails late in the bootup with "task
> > > scsi_eh_1:720 blocked for more than 120 seconds.&
test server with MPT SCSI RAID has similar problem,
scsi_mode.use_blk_mq=0 cures it but I can not get good trace (no serail
console). 3.18.0-rc2-00043-gf7e87a4 was tested there.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi"
do_last.isra.50+0x2f1/0xca0
[ 740.710601] [] ? link_path_walk+0x1e5/0x7b0
[ 740.772056] [] ? kmem_cache_alloc+0x91/0xa0
[ 740.833513] [] ? get_empty_filp+0xa7/0x170
[ 740.893928] [] path_openat+0xa1/0x580
[ 740.949146] [] do_filp_open+0x2c/0x80
[ 741.004361] [] ? __alloc_fd+0x69/0x100
[ 741.0
though.
> >
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
> > the body of a message to majord...@vger.kernel.org
> > More majordomo info at http://vger.kernel.org/majordomo-info.html
> ---end quoted text---
>
--
; > scsi: convert device_busy to atomic_t
>
> That's fixed upstream:
>
> commit 480cadc2b7e0fa2bbab20141efb547dfe0c3707c
Yes, works for both sparc64 and parisc.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" i
> On Tue, 2014-08-19 at 14:25 +0300, Meelis Roos wrote:
> > 3.16 scsi worked fine, 3.17-rc1 misbehaves on 3 of my sparc64 test
> > machines. E220R and E420R are with onboard 5c3875, V210 is with onboarc
> > 53c1010 and all behave the same. Any ideas whre to dig deeper? bi
8298e4] kernel_init+0x4/0x100
> >
> > [ 389.657868] [004060c4] ret_from_fork+0x1c/0x2c
> >
> > [ 389.720324] [] (null)
> >
> > [ 389.7755
] ret_from_fork+0x1c/0x2c
[ 389.720324] [] (null)
[ 389.775518] no locks held by swapper/0/1.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe
sg) and it is a workaround for me so far.
It seems we can not afford to default to so much debug info for this
nvram bit - at least IO completions should be filtered out.
Is there a way to change the nvram from the driver?
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send t
[006a7164] do_scan_async+0x4/0x20
[004817b8] async_run_entry_fn+0x58/0x120
---[ end trace 9a1420108ebfd590 ]---
Signed-off-by: Meelis Roos
diff --git a/drivers/scsi/qla1280.c b/drivers/scsi/qla1280.c
index 5a522c5..97dabd3 100644
--- a/drivers/scsi/qla1280.c
+++ b/drivers/scsi
onsecutive boots, thank you!
Tested-by: Meelis Roos
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
;non_tagged_cmd= (null),
ent=f800fb193bc0
[ 216.230837] kernel BUG at drivers/scsi/esp_scsi.c:620!
(the line number is of course wrong, it's the second BUG inside
esp_free_lun_tag)
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsub
] Kernel panic - not syncing: Aiee, killing interrupt handler!
[ 355.430295] Press Stop-A (L1-A) to return to the boot prom
.] Waiting for /dev to be fully populated...
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body
succeeds, no BUG. Scsi communication
works.
--
Meelis Roos (mr...@linux.ee)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
00 00 65 a1 14 00 00 00 89 45 f0 31 c0
8b 43 54 89 55 8c 8b 13 89 4d 88 8b 40 68 85 c0 74 08 8b 80 4c 01 00 00 <8b> 38
8d 45 98 8b 32 89 45 98 8b 4d 8c 8d 55 a0 89 45 9c 8b 45
[42051.799957] EIP: [] scsi_send_eh_cmnd+0x33/0x320 SS:ESP
0068:f5f8be2c
[42051.800011] CR2:
[420
2...
Nope, this is a differnet computer - SBus-only Ultra 1 with sparc64
architecture. The test I meant to do but could't was to load the libsas
module.
--
Meelis Roos ([EMAIL PROTECTED])
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to
> Eliminate unnecessary PCI dependencies in libsas. It should use generic DMA
> and struct device like other subsystems.
Compiles fine, unfortunately I can not test kernels on this machine
since I have yet to dig out the reason my kernels do not boot.
--
Meelis Roos ([EMAIL PROTECTED]
mp;iscsi_transport_class);
--
Meelis Roos ([EMAIL PROTECTED])
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at http://vger.kernel.org/majordomo-info.html
# CONFIG_SCSI_QLOGICPTI is not set
# CONFIG_SCSI_DEBUG is not set
CONFIG_SCSI_SUNESP=y
--
Meelis Roos ([EMAIL PROTECTED])
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at http://vger.kernel.org/majordomo-info.html
79 matches
Mail list logo