https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=285993

--- Comment #8 from Jim Long <freebsd-bugzi...@umpquanet.com> ---
Thank you for taking a look.  Yes, bus_child_present is returning 0.  The
device is a sled-mounted SSD, as mentioned here:
https://www.dell.com/support/kbdoc/en-us/000185631/how-to-switch-nvme-for-precision-5820-and-7820-tower
  It doesn't seem like a loose connection, since it's consistently working
through n270866 and broken since n270867, which I sussed out using 'git bisect'
on a kernel build.  I have one theory to try, to see if I just get lucky.

Meanwhile ....

Adding a printf to stable/14-n271055-f9afcbff02a2-dirty:

13:02:18 /usr/src# git diff
diff --git a/sys/dev/nvme/nvme_ctrlr.c b/sys/dev/nvme/nvme_ctrlr.c
index 6f5d6ae74add..1850a5f0a389 100644
--- a/sys/dev/nvme/nvme_ctrlr.c
+++ b/sys/dev/nvme/nvme_ctrlr.c
@@ -1532,6 +1532,7 @@ nvme_ctrlr_destruct(struct nvme_controller *ctrlr,
device_t dev)
         * do hotplug), so failsafe with detecting all ff's (impossible with
         * this hardware) as the device being gone.
         */
+nvme_printf(ctrlr, "bus_child_present(dev) == %d\n", bus_child_present(dev));
        gone = bus_child_present(dev) == 0 ||
            (nvme_mmio_read_4(ctrlr, csts) == NVME_GONE);
        if (gone)


dmesg then shows:

# egrep -1 'nvme|pcib?10' /var/run/dmesg.boot 
pci9: <PCI bus> numa-domain 0 on pcib9
pcib10: <PCI-PCI bridge> irq 0 at device 2.0 numa-domain 0 on pci9
pcib10: failed to allocate initial I/O port window: 0-0xafff
pcib11: <PCI-PCI bridge> irq 0 at device 3.0 numa-domain 0 on pci9
--
cpufreq19: <CPU frequency control> numa-domain 0 on cpu19
pcib10: Power Fault Detected
pci10: <PCI bus> numa-domain 0 on pcib10
nvme0: <Generic NVMe Device> irq 0 at device 0.0 numa-domain 0 on pci10
pcib10: Power Fault Detected
hdacc0: <Realtek ALC255 HDA CODEC> at cad 0 on hdac0
--
pcm5: <NVIDIA (0x0080) (HDMI/DP 8ch)> at nid 7 on hdaa1
pcib10: Power Fault Detected
Timecounter "TSC-low" frequency 1655996617 Hz quality 1000
--
uhub0: <Intel XHCI root HUB, class 9/0, rev 3.00/1.00, addr 1> on usbus0
nvme0: bus_child_present(dev) == 0
nvme0: failing outstanding i/o
nvme0: ASYNC_EVENT_REQUEST (0c) sqid:0 cid:14 nsid:0 cdw10:00000000
cdw11:00000000
nvme0: ABORTED - BY REQUEST (00/07) crd:0 m:0 dnr:1 p:0 sqid:0 cid:14 cdw0:0
nvme0: ASYNC_EVENT_REQUEST (0c) sqid:0 cid:13 nsid:0 cdw10:00000000
cdw11:00000000
nvme0: ABORTED - BY REQUEST (00/07) crd:0 m:0 dnr:1 p:0 sqid:0 cid:13 cdw0:0
nvme0: ASYNC_EVENT_REQUEST (0c) sqid:0 cid:12 nsid:0 cdw10:00000000
cdw11:00000000
nvme0: ABORTED - BY REQUEST (00/07) crd:0 m:0 dnr:1 p:0 sqid:0 cid:12 cdw0:0
nvme0: ASYNC_EVENT_REQUEST (0c) sqid:0 cid:11 nsid:0 cdw10:00000000
cdw11:00000000
nvme0: ABORTED - BY REQUEST (00/07) crd:0 m:0 dnr:1 p:0 sqid:0 cid:11 cdw0:0
nvme0: ASYNC_EVENT_REQUEST (0c) sqid:0 cid:10 nsid:0 cdw10:00000000
cdw11:00000000
nvme0: ABORTED - BY REQUEST (00/07) crd:0 m:0 dnr:1 p:0 sqid:0 cid:10 cdw0:0
nvme0: ASYNC_EVENT_REQUEST (0c) sqid:0 cid:9 nsid:0 cdw10:00000000
cdw11:00000000
nvme0: ABORTED - BY REQUEST (00/07) crd:0 m:0 dnr:1 p:0 sqid:0 cid:9 cdw0:0
nvme0: ASYNC_EVENT_REQUEST (0c) sqid:0 cid:8 nsid:0 cdw10:00000000
cdw11:00000000
nvme0: ABORTED - BY REQUEST (00/07) crd:0 m:0 dnr:1 p:0 sqid:0 cid:8 cdw0:0
nvme0: ASYNC_EVENT_REQUEST (0c) sqid:0 cid:7 nsid:0 cdw10:00000000
cdw11:00000000
nvme0: ABORTED - BY REQUEST (00/07) crd:0 m:0 dnr:1 p:0 sqid:0 cid:7 cdw0:0
ada0 at ahcich0 bus 0 scbus0 target 0 lun 0
--
Trying to mount root from zfs:thumb/ROOT/default []...
nvme0: detached
pci10: detached
uhub0: 26 ports with 26 removable, self powered

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to