[Bug 2085103] Re: pci hot plug get "retraining non-functional downstream link at 2.5GT/s]"

2025-02-23 Thread Zhanglei Mao
One of our partner verified that this issue have been fixed on 6.8.0-53 generic kernel -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2085103 Title: pci hot plug get "retraining non-functional downst

[Bug 2085103] Re: pci hot plug get "retraining non-functional downstream link at 2.5GT/s]"

2024-11-29 Thread Zhanglei Mao
Here is the kernel fix for OCP hot plug issue. [RFC] PCI: Fix the issue of link speed downgrade after link retraining - Patchwork https://patchwork.ozlabs.org/project/linux-pci/patch/20241107143758.12643-1-guojinhui.l...@bytedance.com/ -- You received this bug notification because you are a mem

[Bug 2088252] [NEW] intel_uncore ioremap error

2024-11-14 Thread Zhanglei Mao
Public bug reported: Our partner report this intel_uncore ioremap error from syslog: Oct 20 00:51:15 192-168-116-32 kernel: [ 45.925331] Trying to unpack rootfs image as initramfs... Oct 20 00:51:15 192-168-116-32 kernel: [ 45.926196] PCI-DMA: Using software bounce buffering for IO (SWIOTLB

[Bug 2088252] Re: intel_uncore ioremap error

2024-11-14 Thread Zhanglei Mao
It was on 4 nodes total 320 cores process and the kernel version is 5.19.0-32-generic. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2088252 Title: intel_uncore ioremap error To manage notification

[Bug 2085103] Re: pci hot plug get "retraining non-functional downstream link at 2.5GT/s]"

2024-11-12 Thread Zhanglei Mao
To reset(reboot) system, it can be correctly in Gen5. ** Also affects: linux (Ubuntu) Importance: Undecided Status: New ** Package changed: linux (Ubuntu) => linux-hwe-6.5 (Ubuntu) ** Also affects: linux-hwe-6.8 (Ubuntu) Importance: Undecided Status: New -- You received thi

[Bug 2085103] Re: pci hot plug get "retraining non-functional downstream link at 2.5GT/s]"

2024-10-21 Thread Zhanglei Mao
It seems a similar issue bug at https://bugzilla.kernel.org/show_bug.cgi?format=multiple&id=218765 And mentions there is a related patch since v6.5 https://www.spinics.net/lists/linux-pci/msg152506.html On Tue, 23 Apr 2024, Bjorn Helgaas wrote: > FYI. The retraining was added by a89c82249c3

[Bug 2085103] [NEW] pci hot plug get "retraining non-functional downstream link at 2.5GT/s]"

2024-10-21 Thread Zhanglei Mao
Public bug reported: Symptom : OCP NIC hot insert to system and can't archive to Gen5 speed Hardware configuration : CPU : SRF-SP Q4WS *2 OCP NIC MCX753436MC-HEAB (FW : 28.39.2048) Test scenario as below : 1. Boot into OS 2. Check system status is fine and clear all dmesg and SEL 3. Unload mlx

[Bug 2085103] Re: pci hot plug get "retraining non-functional downstream link at 2.5GT/s]"

2024-10-21 Thread Zhanglei Mao
From dmesg below, it seems this "broken device, retraining non-functional downstream link at 2.5GT/s" is related due to the message of " pci :4c:00.0: 32.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x16 link at :4b:02.0 (capable of 504.112 Gb/s with 32.0 GT/s PCIe x16 link

[Bug 2080065] [NEW] package ukui-wallpapers (not installed) failed to install/upgrade: trying to overwrite '/usr/share/backgrounds/calla.png', which is also in package ubuntukylin-wallpapers-focal 24.

2024-09-09 Thread mao
Public bug reported: ran apt install ukui-wallpapers ProblemType: Package DistroRelease: Ubuntu 24.04 Package: ukui-wallpapers (not installed) ProcVersionSignature: Ubuntu 6.8.0-41.41-generic 6.8.12 Uname: Linux 6.8.0-41-generic x86_64 NonfreeKernelModules: nvidia_modeset nvidia ApportVersion: 2.

[Bug 2018539] Re: File selector extremely slow

2024-07-31 Thread Lei Mao
Any follow-up? The new UI is terribly slow. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2018539 Title: File selector extremely slow To manage notifications about this bug go to: https://bugs.laun

[Bug 2071442] Re: platform coretemp.0: Adding Core 128 failed

2024-06-28 Thread Zhanglei Mao
Apr 30 01:44:54 192-168-119-54 kernel: [ 31.650978] Apr 30 01:44:54 192-168-119-54 kernel: [ 31.652249] UBSAN: array-index-out-of-bounds in /build/linux-yUfQuC/linux-5.15.0/drivers/hwmon/coretemp.c:473:17 Apr 30

[Bug 2071442] [NEW] platform coretemp.0: Adding Core 128 failed

2024-06-28 Thread Zhanglei Mao
Public bug reported: The following error appears in the system log when the number of CPU cores exceeds 128 ** Affects: linux-hwe-5.15 (Ubuntu) Importance: Undecided Status: New -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to U

[Bug 2055238] Re: FC680i(qla2xxx) get dma mapping resulted in different sg counts error on 22.04 but not 20.04.5

2024-02-28 Thread Zhanglei Mao
It seems specific to 5.15.0-25 version and have been fixed since 5.15.0-46-generic from 20.04.5 hwe-kernel and 5.15.0-97-generic (both are tested and verified -- worked well) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs

[Bug 2055238] Re: FC680i(qla2xxx) get dma mapping resulted in different sg counts error on 22.04 but not 20.04.5

2024-02-27 Thread Zhanglei Mao
From lspci 63:00.0 Fibre Channel [0c04]: QLogic Corp. ISP2722-based 16/32Gb Fibre Channel to PCIe Adapter [1077:2261] (rev 01) Subsystem: Hangzhou H3C Technologies Co., Ltd. NIC-FC680i-Mb-2x16G [193d:100d] Physical Slot: 9 63:00.1 Fibre Channel [0c04]: QLogic Corp. ISP2722-based

[Bug 2055238] Re: FC680i(qla2xxx) get dma mapping resulted in different sg counts error on 22.04 but not 20.04.5

2024-02-27 Thread Zhanglei Mao
screen pictures for test commands and error output ** Attachment added: "image003.png" https://bugs.launchpad.net/ubuntu/+source/linux-hwe-5.15/+bug/2055238/+attachment/5749969/+files/image003.png -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscri

[Bug 2055238] Re: FC680i(qla2xxx) get dma mapping resulted in different sg counts error on 22.04 but not 20.04.5

2024-02-27 Thread Zhanglei Mao
screen pictures for test commands and error output ** Attachment added: "image001.png" https://bugs.launchpad.net/ubuntu/+source/linux-hwe-5.15/+bug/2055238/+attachment/5749968/+files/image001.png -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscri

[Bug 2055238] [NEW] FC680i(qla2xxx) get dma mapping resulted in different sg counts error on 22.04 but not 20.04.5

2024-02-27 Thread Zhanglei Mao
Public bug reported: On of our partner reported that FC680i internalLoopBack test would fail on 22.04 but worked fine on 20.04.5. The dmesg error log is [ 941.140500] qla2xxx [:63:00.1]-701c:7: dma mapping resulted in different sg counts, request_sg_cnt: 2 dma_request_sg_cnt: 1 reply_sg_cn

[Bug 2054863] Re: AMD cpu (/proc/iomme 0-0xf2ffffff) is incorrectly reserved

2024-02-26 Thread Zhanglei Mao
** Summary changed: - AMD /proc/iomme 0-0xf2ff is incorrectly reserved + AMD cpu (/proc/iomme 0-0xf2ff) is incorrectly reserved ** Summary changed: - AMD cpu (/proc/iomme 0-0xf2ff) is incorrectly reserved + AMD cpu (/proc/iomme 0-0xf2ff) is incorrectly reserved on 22.04 ga-kerne

[Bug 2054860] [NEW] AMD-Vi Failed to allocate IRTE

2024-02-25 Thread Zhanglei Mao
Public bug reported: A partner report below error which seems a fixed issue from upstream and it affect NVME performance, hope to backport it. Oct 27 06:05:17 quanta kernel: [ 1238.707245] AMD-Vi: Failed to allocate IRTE The related patch and discussion is below: https://lkml.org/lkml/2020/10/1

[Bug 2054863] Re: AMD /proc/iomme 0-0xf2ffffff is incorrectly reserved

2024-02-25 Thread Zhanglei Mao
Broken one: cat proc/iomem -f2ff : Reserved -e1ff : Reserved -d6ff : Reserved -cbff : Reserved Good one: -0fff : Reserved 1000-0009 : System RAM 000a-000b : PCI Bus :60 000a-000b : Reserved 00

[Bug 2054863] Re: AMD /proc/iomme 0-0xf2ffffff is incorrectly reserved

2024-02-25 Thread Zhanglei Mao
The CPU is below: processor : 0 vendor_id : AuthenticAMD cpu family : 25 model : 17 model name : AMD Eng Sample: 100-00894-04 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.n

[Bug 2054863] [NEW] AMD /proc/iomme 0-0xf2ffffff is incorrectly reserved

2024-02-25 Thread Zhanglei Mao
Public bug reported: During kdump verify on Ubuntu 22.04 ga-kernel 5.15.0, kdump would fail beause 0-0xf2ff(0-3.79G) have been reserved. It seems have been fixed on Ubuntu 22.04.2 HWE-kernel of 5.19.0-32-generic. ** Affects: ubuntu Importance: Undecided Status: New -- You rec

[Bug 1976511] Re: kernel taint ( warning ) caused by smpboot.c: on 5.4.0

2022-06-01 Thread Zhanglei Mao
It seems very similar to https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1882478/ which have been fixed on Bionic. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1976511 Title: kernel taint ( wa

[Bug 1976511] Re: kernel taint ( warning ) caused by smpboot.c: on 5.4.0

2022-06-01 Thread Zhanglei Mao
** Changed in: linux (Ubuntu) Status: Incomplete => Confirmed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1976511 Title: kernel taint ( warning ) caused by smpboot.c: on 5.4.0 To manage no

[Bug 1976511] Re: kernel taint ( warning ) caused by smpboot.c: on 5.4.0

2022-06-01 Thread Zhanglei Mao
You can find above message in kernel.log file. ** Attachment added: "sosreport" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1976511/+attachment/5594318/+files/sosreport-G292-280-18.04HWE5.4.0-110-generic-2022-05-25-cvumffx.tar.xz -- You received this bug notification because you are

[Bug 1976511] Re: kernel taint ( warning ) caused by smpboot.c: on 5.4.0

2022-06-01 Thread Zhanglei Mao
We didn't find this on Ubuntu 20.04.4 LTS hwe-kernel ( 5.13.0-41-generic). Both Ubuntu 20.04.4 ga-kernel and 18.04.6 hwe-kernel (5.4.0-107-generic) are found. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net

[Bug 1976511] [NEW] kernel taint ( warning ) caused by smpboot.c: on 5.4.0

2022-06-01 Thread Zhanglei Mao
Public bug reported: Apr 6 02:33:31 G292-280 kernel: [0.007531] [ cut here ] Apr 6 02:33:31 G292-280 kernel: [0.007531] sched: CPU #20's llc-sibling CPU #0 is not on the same node! [node: 1 != 0]. Ignoring dependency. Apr 6 02:33:31 G292-280 kernel: [0.0075

[Bug 1975922] Re: custom image deploy get failed in cmd-curthooks of DISTROS[DISTROS.index(distname)]

2022-05-30 Thread Zhanglei Mao
@Alberto Thanks for help. Below is log file. ** Attachment added: "mass deploy log file" https://bugs.launchpad.net/maas/+bug/1975922/+attachment/5593944/+files/messages-0531.txt -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. htt

[Bug 1975922] Re: custom image deploy get failed in cmd-curthooks of DISTROS[DISTROS.index(distname)]

2022-05-30 Thread Zhanglei Mao
The MAAS version is 3.1.0 (installed by sanp) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1975922 Title: custom image deploy get failed in cmd-curthooks of DISTROS[DISTROS.index(distname)] To m

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-26 Thread Zhanglei Mao
To summary #1 Symptom: On AMD EPYC, ROME server platform, SATA hot plug not working on Ubuntu 22.04 LTS. #2 Root cause: Ubuntu kernel compile with configure CONFIG_SATA_MOBILE_LPM_POLICY=3. During devices scan ( boot, pci scan, ahci driver load), if didn't detected any valid sata

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-24 Thread Zhanglei Mao
this difference? thanks, Mao -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage notifications about this bug go to:

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-24 Thread Zhanglei Mao
@Mario, Thanks for those deep and detail analyse for the root cause. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-23 Thread Zhanglei Mao
difference of 0,1,2 for lpm policy. thanks, Mao -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage notifications about this

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-20 Thread Zhanglei Mao
For #40, trace data for hotplug not working ** Attachment added: "trace-no-hotplug.zip" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1971576/+attachment/5591529/+files/trace-no-hotplug.zip -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscri

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-20 Thread Zhanglei Mao
For #39, Marios's,partner said all fails ( hotplug was not working). I guess we were expected it can work. So I will check him if he set kernel parameter correctly. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpa

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-20 Thread Zhanglei Mao
For #40, I asked them to collect trace data on both hotplug working and not. ** Attachment added: "trace-hotplug.zip" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1971576/+attachment/5591527/+files/trace-hotplug.zip -- You received this bug notification because you are a member of Ub

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-18 Thread Zhanglei Mao
Hey Haled, Mario, Both 5.15 and 5.18 are failed, kernel were from (Khaled's) #37 building. "uname -r "output from screenshots are: 5.15.0-32-generic 5.18.0-4-generic thanks! -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. htt

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-17 Thread Zhanglei Mao
Hello Kahled, For Mario's comments#33, will you or can you build a main kernel (v5.16) for testing? //thanks. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regres

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-17 Thread Zhanglei Mao
Hello Kahled, First, thanks for your effort to find this root cause in a short time. For your comment #31, parnter has confirmed it is same for other type of disk. I also asked them to raise this issue to AMD and AMD technical guy reply below. "Yes, 1022:7901h is AMD SATA AHCI controlle

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-17 Thread Zhanglei Mao
Hello Khaled, the Version 3007 failed. thanks! //Mao -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage notifications

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-16 Thread Zhanglei Mao
Hello Khaled, the 3006 kernel can pass. thanks //Mao -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage notifications

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-13 Thread Zhanglei Mao
Hello Khaled, Version 3005 hot plug fail. Thanks. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage notifications abo

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-12 Thread Zhanglei Mao
Hello Kahled, Version 3004 hot plug can pass. thanks //Mao -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-11 Thread Zhanglei Mao
Hello Kahled, Version 3003 hot plug passed. //thanks -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage notifications a

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-11 Thread Zhanglei Mao
Hello Kahled, thanks for quick response. Your understanding for 3000 bug and 3002 no bug is correct. I have asked them to verify 3003 now. //thanks -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-11 Thread Zhanglei Mao
Hello Khaled, Version 3002 hot plug can pass. //thanks. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server To manage notification

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-10 Thread Zhanglei Mao
Hello Khaled, thanks for share detail information for your patching and explains of unable building mulit-kernel. The partner engineer did use 3000 kernel for sata hotplug test as they sent me a screenshot of "uname -r" output. I have asked them to test new 3002 kernel now. -- You received this

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-10 Thread Zhanglei Mao
Hello Khale, hot plug test is fail for this version kernel. Please build next kernel. By the way, if it possible to build multi kernel, so that they can test them all in one shot. You know, the partner engineer is working on home this week and he have to look for someone else on office each time.

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-10 Thread Zhanglei Mao
Hello Khaled, I have asked partner engineer to test your build kernel. thanks. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EPYC (Asus) server

[Bug 1971576] Re: SATA device hot plug regression on AMD EPYC (Asus) server

2022-05-08 Thread Zhanglei Mao
** Summary changed: - SATA device hot plug regression on AMD EYPC (Asus) server + SATA device hot plug regression on AMD EPYC (Asus) server -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Titl

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-06 Thread Zhanglei Mao
Test it as boot int OS with selected 99 kernel, insert SATA and remove SATA, run sosreport. ** Attachment added: "sosrport for 5.4.0-99 kernel which hotplug does work" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1971576/+attachment/5587308/+files/sosreport-rd1-5.4.0-99-2022-05-06-sna

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-06 Thread Zhanglei Mao
Test it as boot int OS with selected 100 kernle, insert SATA and remove SATA, run sosreport. ** Attachment added: "sosrport for 5.4.0-100 kernel which hotplug doesn't work" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1971576/+attachment/5587307/+files/sosreport-rd1-5.4.0-100-2022-05-0

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-06 Thread Zhanglei Mao
I was told the test are boot into OS with select kernel, insert sata, remove, and run sosreport. Below is syslog info related to boot and sdb or sdc. --sata hotplug didn't work on 100 kernel -- grep -e 'Linux version' -e 'sdb' -e 'sdc' -e 'SATA link' syslog | tail -n25 May 6 07:24:42 rd1 kernel

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-06 Thread Zhanglei Mao
Partner have verified that 5.4.0-99 hotplug works, while 5.4.0-100 doesn't -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EYPC (Asus) server To

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-05 Thread Zhanglei Mao
For Rome, it is the same as Melian which the 20.04.4 kernel 5.4.0-42-generic can work with hotplug, while kernel 5.4.0-109-generic does't. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title:

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-05 Thread Zhanglei Mao
more test as below: 5.11.0-27 hotplug works 5.13.0-40 hotplug doesn't work -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EYPC (Asus) server To

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-04 Thread Zhanglei Mao
Below is full sosreports of our test. Test and log please refer to last 2 boot (42 and 109 kernel) May 4 05:49:11 rd1 kernel: [0.00] Linux version 5.4.0-42-generic (buildd@lgw01-amd64-038) (gcc version 9.3.0 (Ubuntu 9.3.0-10ubuntu2)) #46-Ubuntu SMP Fri Jul 10 00:24:02 UTC 2020 (Ubuntu 5.4

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-04 Thread Zhanglei Mao
For 5.4.0-109 kernel, if "rescan-scsi-bus" after plug in, it can find disk. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1971576 Title: SATA device hot plug regression on AMD EYPC (Asus) server To

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-04 Thread Zhanglei Mao
5. un-plugin sata disk; restart system 6. restart and boot into 5.4.0-109 kernel 7. login and monitor kernel log like sudo tail -f /var/log/syslog 8. plug in sata disk, there no similar log and no disk was found. -- You received this bug notification because you are a member of Ubuntu Bugs, whic

[Bug 1971576] Re: SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-04 Thread Zhanglei Mao
reproduce method 1. On Ubuntu 20.04, install both 5.4.0-42 and 5.4.0-109 kernel 2. restart and boot into 5.4.0-42 kernel via grub menu 3. login and monitor kernel log like sudo tail -f /var/log/syslog 4. plug in sata disk, the log showed disk was dected like below: --- May 4 05:5

[Bug 1971576] [NEW] SATA device hot plug regression on AMD EYPC (Asus) server

2022-05-04 Thread Zhanglei Mao
Public bug reported: SATA disk hot plug can't work on Ubuntu 20.04 ga-kernel of "Linux version 5.4.0-109-generic", but it works on earlier version of " Linux version 5.4.0-42-generic" ** Affects: linux (Ubuntu) Importance: Undecided Status: Incomplete -- You received this bug noti

[Bug 1934112] Re: Intel 100G nic get call trace for sosreport on focal ga-kenel

2021-06-30 Thread Zhanglei Mao
The call trace seems correlating to ethtools like below: Jun 18 01:19:08 frank-viper kernel: [69418.453457] BUG: unable to handle page fault for address: b2660b04 Jun 18 01:19:08 frank-viper kernel: [69418.465204] #PF: supervisor read access in kernel mode Jun 18 01:19:08 frank-viper ke

[Bug 1934112] [NEW] Intel 100G nic get call trace for sosreport on focal ga-kenel

2021-06-30 Thread Zhanglei Mao
Public bug reported: On Asus Server, when it run "sosreport -a" to the network step, the console would print below call trace and the 100G Nic port became unreachable. call trace on ga-kernel ( v5.4): https://paste.ubuntu.com/p/TC5Jn6hYGF/ ** Affects: ubuntu Importance: Undecided S

[Bug 1911828] [NEW] Ubuntu 18.04.5/20.04 kernel doesn't have latest i40e driver for X710-T2L module

2021-01-14 Thread Zhanglei Mao
Public bug reported: Ubuntu 18.04.5/20.04 kernel doesn't have latest i40e driver for X710-T2L module,Please add the driver on next kernel version release. More details please refer to https://bugs.launchpad.net/quantatw/+bug/1905673 ** Affects: linux (Ubuntu) Importance: Undecided

[Bug 1909518] Re: bcache register_bdev() error cannot allocate memory

2020-12-29 Thread Zhanglei Mao
For the 4K sectors, the max limits should be 8T then. (1024G*4096/512) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1909518 Title: bcache register_bdev() error cannot allocate memory To manage no

[Bug 1909518] Re: bcache register_bdev() error cannot allocate memory

2020-12-29 Thread Zhanglei Mao
The command example to change 512 sector to 4K for “Intel P4510 4.0TB U.2 NVMe SSD” ubuntu@prdhci01a:~$ sudo nvme id-ns /dev/nvme1 -n 1 -H |grep "LBA Format" [3:0] : 0 Current LBA Format Selected LBA Format 0 : Metadata Size: 0 bytes - Data Size: 512 bytes - Relative Performance: 0x2 Goo

[Bug 1909518] Re: bcache register_bdev() error cannot allocate memory

2020-12-29 Thread Zhanglei Mao
This max 1024G size applies to the 512byte sector, most NVME ssd can change ( LBA format ) to 4K and the max limits will be 16T than. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1909518 Title: bc

[Bug 1909518] Re: bcache register_bdev() error cannot allocate memory

2020-12-28 Thread Zhanglei Mao
Also, 18.04 ga-kernel worked fine too. ** Description changed: On 20.04(focal) ga-kernel and 18.04 (hwe-kernel). To make.bcache would fail as below if the back devices are large than 1024G ( 1025 would fail, but not 1024 ). [ 727.756453] bcache: register_bdev() error nvme12n1: cannot

[Bug 1909518] [NEW] bcache register_bdev() error cannot allocate memory

2020-12-28 Thread Zhanglei Mao
Public bug reported: On 20.04(focal) ga-kernel and 18.04 (hwe-kernel). To make.bcache would fail as below if the back devices are large than 1024G ( 1025 would fail, but not 1024 ). [ 727.756453] bcache: register_bdev() error nvme12n1: cannot allocate memory [ 727.756456] bcache: register_bcach

[Bug 1900768] Re: Upgrade from 18.04 failed

2020-10-21 Thread Peter Mao
Problem appears when the Xfce Power Manager locks the session (under the security tab). When the screensaver locks the screen, I am still able to log back in. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/

[Bug 1900776] [NEW] can't log back in after sleep/lock

2020-10-20 Thread Peter Mao
Public bug reported: After upgrading to 20.04.1 LTS, when the display goes to sleep and the session locks, I can't log back in. An incorrect password is notified as such, but the correct password blanks the screen. When I move the mouse, the login prompt re-appears. If I lock the session manual

[Bug 1877083] [NEW] qemu vnc console can only show "no suitable video mode" on taishan2280 server

2020-05-06 Thread Zhanglei Mao
Public bug reported: Qemu vnc console can only show "no suitable video mode" and nothing else both on opensack dashboard (vnc) or virt-manager. This was tested on Taishan2280 server (D05). Please see attached screen shoot. ** Affects: ubuntu Importance: Undecided Status: New -- Y

[Bug 1877083] Re: qemu vnc console can only show "no suitable video mode" on taishan2280 server

2020-05-06 Thread Zhanglei Mao
The commend link for qemu which created by nova compute, the vcn parameter are: -vnc 0.0.0.0:0 -k en-us -device virtio-gpu-pci,id=video0,max_outputs=1,bus=pci.5,addr=0x0 ubuntu@hc3:~$ ps auxww |grep -i qemu libvirt+ 1157628 3.6 0.1 5691124 737996 ? Sl 04:34 1:21 qemu-system-aarch64 -enable-kvm -

[Bug 1877083] Re: qemu vnc console can only show "no suitable video mode" on taishan2280 server

2020-05-06 Thread Zhanglei Mao
For 18.04.2 ga-kernel of Linux hc3 4.15.0-47-generic, the CONFIG_DRM=y and CONFIG_DRM_VIRTIO_GPU=y seems already set. Test with drm virtio-gpu modules load, it seems same. I tested by delete instance via dashboard and create it again on the same hc3.maas host. For the virt-manager still display fo

[Bug 1877083] Re: qemu vnc console can only show "no suitable video mode" on taishan2280 server

2020-05-06 Thread Zhanglei Mao
error screen picture ** Attachment added: "error screen picuture" https://bugs.launchpad.net/ubuntu/+bug/1877083/+attachment/5367295/+files/qemu-vnc-error-in-dashboard.png -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bu

[Bug 1877083] Re: qemu vnc console can only show "no suitable video mode" on taishan2280 server

2020-05-06 Thread Zhanglei Mao
qemu error sceen shot via virt-manager ** Attachment added: "qemu error sceen shot via virt-manager" https://bugs.launchpad.net/ubuntu/+bug/1877083/+attachment/5367297/+files/qemu-vn-error-via-virt-manager.png -- You received this bug notification because you are a member of Ubuntu Bugs, whi

[Bug 1877083] Re: qemu vnc console can only show "no suitable video mode" on taishan2280 server

2020-05-06 Thread Zhanglei Mao
>From the arm qemu wiki: https://wiki.qemu.org/Documentation/Platforms/ARM vns seems works and require as: virt machine graphics Graphics is also available, but unlike x86 there is no default display device enabled: you should select on from the Display devices section of "-device ?". One good op

[Bug 1877083] Re: qemu vnc console can only show "no suitable video mode" on taishan2280 server

2020-05-06 Thread Zhanglei Mao
the out put of "virsh dumpxml" ... ... This should be OpenStack configured for VNC and qemu devices. If I manual remove them, and the virt-manager can become normally which show grub menu, boot message and login etc. -- You received this bug notifi

[Bug 1877083] Re: qemu vnc console can only show "no suitable video mode" on taishan2280 server

2020-05-06 Thread Zhanglei Mao
After add this to /etc/default/grub, ubuntu@test-2:~$ cat /proc/cmdline BOOT_IMAGE=/boot/vmlinuz-4.15.0-47-generic root=UUID=0ba22f57-08ab-4b84-9592-936a48bd5692 ro console=tty0 quiet splash vt.handoff=1 ubuntu@test-2:~$ The issues is same which still show as "error: no suitalbe video mode found

[Bug 1869138] Re: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
The kernel of Linux version 3.10.0-862.el7.x86_64 which didn't find issues. From the below log of "usb 1-1: Product: Keyboard Hub", it seems ubuntu new hwe kernel can't detected and support this usb hub of the keybaord. The full log about usb 1-1 are below: usb 1-1: new high-speed USB device numbe

[Bug 1869138] Re: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
Compare the kernel related usb logs between keyboard detected and not, all previous logs are no difference, it began difference only from. I personally thought it might not issues from kernel as it likes external device (keyboad) didn't report devices information back and from testing log it sees j

[Bug 1869138] Re: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
The boot which failed to detected keyboard are below: Feb 25 20:56:06 user-QuantaGrid-D52PL-4U kernel: [ 12.688666] xhci_hcd :00:14.0: xHCI Host Controller Feb 25 20:56:06 user-QuantaGrid-D52PL-4U kernel: [ 12.739684] xhci_hcd :00:14.0: new USB bus registered, assigned bus number 1 F

[Bug 1869138] Re: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
The Quanta give me hits about error and it seems that in 68 reboot cycles there are 11 times which can't detected out the usb keyboard. $ grep 'Command line' kern.log |wc 68 952 12988 $ grep "usb1-port2: couldn't allocate usb_device" kern.log |wc 11 1321199 -- You rece

[Bug 1869138] Re: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
The boot with keyboard detected logs as below: Feb 25 20:30:37 user-QuantaGrid-D52PL-4U kernel: [ 13.242592] hub 1-0:1.0: 16 ports detected Feb 25 20:30:37 user-QuantaGrid-D52PL-4U kernel: [ 13.308359] probe of 1-0:1.0 returned 1 after 129930 usecs Feb 25 20:30:37 user-QuantaGrid-D52PL-4U ke

[Bug 1869138] Re: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
>From the kernel log, beside below errors, there are no errors for usb or keryboard, it also show correctly dectected keyboard for every reboot. $ grep -i error kern.log | tail -n5 Feb 26 10:04:45 user-QuantaGrid-D52PL-4U kernel: [4.321273] ERST: Error Record Serialization Table (ERST) suppor

[Bug 1869138] Re: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
Also, it was said that this issues didn't found on rhel. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1869138 Title: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 wit

[Bug 1869138] Re: The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
kernel log from testing ** Attachment added: "kernel.log" https://bugs.launchpad.net/ubuntu/+source/linux-hwe/+bug/1869138/+attachment/5341769/+files/log.zip -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.n

[Bug 1869138] [NEW] The keyboard does not have responeded when boot system into Ubuntu 18.04.4 with HWE(Legacy Mode) on Quanta server

2020-03-26 Thread Zhanglei Mao
Public bug reported: In Quanta server testing for 18.04.3 hwe-kernel 18.04.4 hwe-kernel and 20.04 daily build, it seems usb keyboard would: 1.Same OS would lost keybord and record usb usbx-portx: couldn't allocate usb_device log. 2.It is easy to reproduce the issue if add "open initcall debug" i

[Bug 1864612] Re: requests ipmi-tool to include lasted patch for supporting quanta server

2020-02-26 Thread Zhanglei Mao
It was also tested that freeipmi tools of ipmiconsole and ipmipower cmd works fine with Quanta server as it is standard for ipmi. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1864612 Title: request

[Bug 1864612] Re: requests ipmi-tool to include lasted patch for supporting quanta server

2020-02-26 Thread Zhanglei Mao
Got feedback about what was fixed by the patch as below: New IPMITOOL QCT support OEM DIMM description as below, so customer can know which CPU/DIMM/slot Logged "Correctable ECC / SBE Warning Threshold (DIMM A0) - Asserted": ipmitool -U admin -P admin -H 10.10.12.21 raw 0xa 0x44 0x0 0x0 0x2 0x0

[Bug 1864612] [NEW] requests ipmi-tool to include lasted patch for supporting quanta server

2020-02-24 Thread Zhanglei Mao
Public bug reported: The existing ipmitool can't works for Quanta server as it didn't include lasted patches, even in 20.04. >From Quanta feedback that pmitools in 20.04 can work for quanta server if >apply this patch set: https://github.com/ipmitool/ipmitool/commit/5c033c06abb45bc183f42cd758c61

[Bug 1862723] Re: 18.04 can't auto load ub_iverbs (18.04.3 hwe can) for Mellanox PMD nics

2020-02-10 Thread Zhanglei Mao
in 18.04 ga-kenrel, the mellanox nic PMD would fail. after manually load ib_uverbs, then it works fine. 2019-12-02T15:49:57.243Z|00306|netdev|WARN|dpdk-360bbd6: could not set configuration (Invalid argument) 2019-12-02T15:49:57.262Z|00307|dpdk|INFO|EAL: PCI device :3b:00.0 on NUMA socket 0 2

[Bug 1862723] [NEW] 18.04 can't auto load ub_iverbs (18.04.3 hwe can) for Mellanox PMD nics

2020-02-10 Thread Zhanglei Mao
Public bug reported: 18.04 ga-kernel to load ib_uverbs model which required by Mellanox PMD ( 18.04.3 hwe can auto load via model depends) ** Affects: ubuntu Importance: Undecided Status: New -- You received this bug notification because you are a member of Ubuntu Bugs, which is s

[Bug 1862722] [NEW] call trace during crash kernel boot on Huawei server for 18.04 ga-kernel

2020-02-10 Thread Zhanglei Mao
Public bug reported: To configure kernel crash dump, if memory is less than 576M, it would get oom and to configure crashkernel=2G-:576M, it would get call trace. ** Affects: ubuntu Importance: Undecided Status: New -- You received this bug notification because you are a member of

[Bug 1862722] Re: call trace during crash kernel boot on Huawei server for 18.04 ga-kernel

2020-02-10 Thread Zhanglei Mao
root@node1:~# dmesg |grep -i crash [0.00] Command line: BOOT_IMAGE=/vmlinuz-4.15.0-72-generic root=UUID=d9281738-de5a-4564-852e-f07c29ea4817 ro default_hugepagesz=1GB hugepagesz=1GB hugepages=192 transparent_hugepage=never solcpus=4-13,32-41,18-27,46-55 console=tty0 console=ttyS0 crashke

[Bug 1862720] [NEW] kernel crash dump create error on Huawei server for 18.04.3 hwe-kernel

2020-02-10 Thread Zhanglei Mao
Public bug reported: coping data get io error [ OK ] Reached target Remote File Systems (Pre). [ OK ] Started Availability of block devices. [ 185.379378] kdump-tools[1695]: Starting kdump-tools: * running makedumpfile -c -d 31 /proc/vmcore /var/crash/201912310519/dump-incomplete Copying d

[Bug 1862720] Re: kernel crash dump create error on Huawei server for 18.04.3 hwe-kernel

2020-02-10 Thread Zhanglei Mao
ubuntu@infra1:~$ sudo lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT loop0 7:00 93.1G 0 loop /var/lib/lxd/storage-pools/default sda 8:00 1.1T 0 disk ├─sda1 8:10 512M 0 part /boot/efi └─sda2

[Bug 1860403] [NEW] fio call trace on xenial during ceph rbd testing

2020-01-20 Thread Zhanglei Mao
Public bug reported: During fio testing as below on 16.04 (xenial) hwe-kernel ( 4.15.0-55-generic), the below fio woluld get call trace or segment falut: /tmp/juju-exec53975/script.sh: line 1: 1104000 Segmentation fault (core dumped) fio --name=randread --ioengine=rbd --clientname=admin --poo

[Bug 1762940] Re: Ubuntu 18.04 d-i install screen becomes blurry on Huawei server in EFI boot mode

2019-05-16 Thread Zhanglei Mao
For #44, in the following testing, if disable hibmc_drm loading, then it works fine and would not be hang. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1762940 Title: Ubuntu 18.04 d-i install scree

[Bug 1762940] Re: Ubuntu 18.04 d-i install screen becomes blurry on Huawei server in EFI boot mode

2019-05-16 Thread Zhanglei Mao
For #44 Aaron's suggested patch, it seems patch have been included after 4.15.0-46. The testing 18.04.2 iso with 4.15.0-48 kernel seems the system would be hang on language selection after select install server of grub screen. So it seems this hibmc_drm would caused another this hang issues too. T

  1   2   3   >