[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-23 Thread Mario Limonciello
I'd say let's track the out of bounds stuff in this bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2039926 ** Summary changed: - amdgpu crash on Mantic + VI dGPU fails to initialize on Intel platforms w/ 5.14+ ** Changed in: linux (Ubuntu) Status: Confirmed => Triaged

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-23 Thread Mario Limonciello
Great, thanks! I've posted it for review. https://lore.kernel.org/amd-gfx/20231023134514.144863-1-mario.limoncie...@amd.com/T/#u Yes, the "UBSAN: array-index-out-of-bounds" traces are to be fixed in 6.7. If you want to pull those patches early I can point you at them.

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-23 Thread Paolo Gentili
OK, the patch fixes the problem! I can still see a stack trace, but it's probably related to https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2036742/comments/13 ? Thanks! ** Attachment added: "dmesg-mantic-6.5.3-patched" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2036742/+attachm

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-21 Thread Ubuntu Foundations Team Bug Bot
** Tags added: patch -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/2036742 Title: amdgpu crash on Mantic Status in linux package in Ubuntu: Confirmed Bug description: [Impact]

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-20 Thread Mario Limonciello
Great, thanks! Let me know if the patch works too.

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-20 Thread Paolo Gentili
With `amdgpu.aspm=0` it works fine. ** Attachment added: "dmesg-focal-5.14-rc2-aspm0" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2036742/+attachment/5711750/+files/dmesg-focal-5.14-rc2-aspm0
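When repeating this test, it can help to confirm that the parameter really reached the kernel command line. The following is a minimal sketch, not part of the original report: the helper name `module_param_from_cmdline` is hypothetical, and it only inspects a command-line string (on a running Linux system that string could be read from `/proc/cmdline`). It says nothing about what the driver did with the value.

```python
import shlex


def module_param_from_cmdline(cmdline, module, param):
    """Return the value of `module.param=value` from a kernel command line.

    Returns None when the parameter is absent. If it appears more than
    once, the last occurrence is returned (an assumption of this sketch).
    """
    wanted = f"{module}.{param}"
    value = None
    for token in shlex.split(cmdline):
        key, sep, val = token.partition("=")
        if sep and key == wanted:
            value = val
    return value


# Example with a made-up command line; the boot image path is illustrative.
example = "BOOT_IMAGE=/boot/vmlinuz root=/dev/sda1 ro amdgpu.aspm=0 quiet"
print(module_param_from_cmdline(example, "amdgpu", "aspm"))
```

A `None` result would suggest the boot entry was not edited as intended before the test.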

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-20 Thread Mario Limonciello
If amdgpu.aspm=0 helps the issue, then can you please test this patch against Mantic? If it helps I'll submit it. ** Patch added: "Disable ASPM for VI on all Intel systems" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2036742/+attachment/5711749/+files/0001-drm-amd-Disable-ASPM-for-VI

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-20 Thread Mario Limonciello
To explain some of the differences in the logs: * UVD is the IP block that fails to init (so no "UVD and UVD ENC initialized successfully"). * Once an IP block fails, next one isn't even tried (so no "VCE initialized successfully") * kfd doesn't initialize because amdgpu_amdkfd_device_init() won
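The first two points can be illustrated with a toy model. This is a sketch of the behaviour described above, not the driver's actual code: `init_ip_blocks`, the block order, and the error value are all hypothetical and chosen only for illustration.

```python
def init_ip_blocks(blocks):
    """Initialize blocks in order and stop at the first failure.

    `blocks` is a list of (name, hw_init) pairs, where hw_init() returns 0
    on success or a negative error code. Returns a tuple of
    (names initialized so far, (failed name, error) or None).
    """
    initialized = []
    for name, hw_init in blocks:
        err = hw_init()
        if err != 0:
            # Later blocks are never attempted once one block fails.
            return initialized, (name, err)
        initialized.append(name)
    return initialized, None


# Hypothetical block list: UVD fails, so VCE is never tried.
blocks = [
    ("GFX", lambda: 0),
    ("UVD", lambda: -110),  # illustrative error code
    ("VCE", lambda: 0),
]
done, failure = init_ip_blocks(blocks)
print(done, failure)
```

In this model the missing "VCE initialized successfully" message follows directly from the early return, which matches the explanation given for the log differences.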

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-20 Thread Paolo Gentili
I can't install Jammy because I can't see the installer, so I did the test on Focal. 5.13.19 works, 5.14-rc2 fails (same for 5.14.0). Dmesg attached. ** Attachment added: "dmesg-focal-mainline.zip" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2036742/+attachment/5711667/+files/dmesg-f

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-19 Thread Mario Limonciello
> the firmware on mantic is zstd compressed, so mainline builds from the past can't load the firmware.. As a hack then maybe just clone https://gitlab.com/kernel-firmware/linux-firmware and put everything (uncompressed) in /lib/firmware/updates/ to get by that. Or run the check on Jammy instead.
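Before booting an older mainline build, one could check whether a firmware directory still contains only compressed files. The sketch below is an illustration under stated assumptions: the helper name `split_by_compression` is hypothetical, and it assumes compressed firmware files carry a `.zst` suffix. It does not decompress or copy anything.

```python
from pathlib import Path


def split_by_compression(firmware_dir):
    """Split files under firmware_dir into (compressed, uncompressed) lists.

    Paths are returned relative to firmware_dir, in sorted order.
    Assumption: zstd-compressed firmware files end in ".zst".
    """
    root = Path(firmware_dir)
    compressed, uncompressed = [], []
    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue
        rel = path.relative_to(root).as_posix()
        if path.suffix == ".zst":
            compressed.append(rel)
        else:
            uncompressed.append(rel)
    return compressed, uncompressed
```

Running it against `/lib/firmware/updates/` after copying in the uncompressed tree should yield an empty first list if the workaround was applied as described.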

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-19 Thread Timo Aaltonen
the firmware on mantic is zstd compressed, so mainline builds from the past can't load the firmware..

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-16 Thread Mario Limonciello
This seems like a different bug to me. Can you please open a different report and attach the relevant logs from the journal to it?

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-16 Thread Kristijan Žic 
The attachment shows an example of what happens when I start using the Brave browser. With X it crashes, then restarts gdm and brings me to the login screen. In any case it’s unusable, as it crashes maybe 10 seconds into opening any app. ** Attachment added: "example of what happens when I

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-16 Thread Kristijan Žic 
I think I have the same issue but I'm not sure. Please advise if I can test anything and how? GPU: AMD Radeon RX Vega 64 Liquid CPU: AMD Threadripper 1900x DS: Wayland With the new installer: The new installer crashes the entire session when it opens the “Connectivity” screen. -

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-16 Thread Paolo Gentili
On Focal, it also works with 5.14.0-1059. dmesg attached. ** Attachment added: "dmesg-5.14oem" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2036742/+attachment/5709895/+files/dmesg-5.14oem

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-16 Thread Mario Limonciello
Something is really fishy here - the 5.15 test is again missing firmware for i915, amdgpu and ath10k: [2.345119] kernel: i915 :00:02.0: Direct firmware load for i915/rkl_dmc_ver2_03.bin failed with error -2 [2.345122] kernel: i915 :00:02.0: [drm] Failed to load DMC firmware

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-16 Thread Paolo Gentili
> Thanks for checking. So good to know it hasn't regressed from stable in 5.10. Can you redo your 5.15 test with the firmware in place so we can see if we're OK there or not? Unfortunately still no luck. I booted Focal with 5.10 OEM and then rebooted to Mantic with 5.15.134. The screen freezes at

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-10-05 Thread Mario Limonciello
I don't expect it helps your boot issue, but the UBSAN issue will be fixed by this commit. https://lore.kernel.org/amd-gfx/78abc334-44ed-4632-8796-6bbe9c432...@amd.com/T/#me31ff6b88640b03be1a8edfc6fc8878ac78ca6bb Please redo the test with 5.15.

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-26 Thread Mario Limonciello
Thanks for checking. So good to know it hasn't regressed from stable in 5.10. Can you redo your 5.15 test with the firmware in place so we can see if we're OK there or not?

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-25 Thread Paolo Gentili
> Can you see if it keeps working with latest 5.10 OEM? 1057 it looks like is newest. Yes, it still works with that version. ** Attachment added: "dmesg-5.10.0-1057" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2036742/+attachment/570/+files/dmesg-5.10.0-1057

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-22 Thread Mario Limonciello
> dmesg-ip-block-mask [6.150330] kernel: amdgpu :01:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring vce0 test failed (-110) [6.150581] kernel: [drm:amdgpu_device_ip_init [amdgpu]] *ERROR* hw_init of IP block failed -110 [6.150726] kernel: amdgpu :01:00.0: amdgpu: a

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-22 Thread Paolo Gentili
The setup does indeed involve a custom KVM; I'll attach the EDID file used in this setup. I tested everything as requested, no luck unfortunately. The working configuration, which I've now replicated, uses the OEM image for Ubuntu 20.04, with which the device was certified. Please fin

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-21 Thread Mario Limonciello
** Changed in: linux (Ubuntu) Status: Incomplete => Confirmed

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-21 Thread Timo Aaltonen
Yes, it's a KVM of sorts, using Zapper. But apparently this did work before, so a rough bisect with mainline builds will be conducted.

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-20 Thread Mario Limonciello
> [5.134271] kernel: [drm:detect_link_and_local_sink [amdgpu]] *ERROR* No EDID read. > [5.322247] kernel: [drm:detect_link_and_local_sink [amdgpu]] *ERROR* No EDID read. > [5.510230] kernel: [drm:detect_link_and_local_sink [amdgpu]] *ERROR* No EDID read. Is this connected to a

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-20 Thread Juerg Haefliger
UBSAN warnings could be a red herring. They've added a compiler flag that complains about flexible arrays if they're declared incorrectly (false positive). Will take a look tomorrow.

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-20 Thread Mario Limonciello
> It seems related to https://bugs.launchpad.net/ubuntu/+source/linux-firmware/+bug/2029396 . I don't believe these to be related. That issue is specifically with navi3x dGPUs; your system has a much older dGPU. Your 6.3 and 6.5 logs both appear to crash similarly; do you have a point in time t

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-20 Thread Dimitri John Ledkov
[4.918050] kernel: UBSAN: array-index-out-of-bounds in /build/linux-IPoq5q/linux-6.5.0/drivers/gpu/drm/amd/amdgpu/../pm/powerplay/hwmgr/smu7_hwmgr.c:3669:4 is not good

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-20 Thread Dimitri John Ledkov
** Changed in: linux-firmware (Ubuntu) Milestone: None => ubuntu-23.10

[Kernel-packages] [Bug 2036742] Re: amdgpu crash on Mantic

2023-09-20 Thread Paolo Gentili
** Description changed: [Impact] - Booting from USB the latest Mantic Desktop canary image (2023-09-19), + Booting from USB the latest Mantic Desktop daily image (2023-09-20), just after the initial logs, nothing is displayed on screen. The system is still alive since _autoinstall_ works