@Matt

Thanks for testing.
I rechecked the noble kernel; the patch will be in Ubuntu-6.8.0-44.44.

Maybe the 6.8.0-43 got a respin.
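
For anyone who wants to check whether a running generic noble kernel is at or past that version, here is a minimal sketch. It assumes the fix first ships with ABI number 44, as stated above, and the helper names (parse_release, has_fix) are made up for illustration. It only makes sense for the 6.8 generic kernel; other flavours such as linux-oem-6.8 number their ABI differently.

```python
import re

# Assumption: the fix first ships in Ubuntu-6.8.0-44.44 (noble generic kernel).
FIXED = (6, 8, 0, 44)


def parse_release(release):
    """Turn a `uname -r` string such as '6.8.0-36-generic' into (6, 8, 0, 36)."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)-(\d+)", release)
    if match is None:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    return tuple(int(part) for part in match.groups())


def has_fix(release):
    """Return True if the release is at or past the assumed fixed ABI."""
    return parse_release(release) >= FIXED
```

On the affected machine this could be called as has_fix(platform.release()) after importing platform; the kernel from the bug description, 6.8.0-36-generic, would be reported as not yet fixed.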

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/2072428

Title:
  6.8 generic & amdpgu / polaris

Status in linux package in Ubuntu:
  In Progress
Status in linux-oem-6.8 package in Ubuntu:
  In Progress
Status in linux source package in Noble:
  Fix Committed
Status in linux-oem-6.8 source package in Noble:
  Fix Released

Bug description:
  The amdgpu driver fails to initialise properly due to a bug introduced
  when trying to fix another bug
  (https://gitlab.freedesktop.org/drm/amd/-/issues/2925).

  This bug is already being tracked on freedesktop, and I tripped over
  it today when rebooting a machine that hadn't been rebooted in a
  while. The bug on freedesktop is
  https://gitlab.freedesktop.org/drm/amd/-/issues/3289 (it also affects
  Fedora and some other Linux distros). The bug is tagged with "Polaris",
  but the affected machine here has an RX 5500.

  The commit that introduced the bug is
  https://gitlab.freedesktop.org/agd5f/linux/-/commit/466a7d115326ece682c2b60d1c77d1d0b9010b4f

  I got a copy of linux-source-6.8.0 and confirmed that the offending
  line in the code is present.

  The issue was not present when using the
  linux-image-unsigned-6.10.0-061000rc4-generic deb from kernel.ubuntu.com.

  Here are the details of the machine in question (complete logs are attached
  to keep this description short).
  # lsb_release -a
  No LSB modules are available.
  Distributor ID:       Ubuntu
  Description:  Ubuntu 24.04 LTS
  Release:      24.04
  Codename:     noble

  boot line:
  Linux version 6.8.0-36-generic (buildd@lcy02-amd64-077) (x86_64-linux-gnu-gcc-13 (Ubuntu 13.2.0-23ubuntu4) 13.2.0, GNU ld (GNU Binutils for Ubuntu) 2.42) #36-Ubuntu SMP PREEMPT_DYNAMIC Mon Jun 10 10:49:14 UTC 2024 (Ubuntu 6.8.0-36.36-generic 6.8.4)

  command line: BOOT_IMAGE=/boot/vmlinuz-6.8.0-36-generic root=UUID=2a38795c-259e-455e-aa40-29b51fb6dd29 ro quiet splash vt.handoff=7

  subset of dmidecode:

  Handle 0x0000, DMI type 0, 26 bytes
  BIOS Information
   Vendor: American Megatrends Inc.
   Version: A.C0
   Release Date: 04/24/2018
   Address: 0xF0000
   Runtime Size: 64 kB
   ROM Size: 16 MB
   Characteristics:
    PCI is supported
    BIOS is upgradeable
    BIOS shadowing is allowed
    Boot from CD is supported
    Selectable boot is supported
    BIOS ROM is socketed
    EDD is supported
    5.25"/1.2 MB floppy services are supported (int 13h)
    3.5"/720 kB floppy services are supported (int 13h)
    3.5"/2.88 MB floppy services are supported (int 13h)
    Print screen service is supported (int 5h)
    8042 keyboard services are supported (int 9h)
    Serial services are supported (int 14h)
    Printer services are supported (int 17h)
    ACPI is supported
    USB legacy is supported
    BIOS boot specification is supported
    Targeted content distribution is supported
    UEFI is supported
   BIOS Revision: 5.13

  Handle 0x0002, DMI type 2, 15 bytes
  Base Board Information
   Manufacturer: Micro-Star International Co., Ltd.
   Product Name: B350M PRO-VDH (MS-7A38)
   Version: 2.0
   Serial Number: XXXXXXXXXXX
   Asset Tag: To be filled by O.E.M.
   Features:
    Board is a hosting board
    Board is replaceable
   Location In Chassis: To be filled by O.E.M.
   Chassis Handle: 0x0003
   Type: Motherboard
   Contained Object Handles: 0

  Handle 0x0004, DMI type 9, 17 bytes
  System Slot Information
   Designation: J6B2
   Type: x16 PCI Express
   Current Usage: In Use
   Length: Long
   ID: 0
   Characteristics:
    3.3 V is provided
    Opening is shared
    PME signal is supported
   Bus Address: 0000:00:01.0

  Handle 0x0014, DMI type 4, 48 bytes
  Processor Information
   Socket Designation: AM4
   Type: Central Processor
   Family: Zen
   Manufacturer: Advanced Micro Devices, Inc.
   ID: 82 0F 80 00 FF FB 8B 17
   Signature: Family 23, Model 8, Stepping 2
   Flags:
    FPU (Floating-point unit on-chip)
    VME (Virtual mode extension)
    DE (Debugging extension)
    PSE (Page size extension)
    TSC (Time stamp counter)
    MSR (Model specific registers)
    PAE (Physical address extension)
    MCE (Machine check exception)
    CX8 (CMPXCHG8 instruction supported)
    APIC (On-chip APIC hardware supported)
    SEP (Fast system call)
    MTRR (Memory type range registers)
    PGE (Page global enable)
    MCA (Machine check architecture)
    CMOV (Conditional move instruction supported)
    PAT (Page attribute table)
    PSE-36 (36-bit page size extension)
    CLFSH (CLFLUSH instruction supported)
    MMX (MMX technology supported)
    FXSR (FXSAVE and FXSTOR instructions supported)
    SSE (Streaming SIMD extensions)
    SSE2 (Streaming SIMD extensions 2)
    HTT (Multi-threading)
   Version: AMD Ryzen 5 2600 Six-Core Processor
   Voltage: 1.1 V
   External Clock: 100 MHz
   Max Speed: 3900 MHz
   Current Speed: 3400 MHz
   Status: Populated, Enabled
   Upgrade: Socket AM4
   L1 Cache Handle: 0x0011
   L2 Cache Handle: 0x0012
   L3 Cache Handle: 0x0013
   Serial Number: Unknown
   Asset Tag: Unknown
   Part Number: Unknown
   Core Count: 6
   Core Enabled: 6
   Thread Count: 12
   Characteristics:
    64-bit capable
    Multi-Core
    Hardware Thread
    Execute Protection
    Enhanced Virtualization
    Power/Performance Control

  lspci:
  00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Root Complex
  00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) I/O Memory Management Unit
  00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
  00:01.3 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe GPP Bridge
  00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
  00:03.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
  00:03.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe GPP Bridge
  00:04.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
  00:07.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
  00:07.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Internal PCIe GPP Bridge 0 to Bus B
  00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
  00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Internal PCIe GPP Bridge 0 to Bus B
  00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 59)
  00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
  00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 0
  00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 1
  00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 2
  00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 3
  00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 4
  00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 5
  00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 6
  00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 7
  03:00.0 USB controller: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset USB 3.1 xHCI Controller (rev 02)
  03:00.1 SATA controller: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset SATA Controller (rev 02)
  03:00.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 43b2 (rev 02)
  16:04.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset PCIe Port (rev 02)
  16:05.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset PCIe Port (rev 02)
  16:06.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset PCIe Port (rev 02)
  16:07.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset PCIe Port (rev 02)
  1b:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller (rev 15)
  1f:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch (rev c5)
  20:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch
  21:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 14 [Radeon RX 5500/5500M / Pro 5500M] (rev c5)
  21:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 HDMI Audio
  22:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Zeppelin/Raven/Raven2 PCIe Dummy Function
  22:00.2 Encryption controller: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Platform Security Processor (PSP) 3.0 Device
  22:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Zeppelin USB 3.0 xHCI Compliant Host Controller
  23:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Zeppelin/Renoir PCIe Dummy Function
  23:00.2 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 51)
  23:00.3 Audio device: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) HD Audio Controller

  
  SRU Justification:
  ==============
  [Impact]
  AMD GPUs such as the RX580 and RX5500 fail to initialise after
  commit 466a7d115326 ("drm/amd: Use the first non-dGPU PCI device for BW limits").
  The screen stays blank.

  [Fix]
  Set the current speed/width on devices.
  Upstream fix:
  ba46b3bda296 ("drm/amdgpu: Adjust logic in amdgpu_device_partner_bandwidth()")

  [Test]
  The bug reporter tested the fix on the affected hardware; amdgpu works fine.
  I also verified it on a laptop with an AMD iGPU; glmark2 runs fine.

  [Where problems could occur]
  It may break amdgpu.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2072428/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
