Public bug reported:

The amdgpu driver crashes and shows colored rectangles on the screen -
hangs for a few seconds - and then seems to soft recover, once it
crashed hard with only sysrq to reboot.

With logs like:
[gfxhub] page fault (src_id:0 ring:24 vmid:2 pasid:32772, for process Xorg pid 
1326767 thread Xorg:cs0 pid 1326768)
amdgpu 0000:03:00.0: amdgpu:   in page starting at address 0x00008001896e8000 
from client 0x1b (UTCL2)
amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00201031
amdgpu 0000:03:00.0: amdgpu:     Faulty UTCL2 client ID: TCP (0x8)
amdgpu 0000:03:00.0: amdgpu:     MORE_FAULTS: 0x1
amdgpu 0000:03:00.0: amdgpu:     WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: amdgpu:     PERMISSION_FAULTS: 0x3
amdgpu 0000:03:00.0: amdgpu:     MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: amdgpu:     RW: 0x0
...
[drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft 
recovered


Also attached syslog file that shows a crash logfile (which is not in the 
dmesg).

# cat /proc/version_signature
Ubuntu 6.8.0-45.45-generic 6.8.12

# lsb_release -rd
No LSB modules are available.
Description:    Ubuntu 24.04.1 LTS
Release:        24.04

# apt-cache policy xserver-xorg-video-amdgpu
xserver-xorg-video-amdgpu:
  Installed: 23.0.0-1build1
  Candidate: 23.0.0-1build1
  Version table:
 *** 23.0.0-1build1 500
        500 http://de.archive.ubuntu.com/ubuntu noble/main amd64 Packages
        100 /var/lib/dpkg/status

ProblemType: Bug
DistroRelease: Ubuntu 24.04
Package: linux-image-6.8.0-45-generic 6.8.0-45.45
ProcVersionSignature: Ubuntu 6.8.0-45.45-generic 6.8.12
Uname: Linux 6.8.0-45-generic x86_64
ApportVersion: 2.28.1-0ubuntu3.1
Architecture: amd64
CRDA: N/A
CasperMD5CheckResult: pass
CurrentDesktop: XFCE
Date: Wed Oct  9 19:29:43 2024
InstallationDate: Installed on 2024-08-27 (43 days ago)
InstallationMedia: Xubuntu 24.04 LTS "Noble Numbat" - Release amd64 (20240424.2)
MachineType: ASUS System Product Name
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.8.0-45-generic 
root=UUID=2a5d59da-aefd-437b-a8f7-7f424bedb9c6 ro amdgpu.mcbp=0 
amd_pstate=balance_performance pcie_aspm=off
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No 
PulseAudio daemon running, or not running as session daemon.
RelatedPackageVersions:
 linux-restricted-modules-6.8.0-45-generic N/A
 linux-backports-modules-6.8.0-45-generic  N/A
 linux-firmware                            20240318.git3b128b60-0ubuntu2.4
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 08/02/2024
dmi.bios.release: 22.4
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 2204
dmi.board.asset.tag: Default string
dmi.board.name: ROG STRIX X670E-E GAMING WIFI
dmi.board.vendor: ASUSTeK COMPUTER INC.
dmi.board.version: Rev 1.xx
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 3
dmi.chassis.vendor: Default string
dmi.chassis.version: Default string
dmi.modalias: 
dmi:bvnAmericanMegatrendsInc.:bvr2204:bd08/02/2024:br22.4:svnASUS:pnSystemProductName:pvrSystemVersion:rvnASUSTeKCOMPUTERINC.:rnROGSTRIXX670E-EGAMINGWIFI:rvrRev1.xx:cvnDefaultstring:ct3:cvrDefaultstring:skuSKU:
dmi.product.family: To be filled by O.E.M.
dmi.product.name: System Product Name
dmi.product.sku: SKU
dmi.product.version: System Version
dmi.sys.vendor: ASUS

** Affects: xserver-xorg-video-amdgpu (Ubuntu)
     Importance: Undecided
         Status: New


** Tags: amd64 apport-bug noble

** Attachment added: "syslog"
   https://bugs.launchpad.net/bugs/2084083/+attachment/5826571/+files/syslog

** Also affects: xserver-xorg-video-amdgpu (Ubuntu)
   Importance: Undecided
       Status: New

** No longer affects: linux (Ubuntu)

** Description changed:

  The amdgpu driver crashes and shows colored rectangles on the screen -
  hangs for a few seconds - and then seems to soft recover, once it
  crashed hard with only sysrq to reboot.
  
  With logs like:
+ [gfxhub] page fault (src_id:0 ring:24 vmid:2 pasid:32772, for process Xorg 
pid 1326767 thread Xorg:cs0 pid 1326768)
+ amdgpu 0000:03:00.0: amdgpu:   in page starting at address 0x00008001896e8000 
from client 0x1b (UTCL2)
+ amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00201031
+ amdgpu 0000:03:00.0: amdgpu:   Faulty UTCL2 client ID: TCP (0x8)
+ amdgpu 0000:03:00.0: amdgpu:   MORE_FAULTS: 0x1
+ amdgpu 0000:03:00.0: amdgpu:   WALKER_ERROR: 0x0
+ amdgpu 0000:03:00.0: amdgpu:   PERMISSION_FAULTS: 0x3
+ amdgpu 0000:03:00.0: amdgpu:   MAPPING_ERROR: 0x0
+ amdgpu 0000:03:00.0: amdgpu:   RW: 0x0
+ ...
  [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft 
recovered
  
- Also attached syslog file that shows a crash logfile (which is not in
- the dmesg).
  
+ Also attached syslog file that shows a crash logfile (which is not in the 
dmesg).
  
  # cat /proc/version_signature
  Ubuntu 6.8.0-45.45-generic 6.8.12
  
  # lsb_release -rd
  No LSB modules are available.
  Description:  Ubuntu 24.04.1 LTS
  Release:      24.04
  
  # apt-cache policy xserver-xorg-video-amdgpu
  xserver-xorg-video-amdgpu:
-   Installed: 23.0.0-1build1
-   Candidate: 23.0.0-1build1
-   Version table:
-  *** 23.0.0-1build1 500
-         500 http://de.archive.ubuntu.com/ubuntu noble/main amd64 Packages
-         100 /var/lib/dpkg/status
+   Installed: 23.0.0-1build1
+   Candidate: 23.0.0-1build1
+   Version table:
+  *** 23.0.0-1build1 500
+         500 http://de.archive.ubuntu.com/ubuntu noble/main amd64 Packages
+         100 /var/lib/dpkg/status
  
  ProblemType: Bug
  DistroRelease: Ubuntu 24.04
  Package: linux-image-6.8.0-45-generic 6.8.0-45.45
  ProcVersionSignature: Ubuntu 6.8.0-45.45-generic 6.8.12
  Uname: Linux 6.8.0-45-generic x86_64
  ApportVersion: 2.28.1-0ubuntu3.1
  Architecture: amd64
  CRDA: N/A
  CasperMD5CheckResult: pass
  CurrentDesktop: XFCE
  Date: Wed Oct  9 19:29:43 2024
  InstallationDate: Installed on 2024-08-27 (43 days ago)
  InstallationMedia: Xubuntu 24.04 LTS "Noble Numbat" - Release amd64 
(20240424.2)
  MachineType: ASUS System Product Name
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.8.0-45-generic 
root=UUID=2a5d59da-aefd-437b-a8f7-7f424bedb9c6 ro amdgpu.mcbp=0 
amd_pstate=balance_performance pcie_aspm=off
  PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No 
PulseAudio daemon running, or not running as session daemon.
  RelatedPackageVersions:
-  linux-restricted-modules-6.8.0-45-generic N/A
-  linux-backports-modules-6.8.0-45-generic  N/A
-  linux-firmware                            20240318.git3b128b60-0ubuntu2.4
+  linux-restricted-modules-6.8.0-45-generic N/A
+  linux-backports-modules-6.8.0-45-generic  N/A
+  linux-firmware                            20240318.git3b128b60-0ubuntu2.4
  SourcePackage: linux
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 08/02/2024
  dmi.bios.release: 22.4
  dmi.bios.vendor: American Megatrends Inc.
  dmi.bios.version: 2204
  dmi.board.asset.tag: Default string
  dmi.board.name: ROG STRIX X670E-E GAMING WIFI
  dmi.board.vendor: ASUSTeK COMPUTER INC.
  dmi.board.version: Rev 1.xx
  dmi.chassis.asset.tag: Default string
  dmi.chassis.type: 3
  dmi.chassis.vendor: Default string
  dmi.chassis.version: Default string
  dmi.modalias: 
dmi:bvnAmericanMegatrendsInc.:bvr2204:bd08/02/2024:br22.4:svnASUS:pnSystemProductName:pvrSystemVersion:rvnASUSTeKCOMPUTERINC.:rnROGSTRIXX670E-EGAMINGWIFI:rvrRev1.xx:cvnDefaultstring:ct3:cvrDefaultstring:skuSKU:
  dmi.product.family: To be filled by O.E.M.
  dmi.product.name: System Product Name
  dmi.product.sku: SKU
  dmi.product.version: System Version
  dmi.sys.vendor: ASUS

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2084083

Title:
  amdgpu crash and shows colored rectangles ring gfx_0.0.0 timeout, but
  soft recovered

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-amdgpu/+bug/2084083/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to