Public bug reported:

Products containing gfx1151 architecture with multiple microcontrollers
(VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy
loading or with stress applications on the CRB. This requires rebasing
these firmware versions to eliminate the risk.

# upstream tag 20250211
* 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
# upstream tag 20250109
# upstream tag 20241210
* 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
* 4a172771d ("amdgpu: update psp 14.0.1 firmware")
* d316e650c ("amdgpu: update gc 11.5.1 firmware")
# upstream tag 20241110
# upstream tag 20240811
* f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
# upstream tag 20240709

[ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
[ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 3362 
thread redshiftCmdLine pid 3362)
[ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
[ 217.270433] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431
[ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) 
(0xa)
[ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1
[ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3
[ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
[ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
[ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 3362 
thread redshiftCmdLine pid 3362)
[ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
[ 217.270454] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
[ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
[ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
[ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 3362 
thread redshiftCmdLine pid 3362)
[ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
[ 217.270470] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
[ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0

** Affects: hwe-next
     Importance: Undecided
         Status: New

** Affects: linux-firmware (Ubuntu)
     Importance: Undecided
         Status: Fix Released

** Affects: linux-firmware (Ubuntu Noble)
     Importance: Undecided
         Status: New

** Affects: linux-firmware (Ubuntu Oracular)
     Importance: Undecided
         Status: New

** Affects: linux-firmware (Ubuntu Plucky)
     Importance: Undecided
         Status: Fix Released


** Tags: amd oem-priority originate-from-2100474

** Tags added: amd oem-priority originate-from-2100474

** Also affects: linux-firmware (Ubuntu Noble)
   Importance: Undecided
       Status: New

** Also affects: linux-firmware (Ubuntu Oracular)
   Importance: Undecided
       Status: New

** Also affects: linux-firmware (Ubuntu Plucky)
   Importance: Undecided
       Status: New

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/2100769

Title:
  Update amdgpu FW for GC 11.5.1

Status in HWE Next:
  New
Status in linux-firmware package in Ubuntu:
  Fix Released
Status in linux-firmware source package in Noble:
  New
Status in linux-firmware source package in Oracular:
  New
Status in linux-firmware source package in Plucky:
  Fix Released

Bug description:
  Products containing gfx1151 architecture with multiple
  microcontrollers (VPE, PSP, VCN, SDMA, etc.), observed a few page
  faults during heavy loading or with stress applications on the CRB.
  This requires rebasing these firmware versions to eliminate the risk.

  # upstream tag 20250211
  * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
  # upstream tag 20250109
  # upstream tag 20241210
  * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
  * 4a172771d ("amdgpu: update psp 14.0.1 firmware")
  * d316e650c ("amdgpu: update gc 11.5.1 firmware")
  # upstream tag 20241110
  # upstream tag 20240811
  * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
  # upstream tag 20240709

  [ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270433] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431
  [ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) 
(0xa)
  [ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1
  [ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3
  [ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  [ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270454] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
  [ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
  [ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
  [ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
  [ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  [ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270470] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
  [ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
  [ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
  [ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
  [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0

To manage notifications about this bug go to:
https://bugs.launchpad.net/hwe-next/+bug/2100769/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to