** Description changed:

  OS: Ubuntu Server 23.10.1
  Filesystem: OpenZFS, latest version from Ubuntu Repo
  GPU: Intel ARC A380, Codename DG2
  
  i915 driver for ARC GPU fails to load if the "zfs" module is loaded first 
during boot.
  I noticed this by installing the package "zfsutils-linux" and rebooting the 
server. Hardware acceleration in Jellyfin stopped working and "vainfo" command 
failed.
  
  See the following journalctl output:
  
  ```
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] VT-d active for 
gfx access
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: vgaarb: deactivate vga 
console
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] Can't resize LMEM 
BAR - platform support is missing
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] Local memory IO 
size: 0x0000000010000000
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] Local memory 
available: 0x000000017c800000
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] Using a reduced 
BAR size of 256MiB. Consider enabling 'Resizable BAR' or similar, if available 
in the BIOS.
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: vgaarb: changed VGA 
decodes: olddecodes=io+mem,decodes=io+mem:owns=none
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] Finished loading 
DMC firmware i915/dg2_dmc_ver2_08.bin (v2.8)
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GuC firmware 
i915/dg2_guc_70.bin version 70.8.0
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] GT0: HuC firmware 
i915/dg2_huc_gsc.bin version 7.10.3
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: 
submission enabled
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: SLPC 
enabled
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: RC 
enabled
- Jan 08 14:34:04 schublade kernel: [drm] Initialized i915 1.6.0 20201103 for 
0000:0c:00.0 on minor 0
- Jan 08 14:34:04 schublade kernel: fbcon: i915drmfb (fb0) is primary device
- Jan 08 14:34:04 schublade kernel: i915 0000:0c:00.0: [drm] fb0: i915drmfb 
frame buffer device
- Jan 08 14:34:05 schublade kernel: mei_gsc i915.mei-gscfi.3072: FW not ready: 
resetting: dev_state = 2 pxp = 0
- Jan 08 14:34:05 schublade kernel: mei_gsc i915.mei-gscfi.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 
E0020002 00000000
- Jan 08 14:34:05 schublade kernel: mei_gsc i915.mei-gsc.3072: FW not ready: 
resetting: dev_state = 2 pxp = 2
- Jan 08 14:34:05 schublade kernel: mei_gsc i915.mei-gsc.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 
E0020002 00000000
- Jan 08 14:34:05 schublade kernel: snd_hda_intel 0000:0d:00.0: bound 
0000:0c:00.0 (ops i915_audio_component_bind_ops [i915])
- Jan 08 14:34:05 schublade kernel: i915 0000:0c:00.0: [drm] GT0: HuC: 
authenticated for all workloads
- Jan 08 14:34:05 schublade kernel: mei_pxp 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: bound 0000:0c:00.0 (ops 
i915_pxp_tee_component_ops [i915])
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] VT-d active for 
gfx access
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: vgaarb: deactivate vga 
console
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Can't resize LMEM 
BAR - platform support is missing
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Local memory IO 
size: 0x0000000010000000
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Local memory 
available: 0x000000017c800000
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Using a reduced 
BAR size of 256MiB. Consider enabling 'Resizable BAR' or similar, if availab>
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: vgaarb: changed VGA 
decodes: olddecodes=io+mem,decodes=io+mem:owns=none
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Finished loading 
DMC firmware i915/dg2_dmc_ver2_08.bin (v2.8)
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GuC firmware 
i915/dg2_guc_70.bin version 70.8.0
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: HuC firmware 
i915/dg2_huc_gsc.bin version 7.10.3
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: load 
failed: status = 0x80000534, time = 1000ms, freq = 2400MHz, ret = -110
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: load 
failed: status: Reset = 0, BootROM = 0x1A, UKernel = 0x05, MIA = 0x00, Auth =>
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: still 
extracting hwconfig table.
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] *ERROR* GT0: GuC 
initialization failed -ETIMEDOUT
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] *ERROR* GT0: 
Enabling uc failed (-5)
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] *ERROR* GT0: 
Failed to initialize GPU, declaring it wedged!
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm:add_taint_for_CI 
[i915]] CI tainted:0x9 by intel_gt_set_wedged_on_init+0x34/0x50 [i915]
+ Jan 08 13:56:46 schublade kernel: [drm] Initialized i915 1.6.0 20201103 for 
0000:0c:00.0 on minor 0
+ Jan 08 13:56:46 schublade kernel: fbcon: i915drmfb (fb0) is primary device
+ Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] fb0: i915drmfb 
frame buffer device
+ Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gscfi.3072: cl:host=01 
me=32 fw disconnect request received
+ Jan 08 13:56:48 schublade kernel: mei 
i915.mei-gscfi.3072-e2c2afa2-3817-4d19-9d95-06b16b588a5d: Could not read FW 
version ret = -19
+ Jan 08 13:56:48 schublade kernel: mei 
i915.mei-gscfi.3072-e2c2afa2-3817-4d19-9d95-06b16b588a5d: FW version command 
failed -5
+ Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gsc.3072: FW not ready: 
resetting: dev_state = 2 pxp = 2
+ Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gscfi.3072: FW not ready: 
resetting: dev_state = 2 pxp = 0
+ Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gsc.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 E002>
+ Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gscfi.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 E0>
+ Jan 08 13:56:48 schublade kernel: snd_hda_intel 0000:0d:00.0: bound 
0000:0c:00.0 (ops i915_audio_component_bind_ops [i915])
+ Jan 08 13:56:48 schublade kernel: i915 0000:0c:00.0: [drm] *ERROR* failed to 
load huc via gsc -8
+ Jan 08 13:56:48 schublade kernel: mei_pxp 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: failed to bind 
0000:0c:00.0 (ops i915_pxp_tee_compone>
+ Jan 08 13:56:48 schublade kernel: mei_pxp 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: adev bind failed: -8
+ Jan 08 13:56:48 schublade kernel: mei_pxp 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: Master comp add failed 
-8
+ Jan 08 13:56:48 schublade kernel: mei_pxp: probe of 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1 failed with error -8
  ```
  
  I was able to "fix" it by configuring a soft dependency for the zfs module:
  1. Create File '/etc/modprobe.d/i915_before_zfs.conf'
  2. Add content: "softdep zfs pre: i915"
  3. Reboot
  4. Profit.
  
  Obviously this is not a permanent solution, but forces the i915 driver
  to load before the zfs module breaks it.
  
  Here is the log output after the above mentioned "fix":
  
  ```
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] VT-d active for 
gfx access
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: vgaarb: deactivate vga 
console
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Can't resize LMEM 
BAR - platform support is missing
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Local memory IO 
size: 0x0000000010000000
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Local memory 
available: 0x000000017c800000
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Using a reduced 
BAR size of 256MiB. Consider enabling 'Resizable BAR' or similar, if available 
in the BIOS.
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: vgaarb: changed VGA 
decodes: olddecodes=io+mem,decodes=io+mem:owns=none
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Finished loading 
DMC firmware i915/dg2_dmc_ver2_08.bin (v2.8)
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GuC firmware 
i915/dg2_guc_70.bin version 70.8.0
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: HuC firmware 
i915/dg2_huc_gsc.bin version 7.10.3
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: 
submission enabled
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: SLPC 
enabled
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: RC 
enabled
  Jan 08 14:57:37 schublade kernel: [drm] Initialized i915 1.6.0 20201103 for 
0000:0c:00.0 on minor 0
  Jan 08 14:57:37 schublade kernel: fbcon: i915drmfb (fb0) is primary device
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] fb0: i915drmfb 
frame buffer device
  Jan 08 14:57:39 schublade kernel: mei_gsc i915.mei-gscfi.3072: FW not ready: 
resetting: dev_state = 2 pxp = 0
  Jan 08 14:57:39 schublade kernel: mei_gsc i915.mei-gscfi.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 
E0020002 00000000
  Jan 08 14:57:39 schublade kernel: mei_gsc i915.mei-gsc.3072: FW not ready: 
resetting: dev_state = 2 pxp = 2
  Jan 08 14:57:39 schublade kernel: mei_gsc i915.mei-gsc.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 
E0020002 00000000
  Jan 08 14:57:39 schublade kernel: snd_hda_intel 0000:0d:00.0: bound 
0000:0c:00.0 (ops i915_audio_component_bind_ops [i915])
  Jan 08 14:57:39 schublade kernel: i915 0000:0c:00.0: [drm] GT0: HuC: 
authenticated for all workloads
  ```

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/2048679

Title:
  ARC GPU declared wedged when zfs Kernel Module is loaded

Status in linux-firmware package in Ubuntu:
  Confirmed

Bug description:
  OS: Ubuntu Server 23.10.1
  Filesystem: OpenZFS, latest version from Ubuntu Repo
  GPU: Intel ARC A380, Codename DG2

  i915 driver for ARC GPU fails to load if the "zfs" module is loaded first 
during boot.
  I noticed this by installing the package "zfsutils-linux" and rebooting the 
server. Hardware acceleration in Jellyfin stopped working and "vainfo" command 
failed.

  See the following journalctl output:

  ```
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] VT-d active for 
gfx access
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: vgaarb: deactivate vga 
console
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Can't resize LMEM 
BAR - platform support is missing
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Local memory IO 
size: 0x0000000010000000
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Local memory 
available: 0x000000017c800000
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Using a reduced 
BAR size of 256MiB. Consider enabling 'Resizable BAR' or similar, if availab>
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: vgaarb: changed VGA 
decodes: olddecodes=io+mem,decodes=io+mem:owns=none
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] Finished loading 
DMC firmware i915/dg2_dmc_ver2_08.bin (v2.8)
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GuC firmware 
i915/dg2_guc_70.bin version 70.8.0
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: HuC firmware 
i915/dg2_huc_gsc.bin version 7.10.3
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: load 
failed: status = 0x80000534, time = 1000ms, freq = 2400MHz, ret = -110
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: load 
failed: status: Reset = 0, BootROM = 0x1A, UKernel = 0x05, MIA = 0x00, Auth =>
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: still 
extracting hwconfig table.
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] *ERROR* GT0: GuC 
initialization failed -ETIMEDOUT
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] *ERROR* GT0: 
Enabling uc failed (-5)
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] *ERROR* GT0: 
Failed to initialize GPU, declaring it wedged!
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm:add_taint_for_CI 
[i915]] CI tainted:0x9 by intel_gt_set_wedged_on_init+0x34/0x50 [i915]
  Jan 08 13:56:46 schublade kernel: [drm] Initialized i915 1.6.0 20201103 for 
0000:0c:00.0 on minor 0
  Jan 08 13:56:46 schublade kernel: fbcon: i915drmfb (fb0) is primary device
  Jan 08 13:56:46 schublade kernel: i915 0000:0c:00.0: [drm] fb0: i915drmfb 
frame buffer device
  Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gscfi.3072: cl:host=01 
me=32 fw disconnect request received
  Jan 08 13:56:48 schublade kernel: mei 
i915.mei-gscfi.3072-e2c2afa2-3817-4d19-9d95-06b16b588a5d: Could not read FW 
version ret = -19
  Jan 08 13:56:48 schublade kernel: mei 
i915.mei-gscfi.3072-e2c2afa2-3817-4d19-9d95-06b16b588a5d: FW version command 
failed -5
  Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gsc.3072: FW not ready: 
resetting: dev_state = 2 pxp = 2
  Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gscfi.3072: FW not ready: 
resetting: dev_state = 2 pxp = 0
  Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gsc.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 E002>
  Jan 08 13:56:48 schublade kernel: mei_gsc i915.mei-gscfi.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 E0>
  Jan 08 13:56:48 schublade kernel: snd_hda_intel 0000:0d:00.0: bound 
0000:0c:00.0 (ops i915_audio_component_bind_ops [i915])
  Jan 08 13:56:48 schublade kernel: i915 0000:0c:00.0: [drm] *ERROR* failed to 
load huc via gsc -8
  Jan 08 13:56:48 schublade kernel: mei_pxp 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: failed to bind 
0000:0c:00.0 (ops i915_pxp_tee_compone>
  Jan 08 13:56:48 schublade kernel: mei_pxp 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: adev bind failed: -8
  Jan 08 13:56:48 schublade kernel: mei_pxp 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: Master comp add failed 
-8
  Jan 08 13:56:48 schublade kernel: mei_pxp: probe of 
i915.mei-gsc.3072-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1 failed with error -8
  ```

  I was able to "fix" it by configuring a soft dependency for the zfs module:
  1. Create File '/etc/modprobe.d/i915_before_zfs.conf'
  2. Add content: "softdep zfs pre: i915"
  3. Reboot
  4. Profit.

  Obviously this is not a permanent solution, but forces the i915 driver
  to load before the zfs module breaks it.

  Here is the log output after the above mentioned "fix":

  ```
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] VT-d active for 
gfx access
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: vgaarb: deactivate vga 
console
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Can't resize LMEM 
BAR - platform support is missing
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Local memory IO 
size: 0x0000000010000000
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Local memory 
available: 0x000000017c800000
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Using a reduced 
BAR size of 256MiB. Consider enabling 'Resizable BAR' or similar, if available 
in the BIOS.
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: vgaarb: changed VGA 
decodes: olddecodes=io+mem,decodes=io+mem:owns=none
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] Finished loading 
DMC firmware i915/dg2_dmc_ver2_08.bin (v2.8)
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GuC firmware 
i915/dg2_guc_70.bin version 70.8.0
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: HuC firmware 
i915/dg2_huc_gsc.bin version 7.10.3
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: 
submission enabled
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: SLPC 
enabled
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] GT0: GUC: RC 
enabled
  Jan 08 14:57:37 schublade kernel: [drm] Initialized i915 1.6.0 20201103 for 
0000:0c:00.0 on minor 0
  Jan 08 14:57:37 schublade kernel: fbcon: i915drmfb (fb0) is primary device
  Jan 08 14:57:37 schublade kernel: i915 0000:0c:00.0: [drm] fb0: i915drmfb 
frame buffer device
  Jan 08 14:57:39 schublade kernel: mei_gsc i915.mei-gscfi.3072: FW not ready: 
resetting: dev_state = 2 pxp = 0
  Jan 08 14:57:39 schublade kernel: mei_gsc i915.mei-gscfi.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 
E0020002 00000000
  Jan 08 14:57:39 schublade kernel: mei_gsc i915.mei-gsc.3072: FW not ready: 
resetting: dev_state = 2 pxp = 2
  Jan 08 14:57:39 schublade kernel: mei_gsc i915.mei-gsc.3072: unexpected 
reset: dev_state = ENABLED fw status = 00000345 84670000 00000000 00000000 
E0020002 00000000
  Jan 08 14:57:39 schublade kernel: snd_hda_intel 0000:0d:00.0: bound 
0000:0c:00.0 (ops i915_audio_component_bind_ops [i915])
  Jan 08 14:57:39 schublade kernel: i915 0000:0c:00.0: [drm] GT0: HuC: 
authenticated for all workloads
  ```

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-firmware/+bug/2048679/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to