Hello, Debian users.
I have stability issues (freezing) on my laptop running Debian 12
(current stable), and, according to logs, the culprit is kernel module i915.
My kernel version:
$ uname -a
Linux hostname 6.1.0-30-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.124-1
(2025-01-12) x86_64 GNU/Linux
My GPU is CPU integrated:
$ lspci -nn -v | grep VGA
00:02.0 VGA compatible controller [0300]: Intel Corporation Alder Lake-P
GT1 [UHD Graphics] [8086:46a3] (rev 0c) (prog-if 00 [VGA controller])
A few relevant lines from syslog after freeze and reboot:
...
Jan 18 02:08:26 hostname kernel: i915 0000:00:02.0: Uninitialized
forcewake domain(s) 0x8 accessed at 0x1cd000
Jan 18 02:08:26 hostname kernel: WARNING: CPU: 2 PID: 182 at
drivers/gpu/drm/i915/intel_uncore.c:950 find_fw_domain+0xcd/0xe0 [i915]
...
Jan 18 02:08:26 hostname kernel: i915 0000:00:02.0: [drm] GPU HANG:
ecode 12:1:85dfbffd, in Xorg [848]
Jan 18 02:08:26 hostname kernel: i915 0000:00:02.0: [drm] Resetting chip
for stopped heartbeat on rcs0
Jan 18 02:08:27 hostname kernel: i915 0000:00:02.0: [drm] Xorg[848]
context reset due to GPU hang
Jan 18 02:08:27 hostname kernel: i915 0000:00:02.0: [drm] GuC firmware
i915/adlp_guc_70.bin version 70.36.0
Jan 18 02:08:27 hostname kernel: i915 0000:00:02.0: [drm] HuC firmware
i915/tgl_huc.bin version 7.9.3
Jan 18 02:08:27 hostname kernel: i915 0000:00:02.0: [drm] HuC authenticated
Jan 18 02:08:27 hostname kernel: i915 0000:00:02.0: [drm] GuC submission
enabled
Jan 18 02:08:27 hostname kernel: i915 0000:00:02.0: [drm] GuC SLPC enabled
...
Logs with call traces:
https://paste.debian.net/1345769/
Things I've tried so far:
1. Updated Intel graphics firmware to latest version available on
git.kernel.org
Things slightly improved, now I have a freeze after a hour or two
instead of a few minutes. My guess is that firmware got better at "GPU
hang" handling and recovery, but didn't fixed the bug itself.
2. Switched Xorg driver from "modesetting" to older "xf86-video-intel".
My GPU appears to be too modern for it, I run into graphical artifacts,
xorg server couldn't recognize GPU chipset, etc, so I had to revert back
to "modesetting".
3. Tried to set "enable_guc=0" option for i915 module, which disabled
GuC offloading.
While I couldn't reproduce the issue anymore and my laptop won't freeze
even after a few hours, instead I got poor performance, higher than
usual fan noise and battery drain.
It also taints the kernel by Intel's design, so I had to revert back to
default (which functionally seems to be "enable_guc=3").
There is also an option to try official backported Intel driver which
builds out-of-tree DKMS module, but I rather look for a more standard
solution.
https://dgpu-docs.intel.com/releases/releases.html
Searching for "Uninitialized forcewake domain(s) 0x8 accessed at" gave
me a result leading to a patch which could be addressing the same issue
I have.
https://www.spinics.net/lists/intel-gfx/msg274293.html
So now my questions are:
How to check if this patch was added upstream to git.kernel.org and to
which kernel version?
And how upstream kernel version correlates to kernel version from
Debian, so I can get a reference point to see what patches for i915 were
added?
--
With kindest regards, Alexander.
Debian - The universal operating system
https://www.debian.org