Hi. I have one of these: eben@cerberus:~$ nvidia-detect Detected NVIDIA GPUs: 01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GM204 [GeForce GTX 970] [10de:13c2] (rev a1)
Checking card: NVIDIA Corporation GM204 [GeForce GTX 970] (rev a1) Your card is supported by all driver versions. Your card is also supported by the Tesla 470 drivers series. It is recommended to install the nvidia-driver package. on here eben@cerberus:~$ cat /etc/debian_version 12.9 Now, the fan doesn't usually come on on this card, so the only way to keep it at a reasonable temperature is to have the side cover off and disable a monitor. And sometimes that's not enough and I have to go to the console until it cools down. so, definitely suboptimal. If replacing it were the only solution and I could afford to do it, I would. But anyhow. The installed nvidia-* packages are: eben@cerberus:~$ apt list --installed nvidia\* Listing... Done nvidia-alternative/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-detect/stable,now 535.216.01-1~deb12u1 amd64 [installed] nvidia-driver-bin/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-driver-libs/stable,now 535.216.01-1~deb12u1 amd64 [installed] nvidia-driver/stable,now 535.216.01-1~deb12u1 amd64 [installed] nvidia-egl-common/stable,now 535.216.01-1~deb12u1 amd64 [installed] nvidia-egl-icd/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-installer-cleanup/stable,now 20220217+3~deb12u1 amd64 [installed,automatic] nvidia-kernel-common/stable,now 20220217+3~deb12u1 amd64 [installed,automatic] nvidia-kernel-dkms/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-kernel-support/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-legacy-check/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-modprobe/stable,now 535.161.07-1~deb12u1 amd64 [installed,automatic] nvidia-persistenced/stable,now 535.171.04-1~deb12u1 amd64 [installed,automatic] nvidia-settings/stable,now 535.171.04-1~deb12u1 amd64 [installed,automatic] nvidia-smi/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-support/stable,now 20220217+3~deb12u1 amd64 [installed,automatic] nvidia-suspend-common/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-vdpau-driver/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-vulkan-common/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] nvidia-vulkan-icd/stable,now 535.216.01-1~deb12u1 amd64 [installed,automatic] Is there anything on that list that looks suspicious, or anything *not* on that list that would explain my problem? The times I have seen it act normally (i.e., run the fan at 10-20% to hold the temperature in the mid-60s, and allow me to use all monitors) are after I've been away from the computer long enough for the monitors to go to sleep. My hypothesis is that something the OS does in that sleep-wake cycle makes the nvidia driver shape up and fly right. Is it just setting DPMS to some higher value (4 or 5) after some time and setting it back to 1 when I return? I've done that using ddccontrol and the driver's behavior doesn't change. Maybe there's a minimum time for the monitors to be asleep or the GPU has to cool to below a certain temperature?