If it helps, I’ve gathered some readings while the external monitor is
plugged in and the fans are spinning:
Output of nvidia-smi:
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.107.02 Driver Version: 550.107.02 CUDA
Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile
Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util
Compute M. |
| | |
MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 3060 ... Off | 00000000:01:00.0 On |
N/A |
| N/A 35C P8 11W / 80W | 46MiB / 6144MiB | 0%
Default |
| | |
N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes:
|
| GPU GI CI PID Type Process name
GPU Memory |
| ID ID
Usage |
|=========================================================================================|
| 0 N/A N/A 3733 G /usr/bin/gnome-shell
42MiB |
+-----------------------------------------------------------------------------------------+
Output of sensors:
coretemp-isa-0000
Adapter: ISA adapter
Package id 0: +38.0°C (high = +100.0°C, crit = +100.0°C)
Core 0: +34.0°C (high = +100.0°C, crit = +100.0°C)
Core 1: +34.0°C (high = +100.0°C, crit = +100.0°C)
Core 2: +34.0°C (high = +100.0°C, crit = +100.0°C)
Core 3: +33.0°C (high = +100.0°C, crit = +100.0°C)
Core 4: +32.0°C (high = +100.0°C, crit = +100.0°C)
Core 5: +35.0°C (high = +100.0°C, crit = +100.0°C)
Core 6: +31.0°C (high = +100.0°C, crit = +100.0°C)
Core 7: +34.0°C (high = +100.0°C, crit = +100.0°C)
nvme-pci-e100
Adapter: PCI adapter
Composite: +46.9°C (low = -0.1°C, high = +79.8°C)
(crit = +81.8°C)
Sensor 1: +47.9°C (low = -273.1°C, high = +65261.8°C)
BAT1-acpi-0
Adapter: ACPI interface
in0: 16.78 V
curr1: 0.00 A
iwlwifi_1-virtual-0
Adapter: Virtual device
temp1: +48.0°C
ucsi_source_psy_USBC000:001-isa-0000
Adapter: ISA adapter
in0: 0.00 V (min = +0.00 V, max = +0.00 V)
curr1: 0.00 A (max = +0.00 A)
acpitz-acpi-0
Adapter: ACPI interface
temp1: +45.0°C
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/2081935
Title:
Fan control stuck at high speed after upgrade to Ubuntu 24, thermal
zone 6 fails to read sensor
Status in linux package in Ubuntu:
New
Bug description:
After upgrading from Ubuntu 22 (kernel 5.x.x) to Ubuntu 24 (kernel
6.8.x), I have encountered an issue with fan control on my Acer
Predator PH315-54. The fans get stuck at high speed, even when the
system temperatures are relatively low (30-40°C for both CPU and GPU).
This problem did not occur on Ubuntu 22, where fan control worked
properly.
The issue seems related to the thermal_zone6 sensor, which
consistently fails to provide a valid temperature reading in both
dmesg and thermald logs.
This is accompanied by the following error in dmesg:
thermal thermal_zone6: failed to read out thermal zone (-61)
In the thermald logs, I also observe errors related to sensor id 16:
sensor id 16 : No temp sysfs for reading raw temp
I have attempted multiple troubleshooting steps, including restarting
and stopping thermald, checking BIOS settings, and reinstalling NVIDIA
drivers, but the fan remains stuck at high speed. The issue persists
regardless of system load, and the laptop remains cool to the touch.
System Information:
Model: Acer Predator PH315-54
BIOS Version: V1.11
Kernel Version: 6.x.x (Ubuntu 24)
Thermal Sensors Detected:
acpitz
INT3400 Thermal
SEN2
SEN3
TCPU
x86_pkg_temp
iwlwifi_1
Steps to Reproduce:
Boot into Ubuntu 24.
Allow the system to idle or perform light system activities.
Observe that the fan stays at high speed, regardless of system
temperature.
Expected Behavior:
The fans should adjust dynamically according to system temperatures, slowing
down when the system is idle or under light load.
Actual Behavior:
The fan remains at high speed continuously, even when the system is
cool (30-40°C for CPU and GPU), and thermal_zone6 fails to read
temperature data.
Troubleshooting Steps Taken:
Restarted Thermald: Restarting thermald does not fix the issue.
Stopped Thermald: Disabling thermald temporarily reduces the issue, but
the fan still runs at high speed under BIOS control.
Reinstalled NVIDIA Drivers: The problem persists regardless of NVIDIA
driver reinstallation.
Checked Kernel Logs: The dmesg logs consistently show the thermal_zone6
failure.
Checked Sensors: Other sensors are reporting valid temperature data
(e.g., CPU and GPU), but thermal_zone6 is not functional.
BIOS Settings: Checked BIOS/UEFI for fan control options but found none.
The BIOS was updated to the latest version (V1.11), but the issue persists.
Additional Information:
The issue appears to be tied to the kernel update (6.x.x) in Ubuntu 24.
The fan control worked properly on Ubuntu 22 (kernel 5.x.x), suggesting a
possible regression or missing support for hardware sensors in the newer kernel.
The issue could be related to BIOS/ACPI fan management or a conflict with
thermald in the newer kernel.
I believe this may be a regression in the thermal management drivers,
and I am willing to provide any additional logs or information
required to help resolve it.
I have also noticed a bunch of ACPI logs in 'dmesg' that i attached
with this. This might be related or not i'm not sure.
Regards!
ProblemType: Bug
DistroRelease: Ubuntu 24.04
Package: linux-image-6.8.0-45-generic 6.8.0-45.45
ProcVersionSignature: Ubuntu 6.8.0-45.45-generic 6.8.12
Uname: Linux 6.8.0-45-generic x86_64
NonfreeKernelModules: nvidia_modeset nvidia
ApportVersion: 2.28.1-0ubuntu3.1
Architecture: amd64
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/seq: alexgmenard 4277 F.... pipewire
/dev/snd/controlC0: alexgmenard 4282 F.... wireplumber
/dev/snd/controlC1: alexgmenard 4282 F.... wireplumber
CRDA: N/A
CasperMD5CheckResult: pass
CurrentDesktop: ubuntu:GNOME
Date: Tue Sep 24 23:42:14 2024
InstallationDate: Installed on 2022-08-19 (768 days ago)
InstallationMedia: Ubuntu 22.04.1 LTS "Jammy Jellyfish" - Release amd64
(20220809.1)
MachineType: Acer Predator PH315-54
ProcFB: 0 i915drmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.8.0-45-generic
root=UUID=f5f773ef-b6c6-4c65-9bd3-0e23f6946276 ro quiet splash loglevel=3
vt.handoff=7
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No
PulseAudio daemon running, or not running as session daemon.
RelatedPackageVersions:
linux-restricted-modules-6.8.0-45-generic N/A
linux-backports-modules-6.8.0-45-generic N/A
linux-firmware 20240318.git3b128b60-0ubuntu2.3
SourcePackage: linux
UpgradeStatus: Upgraded to noble on 2024-09-21 (3 days ago)
dmi.bios.date: 06/29/2023
dmi.bios.release: 1.15
dmi.bios.vendor: Insyde Corp.
dmi.bios.version: V1.15
dmi.board.asset.tag: Type2 - Board Asset Tag
dmi.board.name: QX60_TLS
dmi.board.vendor: TGL
dmi.board.version: V1.15
dmi.chassis.type: 10
dmi.chassis.vendor: Acer
dmi.chassis.version: V1.15
dmi.ec.firmware.release: 1.9
dmi.modalias:
dmi:bvnInsydeCorp.:bvrV1.15:bd06/29/2023:br1.15:efr1.9:svnAcer:pnPredatorPH315-54:pvrV1.15:rvnTGL:rnQX60_TLS:rvrV1.15:cvnAcer:ct10:cvrV1.15:sku0000000000000000:
dmi.product.family: Predator Helios 300
dmi.product.name: Predator PH315-54
dmi.product.sku: 0000000000000000
dmi.product.version: V1.15
dmi.sys.vendor: Acer
mtime.conffile..etc.init.d.apport: 2024-07-22T10:59:07
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2081935/+subscriptions
--
Mailing list: https://launchpad.net/~kernel-packages
Post to : [email protected]
Unsubscribe : https://launchpad.net/~kernel-packages
More help : https://help.launchpad.net/ListHelp