Hello,
This is my first post here - guidance/clarification on Launchpad 
protocol/process welcomed :-)

Previously posted to ubuntuforums
(http://ubuntuforums.org/showthread.php?p=2113284#post2113284) but
received no useful responses... investigated Launchpad and this appears
to be the right place(?).

Various manifestations of this issue appear to have been logged here,
this thread appears the most like my experience.

I would like to understand "how is the fan control *intended* to work in
edgy?"

=====
Quick Summary.
I have a problem regarding overheating/acpi control of the CPU fan.
Essentially: when the temp gets hot it seems the CPU fan cannot be activated, 
resulting in the temp warning siren going off and the temp rising and rising as 
reported in /proc files.

This (or something like it) has been reported elsewhere but I believe
not properly addressed/fixed - correct me if I am wrong

Below is a full report - I hope it helps.

1) Intro:
=========
I have witnessed this a few times and have a reproducible case.
I have done some initial research (googling/forum searching) and this has been 
reported/discussed - however it seems people have treated it as a problem with 
the message, rather than a an actual problem with the acpi/fan control.

I think this is potentially a dangerous problem as it could cause
overheating (and consequent hardware damage?)

Some system info:
-----------------
Running Edgy
uname -a
Linux shaft 2.6.17-10-generic #2 SMP Tue Dec 5 22:28:26 UTC 2006 i686 GNU/Linux

from: cpuinfo
------
vendor_id : GenuineIntel
cpu family : 15
model : 2
model name : Intel(R) Pentium(R) 4 CPU 3.00GHz
stepping : 9
cpu MHz : 3007.288
cache size : 512 KB
-------

from: /proc/acpi/thermal_zone/THRM
----------------------
% more *
::::::::::::::
cooling_mode
::::::::::::::
cooling mode: active
::::::::::::::
polling_frequency
::::::::::::::
<polling disabled>
::::::::::::::
state
::::::::::::::
state: ok
::::::::::::::
temperature
::::::::::::::
temperature: 49 C
::::::::::::::
trip_points
::::::::::::::
critical (S5): 100 C
active[0]: 85 C: devices=0xdffe3bd0
---------

2) Problem description
======================
When I run something CPU intensive (i.e. wav => mp3 encoding with lame) for a 
period of time (a few minutes plus) I get the following symptoms:
a) CPU load at 50% (is 100% of a virtual CPU) as expected for Hyper Threading P4
b) Temp reported in /proc/acpi/thermal_zone/THRM/temperature climbs/increments 
steadily as job continues
c) when temp reaches 85C (the threshold in trip_points file) warnings are 
generated in /var/log/messages:
------------
Jan 17 18:51:59 shaft kernel: [18129602.424000] ACPI: Transitioning device 
[FAN] to D0
Jan 17 18:51:59 shaft kernel: [18129602.424000] ACPI: Transitioning device 
[FAN] to D0
Jan 17 18:51:59 shaft kernel: [18129602.424000] ACPI: Unable to turn cooling 
device [dffe3bd0] 'on'
Jan 17 18:52:00 shaft kernel: [18129602.732000] ACPI: Transitioning device 
[FAN] to D0
Jan 17 18:52:00 shaft kernel: [18129602.732000] ACPI: Transitioning device 
[FAN] to D0
Jan 17 18:52:00 shaft kernel: [18129602.732000] ACPI: Unable to turn cooling 
device [dffe3bd0] 'on'
------------

d) A minute or two later there is an "alarm sound" from the PC speaker (i.e. 
NOT the audio system) - a scary "siren like sound".... my machine is in pain... 
it is calling to me.... it needs help !!!
e) If I ^Z (suspend) the job in the shell.... obviously the CPU usage drops to 
nothing... and I watch the reported temp decline back to a steady state.
f) if I resume the job (fg in shell) the process is repeated (temp climbs - log 
file messages - alarm sounds)


3) Conclusions:
===============
I believe that the message reported is a genuine one... in that the system 
detects the temp threshold correctly, and attempts to start the fan. For some 
reason (I have no idea why) this fails.... and is reported accordingly in the 
log file. If I do not intervene (i.e. suspend job) the temp climbs and climbs.. 
my machine screams in pain (er.. issues audible alert).
Would this go on until a melt down ?
Is it dangerous ?

It certainly seems straightforward to me.
It certainly has only occurred since I move to Edgy (previously ran Warty and 
Hoary on same machine - never had this).


4) FIX IT!
==========
Can someone give an authoritative reply to this - or tell me how (else) I 
should report it ?

Many thanks - Steve
=======

-- 
Multiple ACPI related issues after upgrade Dapper > Edgy
https://launchpad.net/bugs/71465

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to