Summary: | ATI card fan is always on with opensource radeon driver | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Product: | xorg | Reporter: | Bryce Harrington <bryce> | ||||||||
Component: | Driver/Radeon | Assignee: | xf86-video-ati maintainers <xorg-driver-ati> | ||||||||
Status: | RESOLVED INVALID | QA Contact: | Xorg Project Team <xorg-team> | ||||||||
Severity: | major | ||||||||||
Priority: | high | CC: | aleritty, ayers, berndl81, cJ-X, dnaslave, glisse, grantipak, higuita, javier, jjdekroon, jon, julo42, linux, mattengi, mcepl, mozilla_bugs, nicku, patrick, pradermecker, sonichedgehog_hyperblast00, szebenyib, tadej.j, zawullon | ||||||||
Version: | 7.6 (2010.12) | ||||||||||
Hardware: | All | ||||||||||
OS: | Linux (All) | ||||||||||
See Also: | https://bugzilla.redhat.com/show_bug.cgi?id=682851 | ||||||||||
Whiteboard: | |||||||||||
i915 platform: | i915 features: | ||||||||||
Attachments: |
|
Description
Bryce Harrington
2011-04-20 15:15:17 UTC
Created attachment 45876 [details]
BootDmesg.txt
Created attachment 45877 [details]
CurrentDmesg.txt
Created attachment 45878 [details]
XorgLog.txt
I, too, suffer from this exact problem with an ATI Radeon HD 4200 (on a Dell Inspiron M301Z). Do you need any information to help you? Tobias Lipper made on the downstream Fedora bugs these comments which shouldn't be lost IMHO in the mist of time: I found a patch in the upstream kernel that fixes the termal reading for amd juniper cards. This could be the source of the error. I am not so familiar with kernel development, but I will try the patch and see if it fixes the problem. http://git.kernel.org/?p=linux/kernel/git/next/linux-next.git;a=blobdiff;f=drivers/gpu/drm/radeon/evergreen.c;h=8f446aadccd660f49854d4aadad109bdd559f95d;hp=34cd5a878088b19f65a87cffd9c491cdd31ae0c2;hb=67b3f823ec78d08aea8835bce2655674237abc1d;hpb=457558eda1545c22163574f6dbb883394705e9dd ---------------------------- I could apply the patch and the module compiles fine, but I found that su -c 'echo low > /sys/class/drm/card0/device/power_profile' fixes the problem even without the patch. (at least it turns down the fan to regular) See the "KMS Power Management Options" section of this page: http://wiki.x.org/wiki/RadeonFeature (In reply to comment #6) > See the "KMS Power Management Options" section of this page: > http://wiki.x.org/wiki/RadeonFeature I have Mobility Radeon HD 3650 and I'm seeing this problem aswell. The default power_profile called "default" makes the laptop overheat and causes emergency thermal shutdowns and crashes. This happens at least with Linux 2.6.35, 2.6.38 and 3.0.0. I've tried with Fedora 14, 15 and 16 alpha. Changing the power_profile to "low" makes the temperature go down at least 10-20 degrees celsius. Any idea why the default profile doesn't work? dynpm echo> / sys/class/drm/card0/device/power_method dynamically changes the frequency, but act like it was moving between mid and high If there were an option, in addition, similar to dynpm but that could change from low to mid / high this might solve the problem of course I meant to write the command echo dynpm > / sys/class/drm/card0/device/power_method Hello everybody, I saw two bugs which are a bit related to each other to my knowledge. I decided to post my comment in this bug report. My laptop a MSI GX623 equipped with an Ati Mobility Radeon HD4670. The problem has been around from the moment I bought my computer (5 Dec. 2009). Currently I'm using Ubuntu 11.10, with Xorg-edgers ppa and kernel version 3.1-rc4. When the computer is cleanly installed the power profile of the video card is always defaulted to default. When leaving it on this profile, and not even touching my laptop the computer starts overheating, and after 10 to 15 minutes powers down due to thermal protection. To solve my problem I started to experiment with some power management settings. Using dynpm is no option for me at this moment, because of the flickering screen bug. So I've experimented with some other power management options. First I tried auto, with this my laptop doesn't overheat when nothing is running. When I start some task which makes use of the video-card, my computer switches to high profile, and starts overheating again with the same problem mentioned above. Also when forced to profile high my computer overheats and shuts down. In the forced mid profile the computer can be used without problems, but again when using the laptop for example to watch a movie, my computer overheats again and shuts down. The only profile in which I don't have problems is low, but then my video card is clocked down to minimal capability. Hopefully the developers can make use of my findings. Also when I need to provide some extra information, then you can just ask me what you would like me to do. Greetings Jasper (In reply to comment #10) > When the computer is cleanly installed the power profile of the video card is > always defaulted to default. > When leaving it on this profile, and not even touching my laptop the computer > starts overheating, and after 10 to 15 minutes powers down due to thermal > protection. > Yep, the radeon default power_profile "default" is broken. Any comments from the developers? What needs to be done to get this issue fixed? (In reply to comment #11) > (In reply to comment #10) > > When the computer is cleanly installed the power profile of the video card is > > always defaulted to default. > > When leaving it on this profile, and not even touching my laptop the computer > > starts overheating, and after 10 to 15 minutes powers down due to thermal > > protection. > > > > Yep, the radeon default power_profile "default" is broken. > > Any comments from the developers? What needs to be done to get this issue > fixed? > Ping ? Come on! At least tell us if someone is working on it, or if you need for information. I know it's rude to spam a bug report, but I find it even worse not to give any answer to people who took the time te report and comment on the same bug report. any progress on this? i am getting the same problem (overheating) with a samsung e272 () as well as an acer aspire 7540G (mobility 4570) on archlinux with kernel 3.1.6 and latest git-snapshots of xf86-video-ati. furthermore, at least on archlinux the required procedure to use powersaving (setting power_profile to "low" using dynpm power_method) on these cards do not work either for various reasons (flickering, screen artifacts, all terms even tty* completely frozen). what is required to get this one sorted out because it is simply not possible to use the open source graphics drivers with these two laptops. i'll happily provide any information necessary. sorry, forgot to write the ati card in the samsung e272 is a 4650. i'm might just be adding noise, but this might also help some people... ** WARNING ** THIS CAN BREAK YOUR CARD MY SETTINGS ARE FOR *MY* CARD, EACH CARD IS DIFFERENT IF YOU DONT WANT TO LOSE YOUR PC, DONT EVEN TRY TO DO THIS!! YOU HAVE BEEN WARNED! ** WARNING ** i found that my video card bios (HD2600xt, AGP, on a mid-tower) only had one power profile (high), no matter what profile i send to the /sys/class/drm/card0/device/power_profile, the /sys/kernel/debug/dri/0/radeon_pm_info stayed the same... i tried to underclock the card with the rovclock, but it too old and doesnt support current chips... after some research i found a windows util (sadly its windows only) that allow one to download the card bios and change the power profiles and re-upload that changed bios. It is a dangerous thing, one might brick the card with wrong settings, so i tested with one older card and tested all frequency settings in windows, using the amd/ati drivers overclock setting and all worked fine so i tried in my main card and i backup my old firmware, prepare a boot floppy to restore it and start to change the profiles...i tried always small steps, with the low profile being the testing one and the default always the normal settings, so i could test in linux the switch between profiles and detect the problems, and fall back to the default with a simple cold boot i keep the same boot settings (so less risk of breaking things), keep the max freq (800MHz @ 1.2V), but created a mid an lower profiles with less 1.1V and with the card running at 350MHz and 200MHz with this changed, and running the low profile, i dropped the temperature from my card from about 80ºC to 59ºC and the card heatsink fan also run slower of course, the card is also running slower, but i'm not using 3D and for 2D its fine. when i need more graphic power, i can manually change the power profile to high. the url for that windows software is http://www.techpowerup.com/rbe/ now, this is a last thing we would want to do, but might help with broken card bios like mine, where the power profiles are broken or nonexistent and help people break free from the card builder settings. also, it would be a lot better that someone created a new linux side under/overclock tool, specially using the radeon modules/drivers. Maybe this tool could help debuging and improving the power control of radeon cards in linux, but creating various profiles and comparing what changes between then. with this maybe we could control the power profiles from linux, without messing with the card bios. finally, and to compare, this same card, running in windows, without underclock runs idle at 53ºC and under heavy usage about 90ºC, so windows is clearly sending some type of IDLE command to the GPU that cuts power and heat usage. finding and using that GPU IDLE command would also help a lot solving this problem again, this is dangerous, its not for everyone (you must understand how thing works and what are the risks) and requires a MS Windows usage. i have no idea if this works on laptops. good luck higuita (In reply to comment #6) > See the "KMS Power Management Options" section of this page: > http://wiki.x.org/wiki/RadeonFeature As written in https://bugs.freedesktop.org/show_bug.cgi?id=41762, I have "Mobility Radeon HD3650", and as I understand it is based on R600 chipset. The feature matrix on the RadeonFeature wiki page says all the power saving/management features are implemented for this card. But power saving doesn't seem to work. The laptop overheats and shuts down before I even get to the LiveCD desktop/shell to be able to switch to the power_profile "low". Do you know if this is an error on the wiki page, or a bug in the radeon driver? (In reply to comment #17) > (In reply to comment #6) > > See the "KMS Power Management Options" section of this page: > > http://wiki.x.org/wiki/RadeonFeature > > As written in https://bugs.freedesktop.org/show_bug.cgi?id=41762, > I have "Mobility Radeon HD3650", and as I understand it is based on R600 > chipset. > The feature matrix on the RadeonFeature wiki page says all the power > saving/management features are implemented for this card. > > But power saving doesn't seem to work. The laptop overheats and shuts down > before I even get to the LiveCD desktop/shell to be able to switch to the > power_profile "low". > > Do you know if this is an error on the wiki page, or a bug in the radeon > driver? This problem seems to be specific to your system. Please make sure the heat sinks and fans are clear of dust and and that the acpi thermal zones for your system are working properly. Hi! The overheating of my HP Elitebook 8530p with Mobility Radeon HD 3650 video card has been bothering me for a while now, however, I've just been researching the issue today. (In reply to comment #18) > > This problem seems to be specific to your system. Please make sure the heat > sinks and fans are clear of dust and and that the acpi thermal zones for your > system are working properly. Alex, I can confirm this issue is not specific to his system. I have the same laptop (HP Elitebook 8530p) with the same graphics card (Mobility Radeon HD 3650 video card - AMD RV635) and I'm also experiencing the same issues. I've also thought the issue is with the "dust carpet" around the heat fan/sink, but it is not the case. Since I bought the laptop in Nov 2008, I had to remove the dust around the heat fan/sink every half year to keep the laptop cool (the temperature dropped around 5 degrees Celsius after cleaning), so I know this could be an issue. Only today, I managed to decrees the laptop's temperature by using the fix suggested by Pasi in bug #41762, namely switching the power_profile from "default" to "low": [root@tlinux64 ~]# cat /sys/class/drm/card0/device/power_profile default [root@tlinux64 ~]# cat /sys/kernel/debug/dri/0/radeon_pm_info default engine clock: 600000 kHz current engine clock: 594000 kHz default memory clock: 700000 kHz current memory clock: 693000 kHz voltage: 1100 mV PCIE lanes: 16 [root@tlinux64 ~]# echo low > /sys/class/drm/card0/device/power_profile [root@tlinux64 ~]# cat /sys/class/drm/card0/device/power_profile low [root@tlinux64 ~]# cat /sys/kernel/debug/dri/0/radeon_pm_info default engine clock: 600000 kHz current engine clock: 109680 kHz default memory clock: 700000 kHz current memory clock: 405000 kHz voltage: 900 mV PCIE lanes: 16 [root@tlinux64 ~] As I commented in bug #41762, the system is around ~15 degrees colder now. Alex, does this convince you that the issue is real and it should be worked on? I've been regularly using: echo low | sudo tee /sys/class/drm/card0/device/power_profile I've am also using psensor to warn me if the temperature rises above 92°. When 'idle' the temperature is at about 80°. If the CPU is used (merely starting eclipse and letting all the the startup validation processes finish) it quickly rises to 88°. If the usages is sustained (rebuilding a java project) the temperature oscillates between 95° and 98°. If I do anything else that adds to the load (ie. if the temperature is >98° for short period of time), the system freezes, but the everything remains on, fans and CPU temperature stays up. I need to turn off the notebook by keeping the power button pressed. Note that this system overheats /with/ the low power profile. Here are some general stats about this system, please let me know, if I can provide further information: HP Presario CQ61 320eg AMD Turion(tm) II Dual-Core Mobile M500 01:00.0 VGA compatible controller: Advanced Micro Devices [AMD] nee ATI RV710 [Mobility Radeon HD 4300 Series] (prog-if 00 [VGA controller]) Linux schiefer 3.2.0-33-generic #52-Ubuntu SMP Thu Oct 18 16:29:15 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux ayers@schiefer:~$ lsmod|grep rad radeon 804503 2 ttm 76949 1 radeon drm_kms_helper 46978 1 radeon drm 241921 4 radeon,ttm,drm_kms_helper i2c_algo_bit 13423 1 radeon This started after the upgrading from Maverick. I'm currently running Precise with the 2D Unity desktop. Same situation on my Lenovo T500 Notebook with the following card: 01:00.0 VGA compatible controller: Advanced Micro Devices [AMD] nee ATI RV635 [Mobility Radeon HD 3650] If my Notebook is in IDLE-state the temperature stays about 75° and I hear the noise of the fan nearly all the time. Heavy load increases temp to over 90° and it will shutdown due to the overheating limits of kernel. My workaround is switching to intel hd graphics in bios because setting power profile to low seems to be slower than using the integrated graphics adapter. Thats obviously no real solution because I want to use the full power of the ati graphics card without switching to fglrx driver. I have the same issue with a Lenovo Thinkpad Edge 13. robi@edge:~$ lspci|grep VGA 01:05.0 VGA compatible controller: Advanced Micro Devices [AMD] nee ATI RS880M [Mobility Radeon HD 4200 Series] Also there seem to be other reports of the same issue: bug #41234, #41762, #50327, #52001, and #54791. Yes, I have the same problem on Asus laptop with Mobility Radeon HD 3200 video. Operation system - Ubuntu 12.10 hi there, i have the same problem with my Asus laptop, X5DAF, AMD Turion II @ 2.30 GHz, 4 GB RAM elhoir@elhoir-laptop:~$ lspci | grep VGA 01:00.0 VGA compatible controller: Advanced Micro Devices [AMD] nee ATI RV710 [Mobility Radeon HD 4500/5100 Series] elhoir@elhoir-laptop:~$ I have had this problem for years and it is the sole reason I had used the proprietary ATI driver. Unfortunately new versions of the ATI drivers don't support my card anymore (Radeon HD 4670) so I am kind of waiting for a fix on the open source driver side ... If you don't want to trouble yourself too much I guess there are two options. One is to use an older linux distro such as Ubuntu 12.04 or Debian 6 (CentOS 6 might fit as well). Another take is to go for something quite up-to-date such as Arch linux (help, wiki are great there) and hope for a fix to be released soon ... I can confirm this bug also on my card: Ati Mobility radeon HD 4500, RV710 I always had overheating problems with the opensource driver, so I used the binary driver from ATI. Actually with the latest updates the ati-closed-driver doesn't work anymore also on ubuntu 12.04 So the only solution is to install the old LTS from Ubuntu or debian stable. I can provide additional information about my hardware and my issue. This bug was around from a lot of time right now and is potentially harmful about the hardware. Can be possible to correct the bug (at least initially) forcing the driver to "low" profile? it will be less responsive but at least it will be safe to use. I didn't get to test the Radeon driver much (stuck with fglrx due to an issue) but one of the things that bothered me is the lack of a properly adjusted fan speed. I fully support a dynamic system of adjustment. At least with 7.0.0, the fan speed is constantly slow with the "low" profile and constantly fast with the "high" profile. Regardless of what the card is doing, how much it's heated, or how hard it's working. My suggestion is to adjust fan speed (within a minimum and maximum range) either based on GPU / VRAM frequencies or based on card temperature when a temp sensor is available. This would be most useful with the dynpm method, so fan speed is only set to maximum when you play a game and matches the requirement. Better power management for AMD/ATI Radeon R600 and newer hardware is finally available in the upstream 3.11 linux kernel. For people using Ubuntu, the first release candidate (3.11-rc1) of the 3.11 kernel is available at http://kernel.ubuntu.com/~kernel-ppa/mainline/ and instructions on how to install and uninstall it are available at https://wiki.ubuntu.com/Kernel/MainlineBuilds Fedora users can get the 3.11-rc1 kernel from Rawhide at http://mirrors.fedoraproject.org/publiclist/Fedora/development/ For more information about using Rawhide, see https://fedoraproject.org/wiki/Releases/Rawhide?rd=Rawhide Users of other Linux distributions will need to obtain the 3.11 kernel by other means. For now, to use this power management for the AMD/ATI Radeon you will need to select it at boot by adding radeon.dpm=1 to your GRUB kernel boot options as described at https://help.ubuntu.com/community/Grub2/Troubleshooting#Editing_the_GRUB_2_Menu_During_Boot Good news indeed. Will openSUSE 13.1 have the kernel with this feature? Also, is this going to be defaulted, or will we have to manually add the GRUB parameter? Also, what exactly does this new feature do? Does it add adaptive fan speed and / or better GPU frequency selection (though that's already possible via dynpm)? For further information see my blog post: http://www.botchco.com/agd5f/?p=57 Thanks Alex. I also note that at http://phoronix.com/forums/showthread.php?81666-AMD-Has-Massive-Radeon-Patch-Set-Power-Management!&p=338745#post338745 you said "DPM works fine with multiple monitors." which is great given that http://wiki.x.org/wiki/RadeonFeature/#index3h2 says that the older "dynpm" method only works when a single head is active. I presume that this also means that there is no/much less possibility of flicker on a single display configuration with DPM than with dynpm? Regarding openSUSE 13.1, all I know is that Milestone 1 had the 3.9 kernel and Milestone 2 has the 3.10-rc4 kernel, and that the final release is scheduled for November (or Movember if you are planning to grow a moustache to raise money for mens health charities). Yes, it works with multiple monitors and there shouldn't be any flickering as the performance level changes are handled by dedicated hardware rather than the driver. I realize that this is several years old but it's the closest match to the symptoms I'm searching. The last comment was "it works" but my AMD 6870 based desktop card rages with fans at full speed from start up to shut down. linux 3.18.6-1 xf86-video-ati 1:7.5.0-2 Considered Invalid downstream. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.