Bug 108260

Summary: [Regression?] [powerplay] Failed to retrieve minimum clocks. 4.19-rc6+
Product: DRI Reporter: taijian
Component: DRM/AMDgpuAssignee: Default DRI bug account <dri-devel>
Status: RESOLVED FIXED QA Contact:
Severity: normal    
Priority: medium CC: aklhfex, cwidmer, darkdefende, dex+fdobugzilla, garththeisen
Version: DRI git   
Hardware: x86-64 (AMD64)   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:
Attachments:
Description Flags
relevant dmesg output
none
dmesg R9 390
none
kernel log 290x none

Description taijian 2018-10-06 19:58:23 UTC
Created attachment 141923 [details]
relevant dmesg output

After rebuilding the mainline kernel from git today in order to see if the recent change: "PCI: Reprogram bridge prefetch registers on resume" would help with my crash issues. It does seem to do this, however I noticed that my dGPU would now no longer properly power down. dmesg give me this:

amdgpu: [powerplay] Failed to retrieve minimum clocks.
amdgpu: [powerplay] Error in phm_get_clock_info 

This error did not occur on previous kernel versions.
Comment 1 taijian 2018-10-08 13:28:27 UTC
I just checked: This problem remains with 4.19-rc7 but does not exist with 4.18.12.

Also:
# lspci -tvv
-[0000:00]-+-00.0  Intel Corporation Xeon E3-1200 v6/7th Gen Core Processor Host Bridge/DRAM Registers
           +-01.0-[01]--+-00.0  Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/570/570X/580/580X]
           |            \-00.1  Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 580]
           +-01.2-[02-3a]--
           +-02.0  Intel Corporation Device 591b
           +-04.0  Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem
           +-14.0  Intel Corporation Sunrise Point-H USB 3.0 xHCI Controller
           +-14.2  Intel Corporation Sunrise Point-H Thermal subsystem
           +-16.0  Intel Corporation Sunrise Point-H CSME HECI #1
           +-17.0  Intel Corporation Sunrise Point-H SATA controller [AHCI mode]
           +-1b.0-[3b]----00.0  Toshiba America Info Systems XG4 NVMe SSD Controller
           +-1c.0-[3c]--
           +-1c.5-[3d]----00.0  Qualcomm Atheros QCA6174 802.11ac Wireless Network Adapter
           +-1d.0-[3e]----00.0  Samsung Electronics Co Ltd NVMe SSD Controller SM961/PM961
           +-1f.0  Intel Corporation Sunrise Point-H LPC Controller
           +-1f.2  Intel Corporation Sunrise Point-H PMC
           +-1f.3  Intel Corporation CM238 HD Audio Controller
           \-1f.4  Intel Corporation Sunrise Point-H SMBus

The empty 01.2- branch is where my Thunderbolt controller lives. Not visible because of runtime pm.
Comment 2 taijian 2018-10-10 07:50:28 UTC
Same error also occurs with the latest drm-next-4.20-wip as of commit 	e5560166811c07862d7ed9790674b2763c71f12e.
Comment 3 Daniel Exner 2018-10-15 08:51:04 UTC
Can confirm the same message on:

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/570/570X/580/580X] (rev e7) (prog-if 00 [VGA controller])
        Subsystem: Tul Corporation / PowerColor Ellesmere [Radeon RX 470/480/570/570X/580/580X]


Kernel 4.19.0-rc8
Comment 4 Garth Theisen 2018-10-23 01:27:22 UTC
Created attachment 142147 [details]
dmesg R9 390
Comment 5 Garth Theisen 2018-10-23 01:28:12 UTC
Comment on attachment 142147 [details]
dmesg R9 390

Similar error ...

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Hawaii PRO [Radeon R9 290/390] (rev 80)
        Subsystem: XFX Pine Group Inc. Hawaii PRO [Radeon R9 290/390]
        Kernel driver in use: amdgpu
        Kernel modules: amdgpu

[    7.429944] amdgpu: [powerplay] Failed to retrieve minimum clocks.
[    7.429945] amdgpu: [powerplay] Error in phm_get_clock_info 
...
[   18.949366] amdgpu 0000:01:00.0: amdgpu_device_ip_init failed
[   18.949370] amdgpu 0000:01:00.0: Fatal error during GPU init
[   18.949387] [drm] amdgpu: finishing device.

But the device fails to init.

Kernel 4.19.0
linux-firmware.git
Comment 6 Sebastian Parborg 2018-10-24 21:21:44 UTC
Created attachment 142179 [details]
kernel log 290x

I have the same problem on 4.19.0 with my 290x. It fails to init the GPU. I have a similar failure as with the 390 already posted.
Comment 7 taijian 2018-10-29 20:42:12 UTC
OK, the error messages seem to be gone on the latest Linux-mainline compiled from Linus' git.
Comment 8 Daniel Exner 2018-11-05 18:04:28 UTC
Can confirm: Message is gone on 4.20-rc1
Comment 9 tantrido 2018-11-20 19:15:30 UTC
I also have this bug with nvidia driver - no graphic login screen only console one.
Comment 10 Alexandr Akulich 2018-11-27 06:05:48 UTC
I have the same problem on the latest release 4.19.4. Anyone know the fix commit (to pick it from the upcoming 4.20)? It seems that there is no fix in 4.19.5-rc1
Comment 11 Martin Jørgensen 2019-01-24 07:39:49 UTC
Running on debian buster with 4.19 kernel I get the same warning, but only this warning. No errors regarding the GPU. 

Will there be a patch for 4.19, or do we simply have to upgrade or manually compile a kernel?

amdgpu: [powerplay] Failed to retrieve minimum clocks.


uname -a :


Linux mkjws 4.19.0-1-amd64 #1 SMP Debian 4.19.12-1 (2018-12-22) x86_64 GNU/Linux


lspci -v :

42:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480] (rev e7) (prog-if 00 [VGA controller])
	Subsystem: ASUSTeK Computer Inc. Ellesmere [Radeon RX 470/480/570/570X/580/580X/590]
	Flags: bus master, fast devsel, latency 0, IRQ 70, NUMA node 1
	Memory at 4600000000 (64-bit, prefetchable) [size=8G]
	Memory at 4500000000 (64-bit, prefetchable) [size=2M]
	I/O ports at 2000 [size=256]
	Memory at 9f600000 (32-bit, non-prefetchable) [size=256K]
	Expansion ROM at 9f640000 [disabled] [size=128K]
	Capabilities: [48] Vendor Specific Information: Len=08 <?>
	Capabilities: [50] Power Management version 3
	Capabilities: [58] Express Legacy Endpoint, MSI 00
	Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
	Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
	Capabilities: [150] Advanced Error Reporting
	Capabilities: [200] #15
	Capabilities: [270] #19
	Capabilities: [2b0] Address Translation Service (ATS)
	Capabilities: [2c0] Page Request Interface (PRI)
	Capabilities: [2d0] Process Address Space ID (PASID)
	Capabilities: [320] Latency Tolerance Reporting
	Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
	Capabilities: [370] L1 PM Substates
	Kernel driver in use: amdgpu
	Kernel modules: amdgpu
Comment 12 Alex Deucher 2019-01-24 14:11:43 UTC
(In reply to Martin Jørgensen from comment #11)
> Running on debian buster with 4.19 kernel I get the same warning, but only
> this warning. No errors regarding the GPU. 
> 
> Will there be a patch for 4.19, or do we simply have to upgrade or manually
> compile a kernel?
> 
> amdgpu: [powerplay] Failed to retrieve minimum clocks.

This warning is harmless.
Comment 13 Martin Jørgensen 2019-01-27 08:36:24 UTC
I see

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.