Created attachment 141923 [details] relevant dmesg output After rebuilding the mainline kernel from git today in order to see if the recent change: "PCI: Reprogram bridge prefetch registers on resume" would help with my crash issues. It does seem to do this, however I noticed that my dGPU would now no longer properly power down. dmesg give me this: amdgpu: [powerplay] Failed to retrieve minimum clocks. amdgpu: [powerplay] Error in phm_get_clock_info This error did not occur on previous kernel versions.
I just checked: This problem remains with 4.19-rc7 but does not exist with 4.18.12. Also: # lspci -tvv -[0000:00]-+-00.0 Intel Corporation Xeon E3-1200 v6/7th Gen Core Processor Host Bridge/DRAM Registers +-01.0-[01]--+-00.0 Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/570/570X/580/580X] | \-00.1 Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 580] +-01.2-[02-3a]-- +-02.0 Intel Corporation Device 591b +-04.0 Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem +-14.0 Intel Corporation Sunrise Point-H USB 3.0 xHCI Controller +-14.2 Intel Corporation Sunrise Point-H Thermal subsystem +-16.0 Intel Corporation Sunrise Point-H CSME HECI #1 +-17.0 Intel Corporation Sunrise Point-H SATA controller [AHCI mode] +-1b.0-[3b]----00.0 Toshiba America Info Systems XG4 NVMe SSD Controller +-1c.0-[3c]-- +-1c.5-[3d]----00.0 Qualcomm Atheros QCA6174 802.11ac Wireless Network Adapter +-1d.0-[3e]----00.0 Samsung Electronics Co Ltd NVMe SSD Controller SM961/PM961 +-1f.0 Intel Corporation Sunrise Point-H LPC Controller +-1f.2 Intel Corporation Sunrise Point-H PMC +-1f.3 Intel Corporation CM238 HD Audio Controller \-1f.4 Intel Corporation Sunrise Point-H SMBus The empty 01.2- branch is where my Thunderbolt controller lives. Not visible because of runtime pm.
Same error also occurs with the latest drm-next-4.20-wip as of commit e5560166811c07862d7ed9790674b2763c71f12e.
Can confirm the same message on: 01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/570/570X/580/580X] (rev e7) (prog-if 00 [VGA controller]) Subsystem: Tul Corporation / PowerColor Ellesmere [Radeon RX 470/480/570/570X/580/580X] Kernel 4.19.0-rc8
Created attachment 142147 [details] dmesg R9 390
Comment on attachment 142147 [details] dmesg R9 390 Similar error ... 01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Hawaii PRO [Radeon R9 290/390] (rev 80) Subsystem: XFX Pine Group Inc. Hawaii PRO [Radeon R9 290/390] Kernel driver in use: amdgpu Kernel modules: amdgpu [ 7.429944] amdgpu: [powerplay] Failed to retrieve minimum clocks. [ 7.429945] amdgpu: [powerplay] Error in phm_get_clock_info ... [ 18.949366] amdgpu 0000:01:00.0: amdgpu_device_ip_init failed [ 18.949370] amdgpu 0000:01:00.0: Fatal error during GPU init [ 18.949387] [drm] amdgpu: finishing device. But the device fails to init. Kernel 4.19.0 linux-firmware.git
Created attachment 142179 [details] kernel log 290x I have the same problem on 4.19.0 with my 290x. It fails to init the GPU. I have a similar failure as with the 390 already posted.
OK, the error messages seem to be gone on the latest Linux-mainline compiled from Linus' git.
Can confirm: Message is gone on 4.20-rc1
I also have this bug with nvidia driver - no graphic login screen only console one.
I have the same problem on the latest release 4.19.4. Anyone know the fix commit (to pick it from the upcoming 4.20)? It seems that there is no fix in 4.19.5-rc1
Running on debian buster with 4.19 kernel I get the same warning, but only this warning. No errors regarding the GPU. Will there be a patch for 4.19, or do we simply have to upgrade or manually compile a kernel? amdgpu: [powerplay] Failed to retrieve minimum clocks. uname -a : Linux mkjws 4.19.0-1-amd64 #1 SMP Debian 4.19.12-1 (2018-12-22) x86_64 GNU/Linux lspci -v : 42:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480] (rev e7) (prog-if 00 [VGA controller]) Subsystem: ASUSTeK Computer Inc. Ellesmere [Radeon RX 470/480/570/570X/580/580X/590] Flags: bus master, fast devsel, latency 0, IRQ 70, NUMA node 1 Memory at 4600000000 (64-bit, prefetchable) [size=8G] Memory at 4500000000 (64-bit, prefetchable) [size=2M] I/O ports at 2000 [size=256] Memory at 9f600000 (32-bit, non-prefetchable) [size=256K] Expansion ROM at 9f640000 [disabled] [size=128K] Capabilities: [48] Vendor Specific Information: Len=08 <?> Capabilities: [50] Power Management version 3 Capabilities: [58] Express Legacy Endpoint, MSI 00 Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+ Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?> Capabilities: [150] Advanced Error Reporting Capabilities: [200] #15 Capabilities: [270] #19 Capabilities: [2b0] Address Translation Service (ATS) Capabilities: [2c0] Page Request Interface (PRI) Capabilities: [2d0] Process Address Space ID (PASID) Capabilities: [320] Latency Tolerance Reporting Capabilities: [328] Alternative Routing-ID Interpretation (ARI) Capabilities: [370] L1 PM Substates Kernel driver in use: amdgpu Kernel modules: amdgpu
(In reply to Martin Jørgensen from comment #11) > Running on debian buster with 4.19 kernel I get the same warning, but only > this warning. No errors regarding the GPU. > > Will there be a patch for 4.19, or do we simply have to upgrade or manually > compile a kernel? > > amdgpu: [powerplay] Failed to retrieve minimum clocks. This warning is harmless.
I see
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.