Bug 104437

Summary: RX480 + SME = amdgpu init failed + null dereference
Product: DRI Reporter: Asbjørn Sannes <ace>
Component: DRM/AMDgpuAssignee: Default DRI bug account <dri-devel>
Status: RESOLVED MOVED QA Contact:
Severity: normal    
Priority: medium CC: sarnex
Version: DRI git   
Hardware: x86-64 (AMD64)   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:
Attachments:
Description Flags
dmesg when not working
none
dmesg_sme.txt none

Description Asbjørn Sannes 2018-01-01 16:20:53 UTC
Created attachment 136470 [details]
dmesg when not working

When upgrading from linux 4.13.x -> 4.14.x the amdgpu failed to come up, often hanging my desktop computer.

Ending the amdgpg bring up with:
[drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
[drm:amdgpu_device_init [amdgpu]] *ERROR* hw_init of IP block <gfx_v8_0> failed -22
amdgpu 0000:0a:00.0: amdgpu_init failed


After getting some help in #radeon we figured out that the culprit was that I had enabled SME (which the CPU Ryzen X1800 support).


Got the same error on 4.15-rc6, but with additional null dereference in DC code.
Comment 1 Nick Sarnie 2019-06-15 16:21:21 UTC
Created attachment 144553 [details]
dmesg_sme.txt

Same issue using 5.1.9, RX Vega 64 and Ryzen 2700x.

[   19.998488] [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* sw_init of IP block <vega10_ih> failed -12
[   19.998490] amdgpu 0000:0c:00.0: amdgpu_device_ip_init failed
[   19.998492] amdgpu 0000:0c:00.0: Fatal error during GPU init
[   19.998493] [drm] amdgpu: finishing device.
[   19.998601] [drm] amdgpu: ttm finalized


I've attached my dmesg as well.

Thanks,
Sarnex
Comment 2 Michel Dänzer 2019-06-17 09:09:07 UTC
(In reply to Nick Sarnie from comment #1)
> 
> Same issue [...]

The symptoms are clearly different, therefore please file your own report.
Comment 3 Nick Sarnie 2019-06-17 22:22:30 UTC
Sorry, filed https://bugs.freedesktop.org/show_bug.cgi?id=110929
Comment 4 Martin Peres 2019-11-19 08:28:10 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/285.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.