Bug 104437 - RX480 + SME = amdgpu init failed + null dereference
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/AMDgpu
Version: DRI git
Hardware: x86-64 (AMD64) Linux (All)
Importance: medium normal
Assignee: Default DRI bug account
QA Contact:
Depends on:
Reported: 2018-01-01 16:20 UTC by Asbjørn Sannes
Modified: 2019-11-19 08:28 UTC

dmesg when not working (79.53 KB, text/plain)
2018-01-01 16:20 UTC, Asbjørn Sannes
dmesg_sme.txt (76.88 KB, text/plain)
2019-06-15 16:21 UTC, Nick Sarnie

Description Asbjørn Sannes 2018-01-01 16:20:53 UTC
Created attachment 136470 [details]
dmesg when not working

After upgrading from Linux 4.13.x to 4.14.x, amdgpu fails to come up, often hanging my desktop computer.

The amdgpu bring-up ends with:
[drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
[drm:amdgpu_device_init [amdgpu]] *ERROR* hw_init of IP block <gfx_v8_0> failed -22
amdgpu 0000:0a:00.0: amdgpu_init failed

After getting some help in #radeon, we figured out that the culprit was that I had enabled SME (which my CPU, a Ryzen 1800X, supports).
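
For anyone trying to reproduce this, a minimal sketch for checking whether SME was explicitly requested on the kernel command line. It assumes SME is toggled with the `mem_encrypt=on` / `mem_encrypt=off` kernel parameter; the report does not say how SME was enabled here. The function name `sme_requested` is hypothetical, and treating the last occurrence as the effective one is an assumption of this sketch.

```python
def sme_requested(cmdline):
    """Return True/False for an explicit mem_encrypt=on/off, else None.

    None means the parameter is absent, in which case the kernel's
    built-in default applies (assumed to depend on the build option
    CONFIG_AMD_MEM_ENCRYPT_ACTIVE_BY_DEFAULT).
    """
    result = None
    for token in cmdline.split():
        if token == "mem_encrypt=on":
            result = True
        elif token == "mem_encrypt=off":
            result = False
    return result
```

On a running system the input would be the contents of /proc/cmdline, for example `sme_requested(open("/proc/cmdline").read())`.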

I get the same error on 4.15-rc6, with an additional null pointer dereference in the DC code.
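
To pull the failing IP block and error code out of a long dmesg dump, a small sketch like the following may help. It relies only on the "... of IP block <name> failed <code>" message format quoted above; the function name `find_ip_block_failures` is hypothetical, and the errno lookup assumes the kernel's negative return codes map to the standard errno numbers.

```python
import errno
import re

# Matches lines such as:
#   [drm:amdgpu_device_init [amdgpu]] *ERROR* hw_init of IP block <gfx_v8_0> failed -22
# A leading "[ 12.345678] " timestamp is tolerated because search() is unanchored.
IP_BLOCK_FAILURE = re.compile(
    r"\*ERROR\* (?P<stage>\w+) of IP block <(?P<block>[^>]+)> failed (?P<code>-?\d+)"
)


def find_ip_block_failures(dmesg_text):
    """Return (stage, block, code, errno_name) for each IP block init failure."""
    failures = []
    for line in dmesg_text.splitlines():
        match = IP_BLOCK_FAILURE.search(line)
        if match:
            code = int(match["code"])
            name = errno.errorcode.get(abs(code), "unknown")
            failures.append((match["stage"], match["block"], code, name))
    return failures
```

Applied to the lines above, this reports `hw_init` of `gfx_v8_0` failing with -22, which corresponds to EINVAL.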
Comment 1 Nick Sarnie 2019-06-15 16:21:21 UTC
Created attachment 144553 [details]
dmesg_sme.txt

Same issue with kernel 5.1.9, an RX Vega 64, and a Ryzen 2700X.

[   19.998488] [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* sw_init of IP block <vega10_ih> failed -12
[   19.998490] amdgpu 0000:0c:00.0: amdgpu_device_ip_init failed
[   19.998492] amdgpu 0000:0c:00.0: Fatal error during GPU init
[   19.998493] [drm] amdgpu: finishing device.
[   19.998601] [drm] amdgpu: ttm finalized

I've attached my dmesg as well.

Comment 2 Michel Dänzer 2019-06-17 09:09:07 UTC
(In reply to Nick Sarnie from comment #1)
> Same issue [...]

The symptoms are clearly different, therefore please file your own report.
Comment 3 Nick Sarnie 2019-06-17 22:22:30 UTC
Sorry, filed https://bugs.freedesktop.org/show_bug.cgi?id=110929
Comment 4 Martin Peres 2019-11-19 08:28:10 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe to and participate in the new bug on our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/285.