Bug 112221 - RX 580 fails to resume sometimes with 5.3 kernel (Ubuntu 19.10)
Summary: RX 580 fails to resume sometimes with 5.3 kernel (Ubuntu 19.10)
Status: RESOLVED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/AMDgpu (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: not set not set
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2019-11-06 15:37 UTC by Kai Groner
Modified: 2019-11-07 14:08 UTC (History)
1 user (show)

See Also:
i915 platform:
i915 features:


Attachments
Kernel logs with resume errors (340.38 KB, application/gzip)
2019-11-06 15:37 UTC, Kai Groner
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Kai Groner 2019-11-06 15:37:36 UTC
Created attachment 145903 [details]
Kernel logs with resume errors

In the last two weeks the gpu has failed to resume 4 out of 26 suspend/resume cycles.  Prior to that, the 5.0 kernel (Ubuntu 9.04) did not have this problem.

Here are excerpts from the resume failures.  I am attaching the full kernel logs from the four crashes.  The first group (gfx_v8_0) has happened three times, the second group (sdma_v3_0) has happened one time.

--
amdgpu 0000:07:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring gfx test failed (-110)
[drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v8_0> failed -110
[drm:amdgpu_device_resume [amdgpu]] *ERROR* amdgpu_device_ip_resume failed (-110).
PM: dpm_run_callback(): pci_pm_resume+0x0/0xa0 returns -110
PM: Device 0000:07:00.0 failed to resume async: error -110
--
amdgpu 0000:07:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring sdma0 test failed (-110)
[drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <sdma_v3_0> failed -110
[drm:amdgpu_device_resume [amdgpu]] *ERROR* amdgpu_device_ip_resume failed (-110).
PM: dpm_run_callback(): pci_pm_resume+0x0/0xa0 returns -110
PM: Device 0000:07:00.0 failed to resume async: error -110
Comment 1 Alex Deucher 2019-11-06 15:51:45 UTC
Can you try kernel 5.4?  There have a number of suspend and resume fixes.  They should filter back to stable kernels as well.
Comment 2 Kai Groner 2019-11-07 14:07:23 UTC
With the 5.4-rc6 kernel I was able to run a suspend/resume cycle 50 times without anything breaking.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.