112221 – RX 580 fails to resume sometimes with 5.3 kernel (Ubuntu 19.10)

Bug 112221 - RX 580 fails to resume sometimes with 5.3 kernel (Ubuntu 19.10)

Summary: RX 580 fails to resume sometimes with 5.3 kernel (Ubuntu 19.10)

Status:	RESOLVED FIXED

Alias:	None

Product:	DRI
Classification:	Unclassified
Component:	DRM/AMDgpu (show other bugs)
Version:	unspecified
Hardware:	x86-64 (AMD64) Linux (All)

Importance:	not set not set
Assignee:	Default DRI bug account
QA Contact:

URL:
Whiteboard:
Keywords:

Depends on:
Blocks:

Reported:	2019-11-06 15:37 UTC by Kai Groner
Modified:	2019-11-07 14:08 UTC (History)
CC List:	1 user (show)

See Also:
i915 platform:
i915 features:

Attachments
Kernel logs with resume errors (340.38 KB, application/gzip) 2019-11-06 15:37 UTC, Kai Groner	no flags	Details
View All

Description Kai Groner 2019-11-06 15:37:36 UTC

Created attachment 145903 [details]
Kernel logs with resume errors

In the last two weeks the gpu has failed to resume 4 out of 26 suspend/resume cycles.  Prior to that, the 5.0 kernel (Ubuntu 9.04) did not have this problem.

Here are excerpts from the resume failures.  I am attaching the full kernel logs from the four crashes.  The first group (gfx_v8_0) has happened three times, the second group (sdma_v3_0) has happened one time.

--
amdgpu 0000:07:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring gfx test failed (-110)
[drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v8_0> failed -110
[drm:amdgpu_device_resume [amdgpu]] *ERROR* amdgpu_device_ip_resume failed (-110).
PM: dpm_run_callback(): pci_pm_resume+0x0/0xa0 returns -110
PM: Device 0000:07:00.0 failed to resume async: error -110
--
amdgpu 0000:07:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring sdma0 test failed (-110)
[drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <sdma_v3_0> failed -110
[drm:amdgpu_device_resume [amdgpu]] *ERROR* amdgpu_device_ip_resume failed (-110).
PM: dpm_run_callback(): pci_pm_resume+0x0/0xa0 returns -110
PM: Device 0000:07:00.0 failed to resume async: error -110

Comment 1 Alex Deucher 2019-11-06 15:51:45 UTC

Can you try kernel 5.4?  There have a number of suspend and resume fixes.  They should filter back to stable kernels as well.

Comment 2 Kai Groner 2019-11-07 14:07:23 UTC

With the 5.4-rc6 kernel I was able to run a suspend/resume cycle 50 times without anything breaking.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.