Bug 112297 - AMDGPU.gpu_recovery does recover desktop to an unusable state
Summary: AMDGPU.gpu_recovery does recover desktop to an unusable state
Status: RESOLVED MOVED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/AMDgpu (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: highest critical
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2019-11-15 12:09 UTC by David Biró
Modified: 2019-11-19 10:01 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments

Description David Biró 2019-11-15 12:09:32 UTC
I have got Radeon RX 5700 XT which has unstable driver. So that amdgpu.gpu_recovery=1 flag would be helpful if even that could work well.
Unfortunately it doesn't.

My kernel's latest commit ID: 53dbc27ad5a93932ff1892a8e4ef266827d74a0f ( fix AVFS handling with custom powerplay table) On branch drm-next-5.5
Display server: X11
WM: BSPWM
DM: Slim

Journalctl output:
https://pastebin.com/if4PYda7

The error is the "average" ring sdma0 timeout. After reset, the actual application that caused the crash, stays on the screen, The screen is flickering, vibrating, I'm frequently able to see some green rectangle at the top of the screen with full width. But I can able to move mouse. Also I cannot do anything else, because then BSPWM can switch workspaces (I can see it by the type of the mouse cursor), but the crashed application stays on the screen anyway, so I have to restart.

At least, I can save my work if I remember which workspace contained my work.
Comment 1 Daniel Suarez 2019-11-15 15:11:14 UTC
Don't expect this to work anytime soon unfortunately. AMD really slacked off with the drivers for this and similar bug reports have been open for months. 

As for your work, save early, save often while writing. Don't rely on auto-save doing it for you.  Get in the habit of Ctrl+S'ing often.

Ideally you shouldn't be using a 5700 if you're dealing with sensitive work.
Comment 2 Martin Peres 2019-11-19 10:01:52 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/957.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.