Bug 111456 - amdgpu numerous failures on resume from suspend
Summary: amdgpu numerous failures on resume from suspend
Status: RESOLVED MOVED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/AMDgpu (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: not set normal
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2019-08-21 13:03 UTC by Fermulator
Modified: 2019-11-19 09:39 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments
Aug21 fermmy resume syslog full (after resume) (47.45 KB, text/plain)
2019-08-21 13:03 UTC, Fermulator
no flags Details

Description Fermulator 2019-08-21 13:03:15 UTC
Created attachment 145113 [details]
Aug21 fermmy resume syslog full (after resume)

This morning, ~8:40AM EST Aug21 2019, resumed my system from suspend.
Normally it just comes back AOK, but today it hung. The screens were all wonky (mostly grey with a speckle of colour), disk was going bonkers, couldn't SSH into it;  sometimes this happens and I just walk away for 10 mins and come back, sometimes it fixes itself, sometimes not.  Today I was not patient and just reset it after ~3mins of waiting.

Took a look at the previous boot system log and there were numerous AMDGPU related failures.

snippets include:
{{{
Aug 21 08:42:21 fermmy kernel: [drm:amdgpu_suspend [amdgpu]] *ERROR* suspend of IP block <uvd_v6_0> failed -12

Aug 21 08:42:24 fermmy kernel: do_IRQ: 0.34 No irq handler for vector

Aug 21 08:42:25 fermmy kernel: [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
Aug 21 08:42:25 fermmy kernel: [drm:amdgpu_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v8_0> failed -22
Aug 21 08:42:25 fermmy kernel: [drm:amdgpu_device_resume [amdgpu]] *ERROR* amdgpu_resume failed (-22).
Aug 21 08:42:25 fermmy kernel: dpm_run_callback(): pci_pm_resume+0x0/0xb0 returns -22
Aug 21 08:42:25 fermmy kernel: PM: Device 0000:07:00.0 failed to resume async: error -22

Aug 21 08:42:25 fermmy kernel: amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0>
Aug 21 08:42:25 fermmy kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
Aug 21 08:42:25 fermmy kernel: [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job!

}}}


Linux fermmy 4.15.0-58-generic #64-Ubuntu SMP Tue Aug 6 11:12:41 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux

Ubuntu 18.04.3 LTS \n \l

 dpkg --list | egrep -i "amd|dri-|xorg|mesa" | grep -v amd64
ii  libdrm-amdgpu1:i386                                              2.4.97-1ubuntu1~18.04.1                             i386         Userspace interface to amdgpu-specific kernel DRM services -- runtime
ii  libegl-mesa0:i386                                                19.1.2-0~b~padoka0                                  i386         free implementation of the EGL API -- Mesa vendor library
ii  libgl1-mesa-dri:i386                                             19.1.2-0~b~padoka0                                  i386         free implementation of the OpenGL API -- DRI modules
rc  libgl1-mesa-dri-lts-quantal:i386                                 9.0.3-0ubuntu0.4~precise1                           i386         free implementation of the OpenGL API -- DRI modules
rc  libgl1-mesa-dri-lts-utopic:i386                                  3:13                                                i386         Transitional package for libgl1-mesa-dri-lts-utopic
ii  libgl1-mesa-glx:i386                                             19.1.2-0~b~padoka0                                  i386         transitional dummy package
rc  libgl1-mesa-glx-lts-quantal:i386                                 9.0.3-0ubuntu0.4~precise1                           i386         free implementation of the OpenGL API -- GLX runtime
ii  libglapi-mesa:i386                                               19.1.2-0~b~padoka0                                  i386         free implementation of the GL API -- shared library
rc  libglapi-mesa-lts-quantal:i386                                   9.0.3-0ubuntu0.4~precise1                           i386         free implementation of the GL API -- shared library
ii  libglu1-mesa:i386                                                9.0.0-2.1build1                                     i386         Mesa OpenGL utility library (GLU)
ii  libglx-mesa0:i386                                                19.1.2-0~b~padoka0                                  i386         free implementation of the OpenGL API -- GLX vendor library
ii  libosmesa6:i386                                                  19.1.2-0~b~padoka0                                  i386         Mesa Off-screen rendering extension
rc  libtxc-dxtn-s2tc0:i386                                           0~git20131104-1.1                                   i386         Texture compression library for Mesa
ii  libwayland-egl1-mesa:i386                                        19.1.2-0~b~padoka0                                  i386         transitional dummy package
ii  python3-xkit                                                     0.5.0ubuntu2                                        all          library for the manipulation of xorg.conf files (Python 3)
ii  xorg-docs-core                                                   1:1.7.1-1.1                                         all          Core documentation for the X.org X Window System
ii  xorg-sgml-doctools                                               1:1.11-1                                            all          Common tools for building X.Org SGML documentation

$ sudo lshw -C video
  *-display                 
       description: VGA compatible controller
       product: Ellesmere [Radeon RX 470/480/570/570X/580/580X/590]
       vendor: Advanced Micro Devices, Inc. [AMD/ATI]
       physical id: 0
       bus info: pci@0000:07:00.0
       version: ef
       width: 64 bits
       clock: 33MHz
       capabilities: pm pciexpress msi vga_controller bus_master cap_list rom
       configuration: driver=amdgpu latency=0
       resources: irq:63 memory:c0000000-cfffffff memory:d0000000-d01fffff ioport:c000(size=256) memory:fe600000-fe63ffff memory:c0000-dffff


My monitors are 3-fold atm (normally only run with 2x)
2xDVI + 1xDP

$ xrandr | grep normal
DisplayPort-0 connected 1680x1050+3840+25 (normal left inverted right x axis y axis) 474mm x 296mm
HDMI-A-0 disconnected (normal left inverted right x axis y axis)
DVI-D-0 connected 1920x1080+1920+25 (normal left inverted right x axis y axis) 521mm x 293mm
DVI-D-1 connected primary 1920x1080+0+0 (normal left inverted right x axis y axis) 521mm x 293mm
Comment 1 Martin Peres 2019-11-19 09:39:05 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/890.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.