Bug 111456 - amdgpu numerous failures on resume from suspend
Summary: amdgpu numerous failures on resume from suspend
Status: NEW
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/AMDgpu (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: not set normal
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2019-08-21 13:03 UTC by Fermulator
Modified: 2019-08-21 13:03 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments
Aug21 fermmy resume syslog full (after resume) (47.45 KB, text/plain)
2019-08-21 13:03 UTC, Fermulator
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Fermulator 2019-08-21 13:03:15 UTC
Created attachment 145113 [details]
Aug21 fermmy resume syslog full (after resume)

This morning, ~8:40AM EST Aug21 2019, resumed my system from suspend.
Normally it just comes back AOK, but today it hung. The screens were all wonky (mostly grey with a speckle of colour), disk was going bonkers, couldn't SSH into it;  sometimes this happens and I just walk away for 10 mins and come back, sometimes it fixes itself, sometimes not.  Today I was not patient and just reset it after ~3mins of waiting.

Took a look at the previous boot system log and there were numerous AMDGPU related failures.

snippets include:
{{{
Aug 21 08:42:21 fermmy kernel: [drm:amdgpu_suspend [amdgpu]] *ERROR* suspend of IP block <uvd_v6_0> failed -12

Aug 21 08:42:24 fermmy kernel: do_IRQ: 0.34 No irq handler for vector

Aug 21 08:42:25 fermmy kernel: [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
Aug 21 08:42:25 fermmy kernel: [drm:amdgpu_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v8_0> failed -22
Aug 21 08:42:25 fermmy kernel: [drm:amdgpu_device_resume [amdgpu]] *ERROR* amdgpu_resume failed (-22).
Aug 21 08:42:25 fermmy kernel: dpm_run_callback(): pci_pm_resume+0x0/0xb0 returns -22
Aug 21 08:42:25 fermmy kernel: PM: Device 0000:07:00.0 failed to resume async: error -22

Aug 21 08:42:25 fermmy kernel: amdgpu 0000:07:00.0: couldn't schedule ib on ring <sdma0>
Aug 21 08:42:25 fermmy kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
Aug 21 08:42:25 fermmy kernel: [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job!

}}}


Linux fermmy 4.15.0-58-generic #64-Ubuntu SMP Tue Aug 6 11:12:41 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux

Ubuntu 18.04.3 LTS \n \l

 dpkg --list | egrep -i "amd|dri-|xorg|mesa" | grep -v amd64
ii  libdrm-amdgpu1:i386                                              2.4.97-1ubuntu1~18.04.1                             i386         Userspace interface to amdgpu-specific kernel DRM services -- runtime
ii  libegl-mesa0:i386                                                19.1.2-0~b~padoka0                                  i386         free implementation of the EGL API -- Mesa vendor library
ii  libgl1-mesa-dri:i386                                             19.1.2-0~b~padoka0                                  i386         free implementation of the OpenGL API -- DRI modules
rc  libgl1-mesa-dri-lts-quantal:i386                                 9.0.3-0ubuntu0.4~precise1                           i386         free implementation of the OpenGL API -- DRI modules
rc  libgl1-mesa-dri-lts-utopic:i386                                  3:13                                                i386         Transitional package for libgl1-mesa-dri-lts-utopic
ii  libgl1-mesa-glx:i386                                             19.1.2-0~b~padoka0                                  i386         transitional dummy package
rc  libgl1-mesa-glx-lts-quantal:i386                                 9.0.3-0ubuntu0.4~precise1                           i386         free implementation of the OpenGL API -- GLX runtime
ii  libglapi-mesa:i386                                               19.1.2-0~b~padoka0                                  i386         free implementation of the GL API -- shared library
rc  libglapi-mesa-lts-quantal:i386                                   9.0.3-0ubuntu0.4~precise1                           i386         free implementation of the GL API -- shared library
ii  libglu1-mesa:i386                                                9.0.0-2.1build1                                     i386         Mesa OpenGL utility library (GLU)
ii  libglx-mesa0:i386                                                19.1.2-0~b~padoka0                                  i386         free implementation of the OpenGL API -- GLX vendor library
ii  libosmesa6:i386                                                  19.1.2-0~b~padoka0                                  i386         Mesa Off-screen rendering extension
rc  libtxc-dxtn-s2tc0:i386                                           0~git20131104-1.1                                   i386         Texture compression library for Mesa
ii  libwayland-egl1-mesa:i386                                        19.1.2-0~b~padoka0                                  i386         transitional dummy package
ii  python3-xkit                                                     0.5.0ubuntu2                                        all          library for the manipulation of xorg.conf files (Python 3)
ii  xorg-docs-core                                                   1:1.7.1-1.1                                         all          Core documentation for the X.org X Window System
ii  xorg-sgml-doctools                                               1:1.11-1                                            all          Common tools for building X.Org SGML documentation

$ sudo lshw -C video
  *-display                 
       description: VGA compatible controller
       product: Ellesmere [Radeon RX 470/480/570/570X/580/580X/590]
       vendor: Advanced Micro Devices, Inc. [AMD/ATI]
       physical id: 0
       bus info: pci@0000:07:00.0
       version: ef
       width: 64 bits
       clock: 33MHz
       capabilities: pm pciexpress msi vga_controller bus_master cap_list rom
       configuration: driver=amdgpu latency=0
       resources: irq:63 memory:c0000000-cfffffff memory:d0000000-d01fffff ioport:c000(size=256) memory:fe600000-fe63ffff memory:c0000-dffff


My monitors are 3-fold atm (normally only run with 2x)
2xDVI + 1xDP

$ xrandr | grep normal
DisplayPort-0 connected 1680x1050+3840+25 (normal left inverted right x axis y axis) 474mm x 296mm
HDMI-A-0 disconnected (normal left inverted right x axis y axis)
DVI-D-0 connected 1920x1080+1920+25 (normal left inverted right x axis y axis) 521mm x 293mm
DVI-D-1 connected primary 1920x1080+0+0 (normal left inverted right x axis y axis) 521mm x 293mm


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.