Summary: | [SKL] GPU HANG (after resuming from hibernation): ecode 9:1:0x4c32ff67, in Xorg [1902], reason: Hang on blitter ring, action: reset (kernel 4.9.0) | |
---|---|---|---
Product: | DRI | Reporter: | unki
Component: | DRM/Intel | Assignee: | Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: | CLOSED DUPLICATE | QA Contact: | Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: | normal | |
Priority: | medium | CC: | intel-gfx-bugs, jonathan, unki
Version: | XOrg git | |
Hardware: | x86-64 (AMD64) | |
OS: | Linux (All) | |
Whiteboard: | | |
i915 platform: | SKL | i915 features: | GPU hang, power/suspend-resume
Description
unki
2017-01-26 06:53:02 UTC
Created attachment 129159 [details]
dmesg snapshot after returning from hibernation
Created attachment 129160 [details]
the gpu error log as requested in the dmesg messages from /sys/class/drm/card0/error
This morning, on resuming from hibernation, the system recovered after several GPU resets, but only for a few minutes. Then it locked up hard while I was working in X. I waited a few minutes to see whether it would recover again, but finally performed a hard reset. I captured dmesg and /sys/class/drm/card0/error in the short time frame where the system was still responding. I'm attaching them to this bug, in case you notice any differences in there.
Created attachment 129244 [details]
dmesg
Created attachment 129245 [details]
/sys/class/drm/card0/error
commit bafb2f7d4755bf1571bd5e9a03b97f3fc4fe69ae
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date: Wed Sep 21 14:51:08 2016 +0100

drm/i915/execlists: Reset RING registers upon resume

There is a disparity in the context image saved to disk and our own bookkeeping - that is we presume the RING_HEAD and RING_TAIL match our stored ce->ring->tail value. However, as we emit WA_TAIL_DWORDS into the ring but may not tell the GPU about them, the GPU may be lagging behind our bookkeeping. Upon hibernation we do not save stolen pages, presuming that their contents are volatile. This means that although we start writing into the ring at tail, the GPU starts executing from its HEAD and there may be some garbage in between and so the GPU promptly hangs upon resume.

Testcase: igt/gem_exec_suspend/basic-S4
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96526
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Reviewed-by: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/20160921135108.29574-3-chris@chris-wilson.co.uk

*** This bug has been marked as a duplicate of bug 96526 ***
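
For readers unfamiliar with the mechanism, the disparity described in the commit message above can be sketched as a toy ring-buffer model. This is only an illustration, not i915 code: the `Ring` class, `scenario` function, ring size, and the `NOOP`/`GARBAGE` values are all hypothetical, and modelling "reset RING registers" as "set the hardware head to the driver's tail" is a simplifying assumption.

```python
# Toy model only: every name and value here is hypothetical, not taken from i915.
from dataclasses import dataclass, field

RING_SIZE = 16
NOOP = 0          # stands in for a valid, harmless command
GARBAGE = 0xDEAD  # stands in for ring contents lost across hibernation


@dataclass
class Ring:
    buf: list = field(default_factory=lambda: [NOOP] * RING_SIZE)
    hw_head: int = 0  # where the GPU resumes executing (models RING_HEAD)
    sw_tail: int = 0  # where the driver writes next (models its stored tail)

    def emit(self, count, tell_gpu=True):
        """Write `count` commands at the driver's tail."""
        for _ in range(count):
            self.buf[self.sw_tail] = NOOP
            self.sw_tail = (self.sw_tail + 1) % RING_SIZE
        if tell_gpu:
            # The GPU was told about the new tail and has caught up.
            self.hw_head = self.sw_tail

    def hibernate(self):
        """Ring memory is not saved, so its contents become garbage."""
        self.buf = [GARBAGE] * RING_SIZE

    def resume(self, reset_registers):
        """Optionally resynchronise the hardware head with the driver's tail."""
        if reset_registers:
            self.hw_head = self.sw_tail

    def run(self):
        """Execute from hw_head up to sw_tail; False models a GPU hang."""
        pos = self.hw_head
        while pos != self.sw_tail:
            if self.buf[pos] == GARBAGE:
                return False
            pos = (pos + 1) % RING_SIZE
        self.hw_head = pos
        return True


def scenario(reset_registers):
    ring = Ring()
    ring.emit(4)                  # normal work; GPU and driver agree
    ring.emit(2, tell_gpu=False)  # extra dwords the GPU was never told about
    ring.hibernate()
    ring.resume(reset_registers)
    ring.emit(3, tell_gpu=False)  # new work written at the driver's tail
    return ring.run()
```

In this model, `scenario(False)` returns `False` because the head lags two slots behind the tail and lands on garbage, while `scenario(True)` returns `True` because execution starts exactly where the driver resumed writing.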