Bug 95516 - GPU HANG: ecode 7:-1:0x00000000, reason: Ring hung, action: reset
Summary: GPU HANG: ecode 7:-1:0x00000000, reason: Ring hung, action: reset
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-05-21 09:47 UTC by Anthony DeRobertis
Modified: 2017-06-27 15:54 UTC (History)
1 user (show)

See Also:
i915 platform: IVB
i915 features: GPU hang


Attachments
/sys/class/drm/card0/error gzip'd (269.30 KB, application/gzip)
2016-05-21 09:47 UTC, Anthony DeRobertis
no flags Details

Description Anthony DeRobertis 2016-05-21 09:47:27 UTC
Created attachment 123956 [details]
/sys/class/drm/card0/error gzip'd

I've never seen this before, and have no idea how to reproduce. Was attempting to open a new Firefox window, with the system under some I/O load (copying a DVD). Froze up for a bit, then recovered. 

Not sure if this is a useful bug report, but couldn't find any other way to submit the crash dump. I won't object at all if you close it as useless.

Linux Watt 4.4.0-1-amd64 #1 SMP Debian 4.4.6-1 (2016-03-17) x86_64 GNU/Linux
X.Org version: 1.18.2

May 21 05:30:08 Watt kernel: [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... render ring idle
May 21 05:31:08 Watt kernel: [drm] no progress on render ring
May 21 05:31:08 Watt kernel: [drm] GPU HANG: ecode 7:-1:0x00000000, reason: Ring hung, action: reset
May 21 05:31:08 Watt kernel: [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
May 21 05:31:08 Watt kernel: [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
May 21 05:31:08 Watt kernel: [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
May 21 05:31:08 Watt kernel: [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
May 21 05:31:08 Watt kernel: [drm] GPU crash dump saved to /sys/class/drm/card0/error
May 21 05:31:08 Watt kernel: [drm:intel_rcs_ctx_init [i915]] *ERROR* init render state: -11
May 21 05:31:08 Watt kernel: [drm:i915_switch_context [i915]] *ERROR* ring init context: -11
May 21 05:31:08 Watt kernel: drm/i915: Resetting chip after gpu hang
Comment 1 Jari Tahvanainen 2017-03-28 13:25:39 UTC
We seem to have neglected the bug quite a bit, apologies.

Anthony, since there has been quite a lot improvements pushed in kernel that will benefit to your system, please re-test with latest kernel and mark as REOPENED if you can reproduce (and attach kernel log) and RESOLVED/* if you cannot reproduce.
Comment 2 Chris Wilson 2017-04-08 19:27:37 UTC
This looks like it might be an innocent warning where the waiter hasn't been woken before the alarmed fired. E.g.

commit 8d769ea7bc16c34c9dc5143be021e943014c4cd1
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Mon Feb 27 20:58:47 2017 +0000

    drm/i915: Report both waiters and success from intel_engine_wakeup()


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.