Bug 101591 - [KBL] GPU HANG in portal2
Summary: [KBL] GPU HANG in portal2
Status: RESOLVED FIXED
Alias: None
Product: Mesa
Classification: Unclassified
Component: Drivers/DRI/i965 (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) All
: medium normal
Assignee: Intel 3D Bugs Mailing List
QA Contact: Intel 3D Bugs Mailing List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-06-26 01:25 UTC by Charles Samuels
Modified: 2017-12-01 18:47 UTC (History)
3 users (show)

See Also:
i915 platform: KBL
i915 features: GPU hang


Attachments
/sys/class/drm/card0/error (56.35 KB, text/plain)
2017-06-26 01:25 UTC, Charles Samuels
Details
dmesg with drm.debug=0x1e (6.14 KB, application/octet-stream)
2017-06-26 19:26 UTC, Charles Samuels
Details
card0-error with drm.debug=0x1e (14.77 KB, application/x-bzip)
2017-06-26 19:27 UTC, Charles Samuels
Details

Description Charles Samuels 2017-06-26 01:25:06 UTC
Created attachment 132242 [details]
/sys/class/drm/card0/error

Easily reproducible when graphics quality settings are at their highest. Happens within a couple minutes of playing Portal 2 (shortly after "going to bed"). Slower when graphics quality settings are reduced.

Linux audax 4.11.0-1-amd64 #1 SMP Debian 4.11.6-1 (2017-06-19) x86_64 GNU/Linux

Linux kernel package linux-image-4.11.0-1-amd64=4.11.6-1 from debian unstable, on an otherwise normal Debian Stretch installation. Kaby Lake, Lenovo T470p, Core i5-7440HQ.

[17384.396333] [drm] GPU HANG: ecode 9:0:0x85dffffb, in portal2_linux [12819], reason: Hang on render ring, action: reset
[17384.396334] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[17384.396334] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[17384.396335] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[17384.396335] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[17384.396335] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[17384.396374] drm/i915: Resetting chip after gpu hang
[17384.396442] [drm] RC6 on
[17384.409186] [drm] GuC firmware load skipped
Comment 1 Jari Tahvanainen 2017-06-26 14:34:19 UTC
Please add drm.debug=0x1e to kernel cmd line and provide dmesg and card0/error as bz2 file. See https://01.org/linuxgraphics/documentation/how-report-bugs
Comment 2 Charles Samuels 2017-06-26 19:26:23 UTC
rebooted and used kernel option drm.debug=0x1e - your debug information is attached.
Comment 3 Charles Samuels 2017-06-26 19:26:52 UTC
Created attachment 132265 [details]
dmesg with drm.debug=0x1e
Comment 4 Charles Samuels 2017-06-26 19:27:15 UTC
Created attachment 132266 [details]
card0-error with drm.debug=0x1e
Comment 5 Robert 2017-10-16 21:44:24 UTC
Duplicate of bug 99561 ??
Comment 6 Kenneth Graunke 2017-12-01 18:47:26 UTC
This may be fixed by:

commit ee57b15ec764736e2d5360beaef9fb2045ed0f68
Author: Jason Ekstrand <jason.ekstrand@intel.com>
Date:   Wed Nov 29 16:22:42 2017 -0800

    i965: Disable regular fast-clears (CCS_D) on gen9+
    
    This partially reverts commit 3e57e9494c2279580ad6a83ab8c065d01e7e634e
    which caused a bunch of GPU hangs on several Source titles.  To date, we
    have no clue why these hangs are actually happening.  This undoes the
    final effect of 3e57e9494c227 and gets us back to not hanging.  Tested
    with Team Fortress 2.
    
    Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102435
    Fixes: 3e57e9494c2279580ad6a83ab8c065d01e7e634e
    Cc: mesa-stable@lists.freedesktop.org

Please reopen if you still experience hangs with Mesa master (or 17.3.0 once it's released, which will be soon).


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.