Bug 103405 - [KBL] drm/i915: Resetting chip after gpu hang, RC6 on, TF2 segfault
Summary: [KBL] drm/i915: Resetting chip after gpu hang, RC6 on, TF2 segfault
Status: RESOLVED FIXED
Alias: None
Product: Mesa
Classification: Unclassified
Component: Drivers/DRI/i965 (show other bugs)
Version: 17.2
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel 3D Bugs Mailing List
QA Contact: Intel 3D Bugs Mailing List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-10-22 17:46 UTC by Robert
Modified: 2017-12-01 18:22 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments

Description Robert 2017-10-22 17:46:19 UTC
Team Fortress 2 is crashing quite often.  All it takes is a few minutes of play or a map switch/load.  This is what is being reported in dmesg:

[20194.868764] drm/i915: Resetting chip after gpu hang
[20194.868965] [drm] RC6 on
[20206.868784] drm/i915: Resetting chip after gpu hang
[20206.868919] [drm] RC6 on
[20219.860883] drm/i915: Resetting chip after gpu hang
[20219.861028] [drm] RC6 on
[20229.876940] drm/i915: Resetting chip after gpu hang
[20229.877087] [drm] RC6 on
[20239.861012] drm/i915: Resetting chip after gpu hang
[20239.861162] [drm] RC6 on
[20240.110535] MatQueue0[7395]: segfault at fffffffc ip 00000000dbade0f8 sp 00000000c95cba10 error 4 in client.so[dab98000+2041000]

I checked /sys/class/drm/card0/error and the last error report in that file does not appear to match this set of crashes.  The date was current, but the time was a few hours earlier.  dmesg consistently reported the same error with each crash.

Dell XPS 13 9360 DE
Ubuntu 17.10

[tag] [reply] [−] Description Robert 2017-08-27 18:46:18 UTC
Created attachment 133817 [details]
GPU dump file, CSGO dump file and dmesg output

CSGO crashed after playing ~2 hours in and out of matches.  The following was reported in dmesg:

[ 7987.649974] [drm] GPU HANG: ecode 9:0:0x86df7cf9, in csgo_linux64 [4947], reason: Hang on rcs, action: reset
[ 7987.649976] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[ 7987.649978] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[ 7987.649979] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[ 7987.649980] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[ 7987.649981] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[ 7987.650057] drm/i915: Resetting chip after gpu hang
[ 7987.650622] [drm] RC6 on
[ 8001.652386] drm/i915: Resetting chip after gpu hang
[ 8001.652537] [drm] RC6 on
[ 8013.652392] drm/i915: Resetting chip after gpu hang
[ 8013.652531] [drm] RC6 on
[ 8027.636176] drm/i915: Resetting chip after gpu hang
[ 8027.636314] [drm] RC6 on
[ 8038.644153] drm/i915: Resetting chip after gpu hang
[ 8038.644306] [drm] RC6 on
[ 8038.843763] show_signal_msg: 65 callbacks suppressed
[ 8038.843765] csgo_linux64[5008]: segfault at 1338 ip 00007f04bfe3f2a9 sp 00007f0444182710 error 6 in client_client.so[7f04bf1c6000+17cf000]

I've included this as well as the GPU crash dump in the attachment.
[tag] [reply] [−] Comment 1 Robert 2017-08-27 18:53:48 UTC
I'd also like to mention:

Dell XPS 13 9360 DE
Ubuntu(Xubuntu) 17.10 (in development with current updates)
Mesa 17.2.2
Comment 1 Robert 2017-10-22 17:49:52 UTC
UGH, it appears I accidentally copied in text from another error report.  I see no way to edit previous comments :/  Forgive me.  Perhaps I should close this...
Comment 2 Kenneth Graunke 2017-12-01 18:22:11 UTC
This should be fixed by:

commit ee57b15ec764736e2d5360beaef9fb2045ed0f68
Author: Jason Ekstrand <jason.ekstrand@intel.com>
Date:   Wed Nov 29 16:22:42 2017 -0800

    i965: Disable regular fast-clears (CCS_D) on gen9+
    
    This partially reverts commit 3e57e9494c2279580ad6a83ab8c065d01e7e634e
    which caused a bunch of GPU hangs on several Source titles.  To date, we
    have no clue why these hangs are actually happening.  This undoes the
    final effect of 3e57e9494c227 and gets us back to not hanging.  Tested
    with Team Fortress 2.
    
    Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102435
    Fixes: 3e57e9494c2279580ad6a83ab8c065d01e7e634e
    Cc: mesa-stable@lists.freedesktop.org

If not, please reopen.  Thanks for the reports and your patience!


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.