Bug 106956 - GPU HANG: ecode 9:0:0x83ff6f77, in totem [4233], reason: No progress on rcs0, action: reset
Summary: GPU HANG: ecode 9:0:0x83ff6f77, in totem [4233], reason: No progress on rcs0,...
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium minor
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: Triaged
Keywords:
Depends on:
Blocks:
 
Reported: 2018-06-18 17:29 UTC by Robert T.
Modified: 2018-08-13 09:46 UTC (History)
1 user (show)

See Also:
i915 platform: SKL
i915 features: GPU hang


Attachments
compressed contents of /sys/class/drm/card0/error (26.21 KB, application/x-bzip)
2018-06-18 17:29 UTC, Robert T.
no flags Details

Description Robert T. 2018-06-18 17:29:20 UTC
Created attachment 140207 [details]
compressed contents of /sys/class/drm/card0/error

Dear maintainer,

My desktop has just frozen and I am filing a bug report as suggested by the error message in dmesg.

This is a fully updated debian 9 + backports machine

root@aspire:~# dmesg |tail -14
[ 2396.810432] [drm] GPU HANG: ecode 9:0:0x83ff6f77, in totem [4233], reason: No progress on rcs0, action: reset
[ 2396.810434] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[ 2396.810434] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[ 2396.810434] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[ 2396.810435] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[ 2396.810435] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[ 2396.810463] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[ 2412.807181] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[ 2428.807013] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[ 2444.806799] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[ 2463.782603] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[ 2663.434961] do_trap: 4 callbacks suppressed
[ 2663.434966] traps: gnome-shell[1603] trap int3 ip:7f7a9a1ec261 sp:7ffdee7c6860 error:0 in libglib-2.0.so.0.5000.3[7f7a9a19c000+112000]
[ 2716.087096] gnome-session-f[5062]: segfault at 0 ip 00007f995a062e19 sp 00007ffd25d4fba0 error 4 in libgtk-3.so.0.2200.11[7f9959d80000+700000]

root@aspire:~# uname -a
Linux aspire 4.16.0-0.bpo.2-amd64 #1 SMP Debian 4.16.12-1~bpo9+1 (2018-06-03) x86_64 GNU/Linux

root@aspire:~# cat /etc/debian_version 
9.4

Thank you,
Robert
Comment 1 Chris Wilson 2018-06-19 07:39:21 UTC
That is no kernel v4.16, that is a backport monstrosity. Can you please try an upstream v4.16 to affirm that is not the cause and to get a clean error state.
Comment 2 Jani Saarinen 2018-06-25 10:10:20 UTC
You could also try using https://cgit.freedesktop.org/drm-tip and send dmesg with drm.debug=0x1e log_buf_len=4M?
Comment 3 Jani Saarinen 2018-08-13 09:46:12 UTC
No feedback in many months, closing as resolved works for me.
Please re-open is still the case after testing latest https://cgit.freedesktop.org/drm-tip and send dmesg with drm.debug=0x1e log_buf_len=4M?


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.