Bug 102853

Summary: [SKL] frequent / recurring X crash - GPU HANG: ecode 9:0:0x85dffffb, in Xorg [3674], reason: Hang on rcs0, action: reset
Product: Mesa
Reporter: TimSmall <tim>
Component: Drivers/DRI/i965
Assignee: Intel 3D Bugs Mailing List <intel-3d-bugs>
Status: RESOLVED FIXED
QA Contact: Intel 3D Bugs Mailing List <intel-3d-bugs>
Severity: normal
Priority: medium
CC: eblau, intel-gfx-bugs
Version: unspecified
Hardware: x86-64 (AMD64)
OS: Linux (All)
Whiteboard:
i915 platform:
i915 features:
Attachments: contents of /sys/class/drm/card0/error following GPU bug / X restart

Description TimSmall 2017-09-19 08:50:27 UTC
Created attachment 134330 [details]
contents of /sys/class/drm/card0/error following GPU bug / X restart

Frequent / recurring X crash, contents of /sys/class/drm/card0/error attached.

dmesg content below...

BOOT_IMAGE=/vmlinuz-4.13.0-rc3-drm-tip+ root=UUID=fdd057bd-d641-4ccc-8693-c91e0aa683f7 ro console=tty0 intel_iommu=on cgroup_enable=memory drm.debug=0xe log_buf_len=1M


Thanks,

Tim.

[67422.337832] [drm:missed_breadcrumb [i915]] rcs0 missed breadcrumb at intel_breadcrumbs_hangcheck+0x5a/0x80 [i915], irq posted? yes, current seqno=2c1489, last=2c1490
[67426.377790] [drm] GPU HANG: ecode 9:0:0x85dffffb, in Xorg [3674], reason: Hang on rcs0, action: reset
[67426.377793] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[67426.377793] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[67426.377794] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[67426.377795] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[67426.377795] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[67426.377800] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[67426.377911] [drm:i915_gem_reset_engine [i915]] context Xorg[3674]/1 marked guilty (score 10) banned? no
[67426.377923] [drm:i915_gem_reset_engine [i915]] resetting rcs0 to restart from tail of request 0x2c148a
[67426.377952] [drm:gen8_init_common_ring [i915]] Execlists enabled for rcs0
[67426.377968] [drm:gen8_init_common_ring [i915]] Restarting rcs0:0 from 0x2c148f
[67426.377980] [drm:gen8_init_common_ring [i915]] Restarting rcs0:1 from 0x2c1490
[67426.377996] [drm:init_workarounds_ring [i915]] rcs0: Number of context specific w/a: 14
[67430.369767] [drm:missed_breadcrumb [i915]] rcs0 missed breadcrumb at intel_breadcrumbs_hangcheck+0x5a/0x80 [i915], irq posted? yes, current seqno=2c148a, last=2c1490
[67434.401678] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[67434.401832] [drm:i915_gem_reset_engine [i915]] context Xorg[3674]/1 marked guilty (score 19) banned? no
[67434.401885] [drm:i915_gem_reset_engine [i915]] resetting rcs0 to restart from tail of request 0x2c148b
[67434.401970] [drm:gen8_init_common_ring [i915]] Execlists enabled for rcs0
[67434.402022] [drm:gen8_init_common_ring [i915]] Restarting rcs0:0 from 0x2c148f
[67434.402069] [drm:gen8_init_common_ring [i915]] Restarting rcs0:1 from 0x2c1490
[67434.402128] [drm:init_workarounds_ring [i915]] rcs0: Number of context specific w/a: 14
[67438.401737] [drm:missed_breadcrumb [i915]] rcs0 missed breadcrumb at intel_breadcrumbs_hangcheck+0x5a/0x80 [i915], irq posted? yes, current seqno=2c148b, last=2c1490
[67442.401633] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[67442.401836] [drm:i915_gem_reset_engine [i915]] context Xorg[3674]/1 marked guilty (score 28) banned? no
[67442.401891] [drm:i915_gem_reset_engine [i915]] resetting rcs0 to restart from tail of request 0x2c148c
[67442.401981] [drm:gen8_init_common_ring [i915]] Execlists enabled for rcs0
[67442.402033] [drm:gen8_init_common_ring [i915]] Restarting rcs0:0 from 0x2c148f
[67442.402081] [drm:gen8_init_common_ring [i915]] Restarting rcs0:1 from 0x2c1490
[67442.402145] [drm:init_workarounds_ring [i915]] rcs0: Number of context specific w/a: 14
[67446.401670] [drm:missed_breadcrumb [i915]] rcs0 missed breadcrumb at intel_breadcrumbs_hangcheck+0x5a/0x80 [i915], irq posted? yes, current seqno=2c148c, last=2c1490
[67447.677981] usb 1-10: USB disconnect, device number 10
[67450.401608] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[67450.401775] [drm:i915_gem_reset_engine [i915]] context Xorg[3674]/1 marked guilty (score 37) banned? no
[67450.401829] [drm:i915_gem_reset_engine [i915]] resetting rcs0 to restart from tail of request 0x2c148d
[67450.401916] [drm:gen8_init_common_ring [i915]] Execlists enabled for rcs0
[67450.401970] [drm:gen8_init_common_ring [i915]] Restarting rcs0:0 from 0x2c148f
[67450.402018] [drm:gen8_init_common_ring [i915]] Restarting rcs0:1 from 0x2c1490
[67450.402074] [drm:init_workarounds_ring [i915]] rcs0: Number of context specific w/a: 14
[67454.401594] [drm:missed_breadcrumb [i915]] rcs0 missed breadcrumb at intel_breadcrumbs_hangcheck+0x5a/0x80 [i915], irq posted? yes, current seqno=2c148d, last=2c1490
[67458.369593] i915 0000:00:02.0: Resetting rcs0 after gpu hang
[67458.369695] [drm:i915_gem_reset_engine [i915]] context Xorg[3674]/1 marked guilty (score 46) banned? yes
[67458.369750] [drm:i915_gem_reset_engine [i915]] client Xorg[3674]/1 has had 1 context banned
[67458.369796] [drm:i915_gem_reset_engine [i915]] resetting rcs0 to restart from tail of request 0x2c148e
[67458.369882] [drm:gen8_init_common_ring [i915]] Execlists enabled for rcs0
[67458.369933] [drm:gen8_init_common_ring [i915]] Restarting rcs0:0 from 0x2c148f
[67458.369979] [drm:gen8_init_common_ring [i915]] Restarting rcs0:1 from 0x2c1490
[67458.370035] [drm:init_workarounds_ring [i915]] rcs0: Number of context specific w/a: 14
Comment 1 TimSmall 2017-09-19 09:06:06 UTC
Running Debian/Stable

glx-alternative-mesa           0.7.4                       amd64 
i965-va-driver:amd64           1.7.3-1                     amd64 
libegl1-mesa:amd64             13.0.6-1+b2                 amd64 
libegl1-mesa:i386              13.0.6-1+b2                 i386  
libegl1-mesa-drivers:i386      13.0.6-1+b2                 i386  
libgl1-mesa-dev:amd64          13.0.6-1+b2                 amd64 
libgl1-mesa-dri:amd64          13.0.6-1+b2                 amd64 
libgl1-mesa-dri:i386           13.0.6-1+b2                 i386  
libgl1-mesa-glx:amd64          13.0.6-1+b2                 amd64 
libgl1-mesa-glx:i386           13.0.6-1+b2                 i386  
libgl1-mesa-glx-dbgsym:amd64   13.0.6-1+b2                 amd64 
libglapi-mesa:amd64            13.0.6-1+b2                 amd64 
libglapi-mesa:i386             13.0.6-1+b2                 i386  
libgles1-mesa:amd64            13.0.6-1+b2                 amd64 
libgles2-mesa:amd64            13.0.6-1+b2                 amd64 
libglu1-mesa:amd64             9.0.0-2.1                   amd64 
libglu1-mesa:i386              9.0.0-2.1                   i386  
libglu1-mesa-dev:amd64         9.0.0-2.1                   amd64 
libosmesa6:amd64               13.0.6-1+b2                 amd64 
libosmesa6:i386                13.0.6-1+b2                 i386  
libwayland-egl1-mesa:amd64     13.0.6-1+b2                 amd64 
libwayland-egl1-mesa:i386      13.0.6-1+b2                 i386  
mesa-common-dev:amd64          13.0.6-1+b2                 amd64 
mesa-utils                     8.3.0-3                     amd64 
mesa-vdpau-drivers:amd64       13.0.6-1+b2                 amd64 
xorg-docs-core                 1:1.7.1-1                   all   
xorg-sgml-doctools             1:1.11-1                    all   
xserver-xorg-core              2:1.19.2-1+deb9u1           amd64 
xserver-xorg-input-all         1:7.7+19                    amd64 
xserver-xorg-input-libinput    0.23.0-2                    amd64
Comment 2 Elizabeth 2018-03-06 20:55:52 UTC
Hi, Mesa 13 is quite old. Could you update, if possible, to the latest Mesa release, 17.3.6? Also, is there any way to reliably reproduce the hangs? Thank you.
Comment 3 TimSmall 2019-05-20 14:20:10 UTC
Has not recurred recently.
