Bug 93287 - GPU HANG: ecode 0:0x9f47f9fd, in chrome [10834], reason: Ring hung, action: reset
Summary: GPU HANG: ecode 0:0x9f47f9fd, in chrome [10834], reason: Ring hung, action: r...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-12-07 22:53 UTC by Georg Faerber
Modified: 2016-09-20 14:44 UTC (History)
2 users (show)

See Also:
i915 platform: I965GM
i915 features: GPU hang


Attachments
GPU crash dump /sys/class/drm/card0/error (846.50 KB, text/plain)
2015-12-07 22:53 UTC, Georg Faerber
no flags Details

Description Georg Faerber 2015-12-07 22:53:16 UTC
Created attachment 120400 [details]
GPU crash dump /sys/class/drm/card0/error

dmesg just told me:
[  675.816055] [drm] stuck on render ring
[  675.817138] [drm] GPU HANG: ecode 0:0x9f47f9fd, in chrome [10834], reason: Ring hung, action: reset
[  675.817140] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[  675.817141] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[  675.817143] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[  675.817144] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[  675.817145] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[  675.817288] [drm:i915_reset] *ERROR* Failed to reset chip: -19
[  676.055584] chrome[12349]: segfault at 1f8 ip 00007fa272c4ae4f sp 00007ffc1dff9240 error 4 in i965_dri.so[7fa2728f5000+51e000]
[  676.150304] chrome[12357]: segfault at 1f8 ip 00007f138a913e4f sp 00007ffe133a63f0 error 4 in i965_dri.so[7f138a5be000+51e000]

The mentioned gpu crash dump is attached.

Involved software:
# uname -a
Linux debian.local 3.16.0-0.bpo.4-amd64 #1 SMP Debian 3.16.7-ckt11-1+deb8u6~bpo70+1 (2015-11-11) x86_64 GNU/Linux

# cat /etc/debian_version 
7.9

# dpkg -l | grep intel
ii  libdrm-intel1:amd64                          2.4.58-2
ii  xserver-xorg-video-intel                     2:2.21.15-2+b2

# dpkg -l | grep chrome
ii  google-chrome-stable                         47.0.2526.73-1

If I'm able to debug or you need further information, please tell me.

Thanks for your work and all the best,
Georg
Comment 1 Georg Faerber 2015-12-07 22:55:05 UTC
Involved hardware:
IBM Thinkpad T60

# lspci
00:00.0 Host bridge: Intel Corporation Mobile PM965/GM965/GL960 Memory Controller Hub (rev 0c)
00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (primary) (rev 0c)
00:02.1 Display controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (secondary) (rev 0c)
00:19.0 Ethernet controller: Intel Corporation 82566MM Gigabit Network Connection (rev 03)
00:1a.0 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #4 (rev 03)
00:1a.1 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #5 (rev 03)
00:1a.7 USB controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #2 (rev 03)
00:1b.0 Audio device: Intel Corporation 82801H (ICH8 Family) HD Audio Controller (rev 03)
00:1c.0 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 2 (rev 03)
00:1c.2 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 3 (rev 03)
00:1d.0 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #1 (rev 03)
00:1d.1 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #2 (rev 03)
00:1d.2 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #3 (rev 03)
00:1d.7 USB controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #1 (rev 03)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev f3)
00:1f.0 ISA bridge: Intel Corporation 82801HEM (ICH8M-E) LPC Interface Controller (rev 03)
00:1f.1 IDE interface: Intel Corporation 82801HM/HEM (ICH8M/ICH8M-E) IDE Controller (rev 03)
00:1f.2 SATA controller: Intel Corporation 82801HM/HEM (ICH8M/ICH8M-E) SATA Controller [AHCI mode] (rev 03)
00:1f.3 SMBus: Intel Corporation 82801H (ICH8 Family) SMBus Controller (rev 03)
03:00.0 Network controller: Qualcomm Atheros AR928X Wireless Network Adapter (PCI-Express) (rev 01)
15:00.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev b6)
Comment 2 yann 2016-09-20 14:44:06 UTC
CONSTANT_BUFFER issue on G965 was already fixed: commit
 f55ded764ce60f87463e33bfa3a32e2c44715581 which shipped as part of Mesa 10.6.0 in June 2015. Please update your driver.

Details: from this error dump, hung is happening in render ring batch with active head at 0x03c10b8c, with 0x60020100 (CONSTANT_BUFFER) as IPEHR.

Batch extract (around 0x03c10b8c):

0x03c10b68:      0x78090005: 3DSTATE_VERTEX_ELEMENTS
0x03c10b6c:      0x04400000:    buffer 0: valid, type 0x0040, src offset 0x0000 bytes
0x03c10b70:      0x11130000:    (X, Y, Z, 1.0), dst offset 0x00 bytes
0x03c10b74:      0x0485000c:    buffer 0: valid, type 0x0085, src offset 0x000c bytes
0x03c10b78:      0x11230004:    (X, Y, 0.0, 1.0), dst offset 0x10 bytes
0x03c10b7c:      0x04d80014:    buffer 0: valid, type 0x00d8, src offset 0x0014 bytes
0x03c10b80:      0x12230008:    (X, 0.0, 0.0, 1.0), dst offset 0x20 bytes
0x03c10b84:      0x60020100: CONSTANT_BUFFER: valid
0x03c10b88:      0x04dd2c44:    offset: 0x04dd2c40, length: 320 bytes
0x03c10b8c:      0x7b009004: 3DPRIMITIVE: tri list random
0x03c10b90:      0x00000006:    vertex count
0x03c10b94:      0x00000000:    start vertex
0x03c10b98:      0x00000001:    instance count
0x03c10b9c:      0x00000000:    start instance
0x03c10ba0:      0x00000000:    index bias


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.