93749 – [IVB] GPU HANG: ecode 0:0x85fcfffd, in IntelHwCodec [3958], reason render ring hung

Status:	CLOSED INVALID

Alias:	None

Product:	DRI
Classification:	Unclassified
Component:	DRM/Intel
Version:	unspecified
Hardware:	x86-64 (AMD64) Linux (All)

Importance:	medium normal
Assignee:	Intel GFX Bugs mailing list
QA Contact:	Intel GFX Bugs mailing list

URL:
Whiteboard:
Keywords:

Depends on:
Blocks:

Reported:	2016-01-18 02:20 UTC by Aslan Xie
Modified:	2017-04-11 12:28 UTC
CC List:	1 user

See Also:
i915 platform:	BYT
i915 features:	GPU hang

Attachments
/sys/class/drm/card0/error (2.09 MB, text/plain) 2016-01-18 02:20 UTC, Aslan Xie	no flags

Description Aslan Xie 2016-01-18 02:20:01 UTC

Created attachment 121100
/sys/class/drm/card0/error

The GPU hung on Android 5.1 (Linux kernel 3.14.55). Here is the serial port log:

[  306.037014] atomisp-css2400b0_v21 0000:00:03.0: DFS target freq is rejected by HW.
[  306.181557] atomisp-css2400b0_v21 0000:00:03.0: atomisp_isr:no subdev.event:8192
[  306.203808] atomisp-css2400b0_v21 0000:00:03.0: stop stream timeout.
[  306.268207] atomisp-css2400b0_v21 0000:00:03.0: stop stream timeout.
[  306.421314] atomisp-css2400b0_v21 0000:00:03.0: DFS target freq is rejected by HW.
[  306.442146] compat_ioctl32: unknown ioctl '>', dir=1, #0 (0x40043e00)
[  306.509024] atomisp-css2400b0_v21 0000:00:03.0: DFS target freq is rejected by HW.
[  335.128824] CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)
[  335.128829] CPU1: Core temperature above threshold, cpu clock throttled (total events = 1)
[  335.128855] CPU2: Core temperature above threshold, cpu clock throttled (total events = 1)
[  335.128900] CPU3: Core temperature above threshold, cpu clock throttled (total events = 1)
[  336.195246] CPU0: Core temperature/speed normal
[  336.195257] CPU1: Core temperature/speed normal
[  336.195273] CPU2: Core temperature/speed normal
[  336.195283] CPU3: Core temperature/speed normal
[  491.933138] fence timeout on [ffff880070697440] after 2000ms
[  492.798333] fence timeout on [ffff8800590ba300] after 3000ms
[  495.797511] [drm] GPU HANG: ecode 0:0x85fcfffd, in IntelHwCodec [3958], reason render ring hung
[  495.807834] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[  495.818341] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[  495.828461] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[  495.839385] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[  495.849541] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[  495.856910] i915_gem_wedged() (0) intr 0
[  495.861439] [drm:intel_update_plane] *ERROR* pin and fence of fb failed with -5
[  495.861466] i915_gem_wedged() (0) intr 1
[  495.861468] i915_gem_wedged() (0) intr 1
[  495.861470] i915_gem_wedged() (0) intr 1
[  495.861505] i915_gem_wedged() (0) intr 1
[  495.861507] i915_gem_wedged() (0) intr 1
[  495.861509] i915_gem_wedged() (0) intr 1
[  495.861545] i915_gem_wedged() (0) intr 1
[  495.861546] i915_gem_wedged() (0) intr 1
[  495.861548] i915_gem_wedged() (0) intr 1
[  495.910328] [drm:intel_set_disp_plane_update] *ERROR* drm_mode_setplane failed
[  516.819791] [drm] GPU HANG: ecode 0:0x85fcfffd, in IntelHwCodec [3958], reason render ring hung
[  516.829971] i915_gem_wedged: 4 callbacks suppressed
[  516.835531] i915_gem_wedged() (0) intr 1
[  516.840095] i915_gem_wedged() (0) intr 1
[  516.840105] i915_gem_wedged() (0) intr 1
[  516.840108] i915_gem_wedged() (0) intr 1
[  516.840111] i915_gem_wedged() (0) intr 1
[  516.857812] i915_gem_wedged() (0) intr 1
[  516.858779] i915_gem_wedged() (0) intr 1
[  516.858783] i915_gem_wedged() (0) intr 1
[  516.858787] i915_gem_wedged() (0) intr 1
[  516.875483] i915_gem_wedged() (0) intr 1
[  601.928031] mce: [Hardware Error]: Machine check events logged
[  643.194217] CPU0: Core temperature above threshold, cpu clock throttled (total events = 70)
[  643.194221] CPU1: Core temperature above threshold, cpu clock throttled (total events = 70)
[  643.194245] CPU2: Core temperature above threshold, cpu clock throttled (total events = 70)
[  643.194280] CPU3: Core temperature above threshold, cpu clock throttled (total events = 70)
[  645.793067] CPU0: Core temperature/speed normal
[  645.793495] CPU1: Core temperature/speed normal
[  645.793509] CPU3: Core temperature/speed normal
[  645.793519] CPU2: Core temperature/speed normal
[  752.141518] mce: [Hardware Error]: Machine check events logged
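
For triage, the hang entries in a log like the one above can be extracted with a short script. This is only a minimal sketch: the pattern is derived solely from the two "[drm] GPU HANG" lines in this log, other kernel versions may format the message differently, and the name find_gpu_hangs is a hypothetical helper, not part of any i915 tooling.

```python
import re

# Pattern assumed from the "[drm] GPU HANG" lines in this report only.
HANG_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+\[drm\] GPU HANG: "
    r"ecode (?P<ecode>[^,\s]+), in (?P<proc>\S+) \[(?P<pid>\d+)\], "
    r"reason (?P<reason>.+)$"
)


def find_gpu_hangs(log_text):
    """Return one dict per GPU HANG line found in a kernel log (hypothetical helper)."""
    hangs = []
    for line in log_text.splitlines():
        match = HANG_RE.search(line)
        if match is None:
            continue  # not a GPU HANG line
        hangs.append({
            "timestamp": float(match.group("ts")),  # seconds since boot
            "ecode": match.group("ecode"),
            "process": match.group("proc"),
            "pid": int(match.group("pid")),
            "reason": match.group("reason").strip(),
        })
    return hangs


if __name__ == "__main__":
    sample = (
        "[  495.797511] [drm] GPU HANG: ecode 0:0x85fcfffd, in IntelHwCodec [3958], "
        "reason render ring hung\n"
        "[  495.856910] i915_gem_wedged() (0) intr 0\n"
    )
    for hang in find_gpu_hangs(sample):
        print(hang)
```

Run against the full serial log, this would report both hang entries (at 495.797511 and 516.819791), which share the same ecode and process.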

Comment 1 cprigent 2016-03-04 15:01:27 UTC

PCI ID: 0x0f31
Name: Atom Processor Z36xxx/Z37xxx Series Graphics & Display
ValleyView Gen7

Comment 2 yann 2017-03-16 13:25:22 UTC

We seem to have neglected the bug a bit, apologies.

Aslan Xie, improvements have since been pushed to the kernel that should benefit your system. Please re-test with the latest kernel. If you can reproduce the hang, mark the bug as REOPENED and attach a fresh GPU error dump and kernel log; if you cannot reproduce it, mark it as RESOLVED/*.
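
When re-testing, the fresh error dump has to be saved before it is lost (for example, on reboot). A minimal sketch of how that could be done, assuming the dump is still exposed at /sys/class/drm/card0/error as in the original log and that the script has permission to read it; save_error_dump and the output file name are hypothetical:

```python
import pathlib

# Path taken from the kernel message in this report; newer kernels may differ.
ERROR_STATE = "/sys/class/drm/card0/error"


def save_error_dump(out_dir, src=ERROR_STATE):
    """Copy the GPU error state into out_dir and return the new file's path."""
    out_dir = pathlib.Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    dest = out_dir / "gpu_error_dump.txt"  # hypothetical file name
    # Read until EOF rather than relying on the size reported by sysfs.
    dest.write_bytes(pathlib.Path(src).read_bytes())
    return dest


if __name__ == "__main__":
    print("saved", save_error_dump("gpu_hang_report"))
```

The kernel log would still need to be captured separately (for example from the serial console, as in the original description) and attached alongside the dump.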
