Bug 93749 - [IVB] GPU HANG: ecode 0:0x85fcfffd, in IntelHwCodec [3958], reason render ring hung
Summary: [IVB] GPU HANG: ecode 0:0x85fcfffd, in IntelHwCodec [3958], reason render rin...
Status: CLOSED INVALID
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-01-18 02:20 UTC by Aslan Xie
Modified: 2017-04-11 12:28 UTC (History)
1 user (show)

See Also:
i915 platform: BYT
i915 features: GPU hang


Attachments
/sys/class/drm/card0/error (2.09 MB, text/plain)
2016-01-18 02:20 UTC, Aslan Xie
no flags Details

Description Aslan Xie 2016-01-18 02:20:01 UTC
Created attachment 121100 [details]
/sys/class/drm/card0/error

GPU hung on Android 5.1, Linux kernel 3.14.55, here is the serial port log:

[  306.037014] atomisp-css2400b0_v21 0000:00:03.0: DFS target freq is rejected by HW.
[  306.181557] atomisp-css2400b0_v21 0000:00:03.0: atomisp_isr:no subdev.event:8192
[  306.203808] atomisp-css2400b0_v21 0000:00:03.0: stop stream timeout.
[  306.268207] atomisp-css2400b0_v21 0000:00:03.0: stop stream timeout.
[  306.421314] atomisp-css2400b0_v21 0000:00:03.0: DFS target freq is rejected by HW.
[  306.442146] compat_ioctl32: unknown ioctl '>', dir=1, #0 (0x40043e00)
[  306.509024] atomisp-css2400b0_v21 0000:00:03.0: DFS target freq is rejected by HW.
[  335.128824] CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)
[  335.128829] CPU1: Core temperature above threshold, cpu clock throttled (total events = 1)
[  335.128855] CPU2: Core temperature above threshold, cpu clock throttled (total events = 1)
[  335.128900] CPU3: Core temperature above threshold, cpu clock throttled (total events = 1)
[  336.195246] CPU0: Core temperature/speed normal
[  336.195257] CPU1: Core temperature/speed normal
[  336.195273] CPU2: Core temperature/speed normal
[  336.195283] CPU3: Core temperature/speed normal
[  491.933138] fence timeout on [ffff880070697440] after 2000ms
[  492.798333] fence timeout on [ffff8800590ba300] after 3000ms
[  495.797511] [drm] GPU HANG: ecode 0:0x85fcfffd, in IntelHwCodec [3958], reason render ring hung
[  495.807834] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[  495.818341] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[  495.828461] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[  495.839385] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[  495.849541] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[  495.856910] i915_gem_wedged() (0) intr 0
[  495.861439] [drm:intel_update_plane] *ERROR* pin and fence of fb failed with -5
[  495.861466] i915_gem_wedged() (0) intr 1
[  495.861468] i915_gem_wedged() (0) intr 1
[  495.861470] i915_gem_wedged() (0) intr 1
[  495.861505] i915_gem_wedged() (0) intr 1
[  495.861507] i915_gem_wedged() (0) intr 1
[  495.861509] i915_gem_wedged() (0) intr 1
[  495.861545] i915_gem_wedged() (0) intr 1
[  495.861546] i915_gem_wedged() (0) intr 1
[  495.861548] i915_gem_wedged() (0) intr 1
[  495.910328] [drm:intel_set_disp_plane_update] *ERROR* drm_mode_setplane failed
[  516.819791] [drm] GPU HANG: ecode 0:0x85fcfffd, in IntelHwCodec [3958], reason render ring hung
[  516.829971] i915_gem_wedged: 4 callbacks suppressed
[  516.835531] i915_gem_wedged() (0) intr 1
[  516.840095] i915_gem_wedged() (0) intr 1
[  516.840105] i915_gem_wedged() (0) intr 1
[  516.840108] i915_gem_wedged() (0) intr 1
[  516.840111] i915_gem_wedged() (0) intr 1
[  516.857812] i915_gem_wedged() (0) intr 1
[  516.858779] i915_gem_wedged() (0) intr 1
[  516.858783] i915_gem_wedged() (0) intr 1
[  516.858787] i915_gem_wedged() (0) intr 1
[  516.875483] i915_gem_wedged() (0) intr 1
[  601.928031] mce: [Hardware Error]: Machine check events logged
[  643.194217] CPU0: Core temperature above threshold, cpu clock throttled (total events = 70)
[  643.194221] CPU1: Core temperature above threshold, cpu clock throttled (total events = 70)
[  643.194245] CPU2: Core temperature above threshold, cpu clock throttled (total events = 70)
[  643.194280] CPU3: Core temperature above threshold, cpu clock throttled (total events = 70)
[  645.793067] CPU0: Core temperature/speed normal
[  645.793495] CPU1: Core temperature/speed normal
[  645.793509] CPU3: Core temperature/speed normal
[  645.793519] CPU2: Core temperature/speed normal
[  752.141518] mce: [Hardware Error]: Machine check events logged
Comment 1 cprigent 2016-03-04 15:01:27 UTC
PCI ID: 0x0f31
Name: Atom Processor Z36xxx/Z37xxx Series Graphics & Display
ValleyView Gen7
Comment 2 yann 2017-03-16 13:25:22 UTC
We seem to have neglected the bug a bit, apologies.

Aslan Xie, since There were improvements pushed in kernel that will benefit to your system, so please re-test with latest kernel and mark as REOPENED if you can reproduce (and attach fresh gpu error dump & kernel log) and RESOLVED/* if you cannot reproduce.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.