Bug 99884 - Intel Iris GPU Hang
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: XOrg git
Hardware: Other All
Importance: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
Reported: 2017-02-21 11:11 UTC by Colin Walls
Modified: 2017-03-07 14:31 UTC
CC List: 1 user

i915 platform: SKL
i915 features: GPU hang



Description Colin Walls 2017-02-21 11:11:41 UTC
Periodically getting a GPU reset, usually when handling images or video.

This is on an Intel Skull Canyon NUC with an Intel Core i7-6770HQ processor and Intel Iris GPU. The log excerpt below is in reverse chronological order (newest entries first).

Jan 25 16:35:25 linux kernel: [drm] RC6 on
Jan 25 16:35:24 linux kernel: drm/i915: Resetting chip after gpu hang
Jan 25 16:35:24 linux kernel: ---[ end trace 414d7eec39f88711 ]---
Jan 25 16:35:24 linux kernel:  [<ffffffff8109d240>] ? kthread_park+0x50/0x50
Jan 25 16:35:24 linux kernel: Leftover inexact backtrace:
Jan 25 16:35:24 linux kernel: 
Jan 25 16:35:24 linux kernel: DWARF2 unwinder stuck at ret_from_fork+0x3f/0x70
Jan 25 16:35:24 linux kernel:  [<ffffffff8160ac8f>] ret_from_fork+0x3f/0x70
Jan 25 16:35:24 linux kernel:  [<ffffffff8109d308>] kthread+0xc8/0xe0
Jan 25 16:35:24 linux kernel:  [<ffffffff81097d26>] worker_thread+0x116/0x4b0
Jan 25 16:35:24 linux kernel:  [<ffffffff810971e5>] process_one_work+0x155/0x440
Jan 25 16:35:24 linux kernel:  [<ffffffffa032bb02>] intel_mmio_flip_work_func+0x382/0x3d0 [i915]
Jan 25 16:35:24 linux kernel:  [<ffffffff8107e8bc>] warn_slowpath_fmt+0x4c/0x50
Jan 25 16:35:24 linux kernel:  [<ffffffff8107e841>] warn_slowpath_common+0x81/0xb0
Jan 25 16:35:24 linux kernel:  [<ffffffff81327b17>] dump_stack+0x5c/0x85
Jan 25 16:35:24 linux kernel:  [<ffffffff8101b011>] show_stack+0x21/0x40
Jan 25 16:35:24 linux kernel:  [<ffffffff8101a26a>] show_stack_log_lvl+0xfa/0x180
Jan 25 16:35:24 linux kernel:  [<ffffffff81019ea9>] dump_trace+0x59/0x320
Jan 25 16:35:24 linux kernel: Call Trace:
Jan 25 16:35:24 linux kernel:  ffffffff8107e8bc ffffffffa039f295
Jan 25 16:35:24 linux kernel:  0000000000000000
Jan 25 16:35:24 linux kernel:  ffffe8ffffdc1200
Jan 25 16:35:24 linux kernel:  ffff88082dbe3e40 ffff8805187e7de0 ffff8808bedd5440
Jan 25 16:35:24 linux kernel:  ffffffff8107e841
Jan 25 16:35:24 linux kernel:  0000000000000000 ffffffff81327b17 ffff8805187e7d90 ffffffffa038fdf8
Jan 25 16:35:24 linux kernel: Workqueue: events intel_mmio_flip_work_func [i915]
Jan 25 16:35:24 linux kernel: Hardware name:                  /NUC6i7KYB, BIOS KYSKLi70.86A.0042.2016.0929.1933 09/29/2016
Jan 25 16:35:24 linux kernel: CPU: 7 PID: 6802 Comm: kworker/7:2 Not tainted 4.4.36-8-default #1
Jan 25 16:35:24 linux kernel:  mmc_core libata usbcore usb_common drm video i2c_hid button sg scsi_mod efivarfs autofs4
Jan 25 16:35:24 linux kernel:  fb_sys_fops
Jan 25 16:35:24 linux kernel:  ir_lirc_codec ir_mce_kbd_decoder ir_sharp_decoder lirc_dev btusb ir_xmp_decoder ir_sanyo_decoder ir_sony_decoder btrtl drbg ir_jvc_decoder ir_rc5_decoder ir_rc6_decoder mac80211 snd_timer ansi_cprng joydev snd
Jan 25 16:35:24 linux kernel: Modules linked in: dm_crypt dm_mod uas usb_storage fuse cmac ecb rfcomm nf_log_ipv6 xt_pkttype nf_log_ipv4 nf_log_common xt_LOG xt_limit af_packet snd_hda_codec_hdmi iscsi_ibft iscsi_boot_sysfs ip6t_REJECT nf_r
Jan 25 16:35:24 linux kernel: WARN_ON(__i915_wait_request(mmio_flip->req, mmio_flip->crtc->reset_counter, false, NULL, &mmio_flip->i915->rps.mmioflips))
Jan 25 16:35:24 linux kernel: WARNING: CPU: 7 PID: 6802 at ../drivers/gpu/drm/i915/intel_display.c:11294 intel_mmio_flip_work_func+0x382/0x3d0 [i915]()
Jan 25 16:35:24 linux kernel: ------------[ cut here ]------------
Jan 25 16:35:24 linux kernel: [drm] GPU crash dump saved to /sys/class/drm/card0/error
Jan 25 16:35:24 linux kernel: [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
Jan 25 16:35:24 linux kernel: [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
Jan 25 16:35:24 linux kernel: [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
Jan 25 16:35:24 linux kernel: [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
Jan 25 16:35:24 linux kernel: [drm] GPU HANG: ecode 9:0:0x85df7fff, in kwin_x11 [1982], reason: Ring hung, action: reset
Jan 25 16:35:24 linux kernel: [drm] stuck on render ring

Jan 25 16:34:06 linux kernel: [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
Jan 25 16:34:06 linux kernel: [drm:intel_dp_link_training_channel_equalization [i915]] *ERROR* failed to start channel equalization
Jan 25 16:34:06 linux kernel: [drm:intel_dp_link_training_clock_recovery [i915]] *ERROR* failed to enable link training
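For triage it can help to pull the hang summary out of a long journal. The following is a minimal sketch, assuming the "GPU HANG" line always has the layout seen in the log above; the regular expression and the function name parse_gpu_hang are illustrative and inferred from that single line, not taken from any documented i915 format.

```python
import re

# Field layout inferred from the one "GPU HANG" line in the log above.
# This is an assumption, not a documented kernel message format.
HANG_RE = re.compile(
    r"GPU HANG: ecode (?P<ecode>\S+), in (?P<comm>\S+) \[(?P<pid>\d+)\], "
    r"reason: (?P<reason>[^,]+), action: (?P<action>\S+)"
)


def parse_gpu_hang(line):
    """Return the hang fields as a dict, or None if the line is not a hang summary."""
    match = HANG_RE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["pid"] = int(fields["pid"])
    return fields


sample = (
    "Jan 25 16:35:24 linux kernel: [drm] GPU HANG: ecode 9:0:0x85df7fff, "
    "in kwin_x11 [1982], reason: Ring hung, action: reset"
)
print(parse_gpu_hang(sample))
```

Applied to the line from this report, the sketch identifies kwin_x11 (PID 1982) as the process named in the hang, with reason "Ring hung" and action "reset".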
Comment 1 Chris Wilson 2017-02-21 11:50:08 UTC
Kernel 4.4 is excruciatingly old, and to triage the hang we need the contents of /sys/class/drm/card0/error. Please try a newer kernel (e.g. 4.10) and attach the error state.
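One way to capture the error state for attaching is to copy it to a regular file right after a hang. This is a rough sketch, assuming the path printed in the kernel log above and that the script runs with enough privileges to read it (typically root); the helper name save_error_state and the output file name are hypothetical.

```python
from pathlib import Path

# Path taken from the kernel message in this report; reading it usually needs root.
ERROR_STATE = Path("/sys/class/drm/card0/error")


def save_error_state(dest, source=ERROR_STATE):
    """Copy the GPU error state from source to dest; return the bytes written."""
    # Read the whole content instead of relying on the file size,
    # which sysfs may not report meaningfully.
    data = Path(source).read_bytes()
    Path(dest).write_bytes(data)
    return len(data)
```

For example, calling save_error_state("gpu-error-state.txt") soon after the hang would produce a file that can be attached to the bug. The source argument exists only so the function can be exercised against an ordinary file.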
Comment 2 Colin Walls 2017-02-21 20:17:45 UTC
I have switched to openSUSE Tumbleweed, which has a 4.9.10 kernel.

Will send a report as soon as I have a crash.
Comment 3 Mika Kuoppala 2017-03-07 10:05:34 UTC
Colin, no hangs anymore?
Comment 4 Colin Walls 2017-03-07 10:38:06 UTC
No, nothing since I changed to Tumbleweed with a 4.9 and now 4.10 kernel.

You can close this; if the problem recurs, I will raise a new bug report.
Comment 5 yann 2017-03-07 14:31:41 UTC
(In reply to Colin Walls from comment #4)
> No, nothing since I changed to Tumbleweed with a 4.9 and now 4.10 kernel.
> 
> You can close this, if I have a repeat problem I will raise a new bug report.

Thanks, Colin.

