Bug 94499 - GPU Hang
Summary: GPU Hang
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: medium critical
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-03-11 16:35 UTC by emas80spam
Modified: 2016-10-11 07:32 UTC (History)
2 users (show)

See Also:
i915 platform: SKL
i915 features: GPU hang


Attachments
The /sys/class/drm/card0/error (83.72 KB, text/plain)
2016-03-11 16:36 UTC, emas80spam
no flags Details

Description emas80spam 2016-03-11 16:35:29 UTC
Hi, this keeps happening very very often, I don't understand to what it can be correlated, it is a brand-new installation of Fedora on a brand-new Dell XPS.
While I do anything - programming on Eclipse, browsing Facebook on Chrome, checking Jiira on Firefox, filling this bug report (twice).


Mar 11 16:24:58 winterfell2 kernel: ------------[ cut here ]------------
Mar 11 16:24:58 winterfell2 kernel: WARNING: CPU: 2 PID: 225 at drivers/gpu/drm/i915/intel_display.c:11289 intel_mmio_flip_work_func+0x387/0x3d0 [i915]()
Mar 11 16:24:58 winterfell2 kernel: WARN_ON(__i915_wait_request(mmio_flip->req, mmio_flip->crtc->reset_counter, false, NULL, &mmio_flip->i915->rps.mmioflips))
Mar 11 16:24:58 winterfell2 kernel: Modules linked in:
Mar 11 16:24:58 winterfell2 kernel:  rfcomm fuse cmac xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ip
Mar 11 16:24:58 winterfell2 kernel:  irqbypass crct10dif_pclmul snd_compress snd_hda_codec_generic crc32_pclmul dell_laptop snd_pcm_dmaengine crc32c_intel ac97_bus brcmfmac dcdbas dw_dmac_core brcmutil snd_hda_i
Mar 11 16:24:58 winterfell2 kernel:  sunrpc i915 rtsx_pci_sdmmc mmc_core i2c_algo_bit drm_kms_helper drm nvme serio_raw rtsx_pci i2c_hid video fjes
Mar 11 16:24:58 winterfell2 kernel: CPU: 2 PID: 225 Comm: kworker/2:3 Tainted: G        W       4.4.3-300.fc23.x86_64 #1
Mar 11 16:24:58 winterfell2 kernel: Hardware name: Dell Inc. XPS 13 9350/0JXC1H, BIOS 1.1.9 12/18/2015
Mar 11 16:24:58 winterfell2 kernel: Workqueue: events intel_mmio_flip_work_func [i915]
Mar 11 16:24:58 winterfell2 kernel:  0000000000000286 000000003e09c6c1 ffff880273fd7d20 ffffffff813b4b6e
Mar 11 16:24:58 winterfell2 kernel:  ffff880273fd7d68 ffffffffa01f2de8 ffff880273fd7d58 ffffffff810a40f2
Mar 11 16:24:58 winterfell2 kernel:  ffff880237464140 ffff880280d16600 ffff880280d1b000 0000000000000080
Mar 11 16:24:58 winterfell2 kernel: Call Trace:
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff813b4b6e>] dump_stack+0x63/0x85
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810a40f2>] warn_slowpath_common+0x82/0xc0
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810a418c>] warn_slowpath_fmt+0x5c/0x80
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffffa018c7a7>] intel_mmio_flip_work_func+0x387/0x3d0 [i915]
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810bc576>] process_one_work+0x156/0x430
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810bc89e>] worker_thread+0x4e/0x450
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff8179a985>] ? __schedule+0x3a5/0xa00
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810bc850>] ? process_one_work+0x430/0x430
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810bc850>] ? process_one_work+0x430/0x430
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810c2628>] kthread+0xd8/0xf0
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810c2550>] ? kthread_worker_fn+0x160/0x160
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff8179f4cf>] ret_from_fork+0x3f/0x70
Mar 11 16:24:58 winterfell2 kernel:  [<ffffffff810c2550>] ? kthread_worker_fn+0x160/0x160
Mar 11 16:24:58 winterfell2 kernel: ---[ end trace af498219330679d5 ]---
Mar 11 16:24:58 winterfell2 kernel: drm/i915: Resetting chip after gpu hang
Mar 11 16:24:58 winterfell2 kernel: ------------[ cut here ]------------
  

Mar 11 16:27:34 winterfell2 kernel: ------------[ cut here ]------------
Mar 11 16:27:34 winterfell2 kernel: WARNING: CPU: 2 PID: 120 at drivers/gpu/drm/i915/intel_lrc.c:702 intel_logical_ring_begin+0x1ab/0x250 [i915]()
Mar 11 16:27:34 winterfell2 kernel: WARN_ON(&target->list == &ring->request_list)
Mar 11 16:27:34 winterfell2 kernel: Modules linked in:
Mar 11 16:27:34 winterfell2 kernel:  rfcomm fuse cmac xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ip
Mar 11 16:27:34 winterfell2 kernel:  irqbypass crct10dif_pclmul snd_compress snd_hda_codec_generic crc32_pclmul dell_laptop snd_pcm_dmaengine crc32c_intel ac97_bus brcmfmac dcdbas dw_dmac_core brcmutil snd_hda_i
Mar 11 16:27:34 winterfell2 kernel:  sunrpc i915 rtsx_pci_sdmmc mmc_core i2c_algo_bit drm_kms_helper drm nvme serio_raw rtsx_pci i2c_hid video fjes
Mar 11 16:27:34 winterfell2 kernel: CPU: 2 PID: 120 Comm: kworker/u8:8 Tainted: G        W       4.4.3-300.fc23.x86_64 #1
Mar 11 16:27:34 winterfell2 kernel: Hardware name: Dell Inc. XPS 13 9350/0JXC1H, BIOS 1.1.9 12/18/2015
Mar 11 16:27:34 winterfell2 kernel: Workqueue: i915-hangcheck i915_hangcheck_elapsed [i915]
Mar 11 16:27:34 winterfell2 kernel:  0000000000000286 00000000e1b6df88 ffff880273c7b9e0 ffffffff813b4b6e
Mar 11 16:27:34 winterfell2 kernel:  ffff880273c7ba28 ffffffffa01f0c70 ffff880273c7ba18 ffffffff810a40f2
Mar 11 16:27:34 winterfell2 kernel:  ffff880273f64780 ffff880273be2228 0000000000000004 ffff8801ebfe0f00
Mar 11 16:27:34 winterfell2 kernel: Call Trace:
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff813b4b6e>] dump_stack+0x63/0x85
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810a40f2>] warn_slowpath_common+0x82/0xc0
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810a418c>] warn_slowpath_fmt+0x5c/0x80
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa0161a2b>] intel_logical_ring_begin+0x1ab/0x250 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa0161b5f>] gen8_emit_bb_start+0x8f/0x290 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa015578c>] ? i915_gem_render_state_prepare+0x33c/0x3d0 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa016240c>] gen8_init_rcs_context+0x1bc/0x290 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa0140818>] i915_gem_context_enable+0x28/0x50 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa0153199>] i915_gem_init_hw+0x1a9/0x4f0 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa0116cb0>] i915_reset+0x80/0x160 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa011b4ba>] i915_reset_and_wakeup+0xea/0x170 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa011fcae>] i915_handle_error+0xce/0x630 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810fac77>] ? vprintk_emit+0x2d7/0x520
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810fb049>] ? vprintk_default+0x29/0x40
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffffa01204c5>] i915_hangcheck_elapsed+0x275/0x470 [i915]
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810bc576>] process_one_work+0x156/0x430
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810bc89e>] worker_thread+0x4e/0x450
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810bc850>] ? process_one_work+0x430/0x430
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810c2628>] kthread+0xd8/0xf0
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810c2550>] ? kthread_worker_fn+0x160/0x160
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff8179f4cf>] ret_from_fork+0x3f/0x70
Mar 11 16:27:34 winterfell2 kernel:  [<ffffffff810c2550>] ? kthread_worker_fn+0x160/0x160
Mar 11 16:27:34 winterfell2 kernel: ---[ end trace af498219330679d9 ]---
Comment 1 emas80spam 2016-03-11 16:36:37 UTC
Created attachment 122232 [details]
The /sys/class/drm/card0/error
Comment 2 emas80spam 2016-03-11 16:40:59 UTC
I think this can be related to

https://bugs.freedesktop.org/show_bug.cgi?id=94161

it is probably a duplicate of that bug.
Comment 3 yann 2016-09-05 11:04:09 UTC
Please update your kernel (getting benefits of skl w/a added) and confirm that you are not reproducing it.
Comment 4 yann 2016-10-11 07:32:17 UTC
Timeout. Assuming that it is fixed by now. If this is not the case, please re-test with latest kernel & Mesa to see if this issue is still occurring since there were improvements pushed in kernel and Mesa that will benefit to your system.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.