Bug 88541

Summary: [SNB] GPU HANG: ecode 1:0xeca7047e, in mpv/vo [12171], reason: Ring hung, action: reset
Product: DRI
Component: DRM/Intel
Status: CLOSED WONTFIX
Severity: normal
Priority: medium
Version: unspecified
Hardware: x86-64 (AMD64)
OS: Linux (All)
Whiteboard:
i915 platform: SNB
i915 features: GPU hang
Reporter: Nelson A. de Oliveira <naoliv>
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
CC: intel-gfx-bugs
Attachments:
/sys/class/drm/card0/error output (flags: none)

Description Nelson A. de Oliveira 2015-01-17 18:41:33 UTC
Created attachment 112400
/sys/class/drm/card0/error output

While watching a movie using mpv I saw that the video got stuck for a while.
In dmesg I could see:

[497786.026819] [drm] stuck on bsd ring
[497786.028369] [drm] GPU HANG: ecode 1:0xeca7047e, in mpv/vo [12171], reason: Ring hung, action: reset
[497786.028371] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[497786.028371] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[497786.028372] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[497786.028373] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[497786.028374] [drm] GPU crash dump saved to /sys/class/drm/card0/error


/sys/class/drm/card0/error output is attached.
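
For triage, the fields of the "GPU HANG" line above can be pulled out of a saved kernel log programmatically. The following is only a minimal sketch: the regular expression is modeled on the single message quoted in this report, so other kernel versions may word it differently, and the names `parse_gpu_hang` and `find_gpu_hangs` are hypothetical helpers, not part of any i915 tooling.

```python
import re

# Pattern modeled on the one "GPU HANG" line quoted in this report.
# Assumption: other kernel versions may format the message differently.
GPU_HANG_RE = re.compile(
    r"\[\s*(?P<timestamp>\d+\.\d+)\]\s+\[drm\] GPU HANG: "
    r"ecode (?P<ecode>[0-9a-fx:]+), "
    r"in (?P<process>.+?) \[(?P<pid>\d+)\], "
    r"reason: (?P<reason>.+?), "
    r"action: (?P<action>\w+)"
)


def parse_gpu_hang(line):
    """Return the hang fields of one dmesg line as a dict, or None if it is not a hang line."""
    match = GPU_HANG_RE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["timestamp"] = float(fields["timestamp"])
    fields["pid"] = int(fields["pid"])
    return fields


def find_gpu_hangs(log_text):
    """Collect every hang record found in a multi-line kernel log."""
    hangs = []
    for line in log_text.splitlines():
        record = parse_gpu_hang(line)
        if record is not None:
            hangs.append(record)
    return hangs
```

Running `find_gpu_hangs` over the text of a saved dmesg dump yields one record per hang, which makes it easier to compare the ecode and the hung process across repeated reports.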

In dmesg I also see a lot (really a lot) of this:

[465595.963234] [drm] Enabling RC6 states: RC6 on, RC6p off, RC6pp off
[465634.980054] [drm] Enabling RC6 states: RC6 on, RC6p off, RC6pp off
[465655.977984] [drm] Enabling RC6 states: RC6 on, RC6p off, RC6pp off
[465695.018761] [drm] Enabling RC6 states: RC6 on, RC6p off, RC6pp off

There is also another error in dmesg:

[464146.236658] ------------[ cut here ]------------
[464146.236718] WARNING: CPU: 6 PID: 988 at /build/linux-CMiYW9/linux-3.16.7-ckt2/drivers/gpu/drm/i915/intel_uncore.c:47 gen6_read32+0x30/0x120 [i915]()
[464146.236722] Device suspended
[464146.236724] Modules linked in: pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs libcrc32c dm_mod cpuid nls_utf8 nls_cp437 vfat fat usb_storage msr cpufreq_stats binfmt_misc uvcvideo videobuf2_vmalloc videobuf2_memops x86_pkg_temp_thermal intel_powerclamp intel_rapl kvm_intel ecb snd_hda_codec_hdmi videobuf2_core v4l2_common videodev media kvm snd_hda_codec_realtek snd_hda_codec_generic btusb bluetooth 6lowpan_iphc arc4 crc32_pclmul iwldvm ghash_clmulni_intel snd_hda_intel joydev aesni_intel snd_hda_controller mac80211 snd_hda_codec aes_x86_64 lrw snd_hwdep i915 dell_wmi iTCO_wdt gf128mul glue_helper snd_pcm iwlwifi snd_timer iTCO_vendor_support cfg80211 snd ablk_helper cryptd sparse_keymap soundcore dell_laptop evdev rfkill
[464146.236807]  psmouse dcdbas lpc_ich serio_raw pcspkr drm_kms_helper mei_me shpchp mfd_core drm i2c_i801 i2c_algo_bit mei i2c_core video processor ac wmi dell_smo8800 battery button coretemp loop fuse autofs4 ext4 crc16 mbcache jbd2 sg sd_mod sr_mod crc_t10dif cdrom crct10dif_generic hid_generic usbhid hid crct10dif_pclmul ahci crct10dif_common libahci crc32c_intel libata ehci_pci xhci_hcd ehci_hcd scsi_mod r8169 mii usbcore usb_common thermal thermal_sys
[464146.236873] CPU: 6 PID: 988 Comm: Xorg Tainted: G        W  O  3.16.0-4-amd64 #1 Debian 3.16.7-ckt2-1
[464146.236877] Hardware name: Dell Inc.          Dell System XPS L502X/0MY6GN, BIOS A07 10/20/2011
[464146.236881]  0000000000000009 ffffffff81507263 ffff8800b6a07b30 ffffffff81065847
[464146.236888]  ffff880235270000 ffff8800b6a07b80 ffff880235270080 ffff88023335a000
[464146.236893]  0000000000000001 ffffffff810658ac ffffffffa04f919e 0000000000000018
[464146.236899] Call Trace:
[464146.236912]  [<ffffffff81507263>] ? dump_stack+0x41/0x51
[464146.236922]  [<ffffffff81065847>] ? warn_slowpath_common+0x77/0x90
[464146.236929]  [<ffffffff810658ac>] ? warn_slowpath_fmt+0x4c/0x50
[464146.236957]  [<ffffffffa048b580>] ? gen6_read32+0x30/0x120 [i915]
[464146.236983]  [<ffffffffa04c6bd5>] ? intel_lvds_compute_config+0x55/0x150 [i915]
[464146.237009]  [<ffffffffa04a1467>] ? __intel_set_mode+0xb87/0x1560 [i915]
[464146.237035]  [<ffffffffa04a4442>] ? intel_set_mode+0x12/0x30 [i915]
[464146.237059]  [<ffffffffa04a534f>] ? intel_crtc_set_config+0x8cf/0xd50 [i915]
[464146.237082]  [<ffffffffa02f6d11>] ? drm_mode_set_config_internal+0x61/0xe0 [drm]
[464146.237103]  [<ffffffffa02fa4e5>] ? drm_mode_setcrtc+0xd5/0x570 [drm]
[464146.237121]  [<ffffffffa02eb8b7>] ? drm_ioctl+0x1c7/0x5b0 [drm]
[464146.237132]  [<ffffffff811b7d2f>] ? do_vfs_ioctl+0x2cf/0x4b0
[464146.237138]  [<ffffffff811a7e0d>] ? __sb_end_write+0x2d/0x70
[464146.237146]  [<ffffffff811a5b92>] ? vfs_write+0x172/0x1f0
[464146.237152]  [<ffffffff811b7f91>] ? SyS_ioctl+0x81/0xa0
[464146.237160]  [<ffffffff8150d32d>] ? system_call_fast_compare_end+0x10/0x15
[464146.237164] ---[ end trace a7e77a5b8bb50e68 ]---


I don't know if they are all related.

What other kind of information do you need, please?
Comment 1 Chris Wilson 2015-01-17 21:14:17 UTC

*** This bug has been marked as a duplicate of bug 54266 ***
Comment 2 Chris Wilson 2015-01-17 21:14:25 UTC

*** This bug has been marked as a duplicate of bug 54226 ***
Comment 3 Chris Wilson 2015-01-17 21:16:30 UTC
Too fast. It hung on BSD instead.
Comment 4 yann 2017-02-24 08:13:03 UTC
We seem to have neglected the bug a bit, apologies.

Nelson A. de Oliveira, improvements have since been pushed to the kernel that should benefit your system, so please re-test with the latest kernel. Mark the bug as REOPENED if you can reproduce it (and attach a fresh GPU error dump and kernel log), or as RESOLVED/* if you cannot.
Comment 5 yann 2017-03-03 16:41:40 UTC
(In reply to yann from comment #4)
> We seem to have neglected the bug a bit, apologies.
> 
> Nelson A. de Oliveira, since There were improvements pushed in kernel that
> will benefit to your system, so please re-test with latest kernel and mark
> as REOPENED if you can reproduce (and attach fresh gpu error dump & kernel
> log) and RESOLVED/* if you cannot reproduce.

Timeout. Assuming that this is no longer occurring. If the issue happens again, re-test with the latest kernel and REOPEN the bug if you can reproduce it (and attach a fresh GPU error dump and kernel log).
