Bug 95143 - Skylake complete freeze from time to time
Summary: Skylake complete freeze from time to time
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: Other All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-04-26 02:35 UTC by Nikolay
Modified: 2016-09-06 06:11 UTC (History)
2 users (show)

See Also:
i915 platform: SKL
i915 features: GPU hang


Attachments

Description Nikolay 2016-04-26 02:35:19 UTC
I'm using Ubunty 15.10 with Oibaf drivers on lenovo laptop with i5-6200U. From time to time system completely freezes.

After reboot I found this in syslog:

Apr 25 21:40:06 laptop kernel: [31304.322858] [drm] stuck on render ring
Apr 25 21:40:06 laptop kernel: [31304.323652] [drm] GPU HANG: ecode 9:0:0x84dffff8, in Xorg [926], reason: Ring hung, action: reset
Apr 25 21:40:06 laptop kernel: [31304.323658] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
Apr 25 21:40:06 laptop kernel: [31304.323661] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
Apr 25 21:40:06 laptop kernel: [31304.323663] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
Apr 25 21:40:06 laptop kernel: [31304.323665] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
Apr 25 21:40:06 laptop kernel: [31304.323668] [drm] GPU crash dump saved to /sys/class/drm/card0/error
Apr 25 21:40:06 laptop kernel: [31304.323727] ------------[ cut here ]------------
Apr 25 21:40:06 laptop kernel: [31304.323812] WARNING: CPU: 0 PID: 8711 at /build/linux-HVWSXI/linux-4.2.0/drivers/gpu/drm/i915/intel_display.c:11008 intel_mmio_flip_work_func+0x3b2/0x3c0 [i915]()
Apr 25 21:40:06 laptop kernel: [31304.323817] WARN_ON(__i915_wait_request(mmio_flip->req, mmio_flip->crtc->reset_counter, false, NULL, &mmio_flip->i915->rps.mmioflips))
Apr 25 21:40:06 laptop kernel: [31304.323821] Modules linked in: snd_usb_audio snd_usbmidi_lib hid_logitech_hidpp hid_generic hid_logitech_dj gspca_zc3xx gspca_main cts rpcsec_gss_krb5 nfsv4 nfs lockd grace fscache rfcomm thinkpad_acpi nvram msr nf_conntrack_netbios_ns nf_conntrack_broadcast drbg ansi_cprng ctr ccm xt_tcpudp arc4 ip6t_rpfilter ip6t_REJEC
T nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_conntrack bnep ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_sec
urity iptable_raw iptable_filter ip_tables x_tables uvcvideo videobuf2_vmalloc videobuf2_memops btusb btrtl btbcm btintel hid_multitouch bluetooth videobuf2_core v4l2_common videodev usbhid media binfmt_misc snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic intel_rapl nls_iso8859_1 x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_intel i2c_d
esignware_platform snd_hda_codec kvm_intel i2c_designware_core snd_hda_core kvm snd_hwdep snd_pcm ath10k_pci(OE) crct10dif_pclmul crc32_pclmul snd_seq_midi ath10k_core(OE) snd_seq_midi_event ath(OE) snd_rawmidi mac80211(OE) aesni_intel aes_x86_64 lrw cfg80211(OE) gf128mul glue_helper snd_seq compat(OE) ablk_helper cryptd snd_seq_device snd_timer joydev input_l
eds snd serio_raw idma64 soundcore virt_dma mei_me shpchp mei intel_lpss_pci ideapad_laptop sparse_keymap mac_hid intel_lpss_acpi intel_lpss acpi_pad parport_pc auth_rpcgss ppdev sunrpc lp parport autofs4 i915 psmouse i2c_algo_bit r8169 drm_kms_helper mii ahci drm libahci wmi video i2c_hid pinctrl_sunrisepoint pinctrl_intel hid
Apr 25 21:40:06 laptop kernel: [31304.324141] CPU: 0 PID: 8711 Comm: kworker/0:1 Tainted: G        W  OE   4.2.0-35-generic #40-Ubuntu
Apr 25 21:40:06 laptop kernel: [31304.324147] Hardware name: LENOVO 80R3/Flex 3-1480, BIOS D3CN27WW 10/28/2015
Apr 25 21:40:06 laptop kernel: [31304.324218] Workqueue: events intel_mmio_flip_work_func [i915]
Apr 25 21:40:06 laptop kernel: [31304.324225]  0000000000000286 000000002232730e ffff880197e87cd8 ffffffff817f1d7e
Apr 25 21:40:06 laptop kernel: [31304.324234]  ffff880197e87d30 ffffffffc0244b00 ffff880197e87d18 ffffffff8107cb46
Apr 25 21:40:06 laptop kernel: [31304.324242]  ffff880197e87ce8 ffff8800112d0a80 ffff880243c15e40 ffff880243c1a900
Apr 25 21:40:06 laptop kernel: [31304.324249] Call Trace:
Apr 25 21:40:06 laptop kernel: [31304.324263]  [<ffffffff817f1d7e>] dump_stack+0x63/0x81
Apr 25 21:40:06 laptop kernel: [31304.324274]  [<ffffffff8107cb46>] warn_slowpath_common+0x86/0xc0
Apr 25 21:40:06 laptop kernel: [31304.324284]  [<ffffffff8107cbd5>] warn_slowpath_fmt+0x55/0x70
Apr 25 21:40:06 laptop kernel: [31304.324350]  [<ffffffffc01e1862>] intel_mmio_flip_work_func+0x3b2/0x3c0 [i915]
Apr 25 21:40:06 laptop kernel: [31304.324361]  [<ffffffff81095bc2>] process_one_work+0x162/0x480
Apr 25 21:40:06 laptop kernel: [31304.324370]  [<ffffffff81095f2b>] worker_thread+0x4b/0x4c0
Apr 25 21:40:06 laptop kernel: [31304.324380]  [<ffffffff81095ee0>] ? process_one_work+0x480/0x480
Apr 25 21:40:06 laptop kernel: [31304.324389]  [<ffffffff81095ee0>] ? process_one_work+0x480/0x480
Apr 25 21:40:06 laptop kernel: [31304.324397]  [<ffffffff8109c2b8>] kthread+0xd8/0xf0
Apr 25 21:40:06 laptop kernel: [31304.324406]  [<ffffffff8109c1e0>] ? kthread_create_on_node+0x1f0/0x1f0
Apr 25 21:40:06 laptop kernel: [31304.324415]  [<ffffffff817f909f>] ret_from_fork+0x3f/0x70
Apr 25 21:40:06 laptop kernel: [31304.324423]  [<ffffffff8109c1e0>] ? kthread_create_on_node+0x1f0/0x1f0
Apr 25 21:40:06 laptop kernel: [31304.324429] ---[ end trace 6625ebf4916c59b6 ]---
Apr 25 21:40:06 laptop kernel: [31304.324449] ------------[ cut here ]------------
Apr 25 21:40:06 laptop kernel: [31304.324516] WARNING: CPU: 0 PID: 8651 at /build/linux-HVWSXI/linux-4.2.0/drivers/gpu/drm/i915/intel_display.c:11008 intel_mmio_flip_work_func+0x3b2/0x3c0 [i915]()
Apr 25 21:40:06 laptop kernel: [31304.324520] WARN_ON(__i915_wait_request(mmio_flip->req, mmio_flip->crtc->reset_counter, false, NULL, &mmio_flip->i915->rps.mmioflips))
Apr 25 21:40:06 laptop kernel: [31304.324525] Modules linked in: snd_usb_audio snd_usbmidi_lib hid_logitech_hidpp hid_generic hid_logitech_dj gspca_zc3xx gspca_main cts rpcsec_gss_krb5 nfsv4 nfs lockd grace fscache rfcomm thinkpad_acpi nvram msr nf_conntrack_netbios_ns nf_conntrack_broadcast drbg ansi_cprng ctr ccm xt_tcpudp arc4 ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_conntrack bnep ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw iptable_filter ip_tables x_tables uvcvideo videobuf2_vmalloc videobuf2_memops btusb btrtl btbcm btintel hid_multitouch bluetooth videobuf2_core v4l2_common videodev usbhid media binfmt_misc snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic intel_rapl nls_iso8859_1 x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_intel i2c_designware_platform snd_hda_codec kvm_intel i2c_designware_core snd_hda_core kvm snd_hwdep snd_pcm ath10k_pci(OE) crct10dif_pclmul crc32_pclmul snd_seq_midi ath10k_core(OE) snd_seq_midi_event ath(OE) snd_rawmidi mac80211(OE) aesni_intel aes_x86_64 lrw cfg80211(OE) gf128mul glue_helper snd_seq compat(OE) ablk_helper cryptd snd_seq_device snd_timer joydev input_leds snd serio_raw idma64 soundcore virt_dma mei_me shpchp mei intel_lpss_pci ideapad_laptop sparse_keymap mac_hid intel_lpss_acpi intel_lpss acpi_pad parport_pc auth_rpcgss ppdev sunrpc lp parport autofs4 i915 psmouse i2c_algo_bit r8169 drm_kms_helper mii ahci drm libahci wmi video i2c_hid pinctrl_sunrisepoint pinctrl_intel hid
Apr 25 21:40:06 laptop kernel: [31304.324756] CPU: 0 PID: 8651 Comm: kworker/0:3 Tainted: G        W  OE   4.2.0-35-generic #40-Ubuntu
Apr 25 21:40:06 laptop kernel: [31304.324761] Hardware name: LENOVO 80R3/Flex 3-1480, BIOS D3CN27WW 10/28/2015
Apr 25 21:40:06 laptop kernel: [31304.324827] Workqueue: events intel_mmio_flip_work_func [i915]
Apr 25 21:40:06 laptop kernel: [31304.324832]  0000000000000286 000000004473536b ffff880212c73cd8 ffffffff817f1d7e
Apr 25 21:40:06 laptop kernel: [31304.324840]  ffff880212c73d30 ffffffffc0244b00 ffff880212c73d18 ffffffff8107cb46
Apr 25 21:40:06 laptop kernel: [31304.324848]  ffff880212c73ce8 ffff8800112d0680 ffff880243c15e40 ffff880243c1a900
Apr 25 21:40:06 laptop kernel: [31304.324857] Call Trace:
Apr 25 21:40:06 laptop kernel: [31304.324866]  [<ffffffff817f1d7e>] dump_stack+0x63/0x81
Apr 25 21:40:06 laptop kernel: [31304.324877]  [<ffffffff8107cb46>] warn_slowpath_common+0x86/0xc0
Apr 25 21:40:06 laptop kernel: [31304.324886]  [<ffffffff8107cbd5>] warn_slowpath_fmt+0x55/0x70
Apr 25 21:40:06 laptop kernel: [31304.324949]  [<ffffffffc01e1862>] intel_mmio_flip_work_func+0x3b2/0x3c0 [i915]
Apr 25 21:40:06 laptop kernel: [31304.324960]  [<ffffffff81095bc2>] process_one_work+0x162/0x480
Apr 25 21:40:06 laptop kernel: [31304.324968]  [<ffffffff81095f2b>] worker_thread+0x4b/0x4c0
Apr 25 21:40:06 laptop kernel: [31304.324976]  [<ffffffff81095ee0>] ? process_one_work+0x480/0x480
Apr 25 21:40:06 laptop kernel: [31304.324982]  [<ffffffff8109c2b8>] kthread+0xd8/0xf0
Apr 25 21:40:06 laptop kernel: [31304.324988]  [<ffffffff8109c1e0>] ? kthread_create_on_node+0x1f0/0x1f0
Apr 25 21:40:06 laptop kernel: [31304.324993]  [<ffffffff817f909f>] ret_from_fork+0x3f/0x70
Apr 25 21:40:06 laptop kernel: [31304.324998]  [<ffffffff8109c1e0>] ? kthread_create_on_node+0x1f0/0x1f0
Apr 25 21:40:06 laptop kernel: [31304.325002] ---[ end trace 6625ebf4916c59b7 ]---
Apr 25 21:40:06 laptop kernel: [31304.325923] drm/i915: Resetting chip after gpu hang
Apr 25 21:40:07 laptop kernel: [31305.359774] [drm] RC6 on

Unfortunately before reboot system is totally unresponsive, so I cannot take GPU crash dump.
Comment 1 yann 2016-04-26 06:33:58 UTC
Nikolay, unfortunately to be able to investigate and re-produce the issue we will need to get crash dump and also kernel log.
thanks
Comment 2 Nikolay 2016-04-26 13:05:09 UTC
Well, I understand that, but what should I do if system completely freezes after this event?
Freezes as in 'mouse not moving, keyboard doesn't respond'. Is there any way to unfreeze it to take that crash dump?
Comment 3 Nikolay 2016-04-26 14:04:01 UTC
Is there any way to make it make a dump automatically when this happens?
Comment 4 Jani Nikula 2016-04-27 07:44:11 UTC
Can you perhaps ssh into the machine?
Comment 5 yann 2016-04-27 08:04:43 UTC
If this is not possible, you may also consider to redirect the console via net or serial (changing on boot command line in grub)
Comment 6 Nikolay 2016-04-27 12:58:26 UTC
After the laptop got frozen it doesn't ping anymore, so I cannot ssh to it and I do not think I'll be able to use network console either. Unfortunately I do not have equipment to make serial connection with laptop :(.
Comment 7 yann 2016-05-02 14:46:10 UTC
Jani, is there any possible link with bug 95007 ?
Comment 8 Jani Nikula 2016-05-02 14:50:30 UTC
(In reply to yann from comment #7)
> Jani, is there any possible link with bug 95007 ?

I don't think so, but I'm not 100%.
Comment 9 Nikolay 2016-05-02 15:02:24 UTC
Just to add to additional report: this seems to be happening once in a few days. And anecdotally this may be triggered more often when some animated gif is left spinning on the screen in FF.
Comment 10 yann 2016-09-05 08:40:07 UTC
(In reply to Nikolay from comment #9)
> Just to add to additional report: this seems to be happening once in a few
> days. And anecdotally this may be triggered more often when some animated
> gif is left spinning on the screen in FF.

Nikolay, is it still occurring?

If this is occurring, can you try to change driver in your xorg.conf

Section "Device"
        Identifier "Intel Graphics"
        Driver "intel"
EndSection

by

Section "Device"
        Identifier "Intel Graphics"
        Driver "modesetting"
EndSection

and then confirm result.
Comment 11 Chris Wilson 2016-09-05 08:48:59 UTC
It's a skylake, make sure you have the most recent DMC (older firmware causes GPU hangs), and the most recent kernel (4.7+, older kernels are missing w/a). There are no known GPU hangs in -intel.
Comment 12 Nikolay 2016-09-06 04:00:41 UTC
Hi.

  Thanks for your responses. Indeed I didn't see this hang for quite some time now and I do not really know what fixed it. I use stock ubuntu kernel (4.4) and oibaf drivers. I guess this can be closed for now and I will post more comments if this hang happens again.

Thanks!


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.