Bug 96781 - [skl dp-mst] NULL pointer dereference after vblank/flip_done timed out
Summary: [skl dp-mst] NULL pointer dereference after vblank/flip_done timed out
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: XOrg git
Hardware: Other All
Importance: highest critical
Assignee: Daniel Vetter
QA Contact: Elio
URL:
Whiteboard: ReadyForDev
Keywords: bisected
Depends on:
Blocks:
 
Reported: 2016-07-02 09:45 UTC by Chris Wilson
Modified: 2018-05-09 00:51 UTC
CC List: 18 users

See Also:
i915 platform: I965GM, SKL
i915 features: display/atomic


Attachments
dmesg of a system with this bug (108.62 KB, text/plain)
2016-10-28 08:56 UTC, José Jorge
dmesg log with drm.debug enabled (659.06 KB, text/x-log)
2016-11-03 22:29 UTC, willma
revert ea0000f0d369a59c2086fe9c489e0a2a86e080ba for 4.8.9 (1.87 KB, patch)
2016-11-20 14:11 UTC, Jim Rees
[PATCH] drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking" (2.26 KB, patch)
2016-12-21 12:29 UTC, Jim Rees
linux-tip build with patch and drm.debug enabled dmesg output (25.31 KB, application/x-bzip)
2016-12-29 14:55 UTC, erroneous@gmail.com
journalctl 4.8.17 (365.80 KB, text/plain)
2017-01-11 04:49 UTC, Chris Murphy
4.10.0-rc6: drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking" (2.28 KB, patch)
2017-02-01 01:46 UTC, Jim Rees
/sys/class/drm/card0/error (757.73 KB, text/plain)
2017-02-09 12:46 UTC, Chris Down
Don't fall over flip_done failures that hard (1.74 KB, patch)
2017-02-28 10:11 UTC, Daniel Vetter
4.11.0-rc1: drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking" (2.21 KB, patch)
2017-03-12 04:33 UTC, Jim Rees
4.11.0: drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking" (2.21 KB, patch)
2017-05-09 17:49 UTC, Jim Rees
dmesg (67.94 KB, text/plain)
2017-05-31 02:30 UTC, Diego Viola

Description Chris Wilson 2016-07-02 09:45:13 UTC
[   43.103582] ------------[ cut here ]------------
[   43.103589] WARNING: CPU: 3 PID: 228 at drivers/gpu/drm/i915/intel_display.c:13617 intel_atomic_commit_tail+0x11d4/0x11e0
[   43.103589] pipe B vblank wait timed out
[   43.103620] Modules linked in: drbg ansi_cprng ctr ccm bnep binfmt_misc arc4 nls_iso8859_1 snd_hda_codec_hdmi iwlmvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel x86_pkg_temp_thermal snd_hda_codec mac80211 snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi intel_powerclamp coretemp 8250_dw kvm_intel snd_seq kvm iwlwifi snd_seq_device snd_timer btusb hci_uart btrtl irqbypass crct10dif_pclmul crc32_pclmul snd ghash_clmulni_intel idma64 btbcm virt_dma btqca btintel cfg80211 aesni_intel aes_x86_64 bluetooth soundcore lrw gf128mul glue_helper ablk_helper cryptd intel_pch_thermal intel_lpss_pci intel_lpss_acpi intel_lpss mfd_core acpi_als acpi_pad kfifo_buf industrialio autofs4 dm_mirror dm_region_hash dm_log ahci sdhci_pci sdhci libahci i2c_hid hid
[   43.103623] CPU: 3 PID: 228 Comm: plymouthd Tainted: G        W       4.7.0-rc5+ #23
[   43.103624] Hardware name:                  /NUC6i3SYB, BIOS SYSKLi35.86A.0024.2015.1027.2142 10/27/2015
[   43.103626]  0000000000000286 00000000f6f6e55c ffff88084527b7e0 ffffffff81330dc8
[   43.103628]  ffff88084527b830 0000000000000000 ffff88084527b820 ffffffff81059a5b
[   43.103630]  00003531817ad522 00000000000000b0 ffff88084d959000 0000000000000004
[   43.103630] Call Trace:
[   43.103635]  [<ffffffff81330dc8>] dump_stack+0x4d/0x65
[   43.103638]  [<ffffffff81059a5b>] __warn+0xcb/0xf0
[   43.103640]  [<ffffffff81059adf>] warn_slowpath_fmt+0x5f/0x80
[   43.103642]  [<ffffffff81098635>] ? finish_wait+0x55/0x70
[   43.103645]  [<ffffffff814df7e4>] intel_atomic_commit_tail+0x11d4/0x11e0
[   43.103647]  [<ffffffff81098ac0>] ? wake_atomic_t_function+0x60/0x60
[   43.103649]  [<ffffffff814dfc22>] intel_atomic_commit+0x432/0x560
[   43.103652]  [<ffffffff8145aad7>] ? drm_atomic_check_only+0x187/0x610
[   43.103654]  [<ffffffff8145b547>] ? drm_atomic_add_affected_connectors+0x27/0xf0
[   43.103656]  [<ffffffff8145af97>] drm_atomic_commit+0x37/0x60
[   43.103659]  [<ffffffff81439b35>] drm_fb_helper_restore_fbdev_mode_unlocked+0x1a5/0x2c0
[   43.103662]  [<ffffffff81439c7d>] drm_fb_helper_set_par+0x2d/0x50
[   43.103664]  [<ffffffff814faa7a>] intel_fbdev_set_par+0x1a/0x60
[   43.103667]  [<ffffffff8103e441>] ? flat_send_IPI_mask+0x11/0x20
[   43.103669]  [<ffffffff81392c16>] fb_set_var+0x236/0x470
[   43.103670]  [<ffffffff810822e6>] ? resched_curr+0x66/0xc0
[   43.103672]  [<ffffffff8108b919>] ? check_preempt_wakeup+0x199/0x220
[   43.103673]  [<ffffffff81082da0>] ? check_preempt_curr+0x80/0x90
[   43.103675]  [<ffffffff811a504a>] ? __slab_free+0x9a/0x2c0
[   43.103677]  [<ffffffff8138c9cf>] fbcon_blank+0x30f/0x350
[   43.103678]  [<ffffffff8108cb16>] ? dequeue_entity+0x256/0xb30
[   43.103681]  [<ffffffff81408fa3>] do_unblank_screen+0xd3/0x1a0
[   43.103682]  [<ffffffff813ff23b>] vt_ioctl+0x90b/0x1300
[   43.103683]  [<ffffffff8143a89a>] ? drm_dropmaster_ioctl+0x4a/0x70
[   43.103684]  [<ffffffff814400b6>] ? drm_ioctl+0x2f6/0x490
[   43.103686]  [<ffffffff813f3ee6>] tty_ioctl+0x356/0xc40
[   43.103688]  [<ffffffff810b8fb0>] ? __hrtimer_init+0x90/0x90
[   43.103690]  [<ffffffff817aed39>] ? schedule_hrtimeout_range_clock+0xb9/0x1a0
[   43.103693]  [<ffffffff811d6a51>] do_vfs_ioctl+0xa1/0x5a0
[   43.103694]  [<ffffffff811c53fe>] ? ____fput+0xe/0x10
[   43.103696]  [<ffffffff81083c90>] ? wake_up_q+0x70/0x70
[   43.103697]  [<ffffffff811d6f91>] SyS_ioctl+0x41/0x70
[   43.103699]  [<ffffffff817af75f>] entry_SYSCALL_64_fastpath+0x17/0x93
[   43.103700] ---[ end trace b2783a69c5e45156 ]---
[   53.099417] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[   63.298451] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[   73.298180] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[   83.306290] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[   93.318428] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[  103.326473] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[  113.350477] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[  123.466563] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[  123.473004] [drm] RC6 on
[  133.514663] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[  133.618630] ------------[ cut here ]------------
[  133.618639] WARNING: CPU: 2 PID: 1739 at drivers/gpu/drm/i915/intel_display.c:13617 intel_atomic_commit_tail+0x11d4/0x11e0
[  133.618640] pipe B vblank wait timed out
[  133.618642] Modules linked in: drbg ansi_cprng ctr ccm bnep binfmt_misc arc4 nls_iso8859_1 snd_hda_codec_hdmi iwlmvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel x86_pkg_temp_thermal snd_hda_codec mac80211 snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi intel_powerclamp coretemp 8250_dw kvm_intel snd_seq kvm iwlwifi snd_seq_device snd_timer btusb hci_uart btrtl irqbypass crct10dif_pclmul crc32_pclmul snd ghash_clmulni_intel idma64 btbcm virt_dma btqca btintel cfg80211 aesni_intel aes_x86_64 bluetooth soundcore lrw gf128mul glue_helper ablk_helper cryptd intel_pch_thermal intel_lpss_pci intel_lpss_acpi intel_lpss mfd_core acpi_als acpi_pad kfifo_buf industrialio autofs4 dm_mirror dm_region_hash dm_log ahci sdhci_pci sdhci libahci i2c_hid hid
[  133.618687] CPU: 2 PID: 1739 Comm: Xorg Tainted: G        W       4.7.0-rc5+ #23
[  133.618688] Hardware name:                  /NUC6i3SYB, BIOS SYSKLi35.86A.0024.2015.1027.2142 10/27/2015
[  133.618690]  0000000000000286 00000000814872b6 ffff880848d8ba58 ffffffff81330dc8
[  133.618693]  ffff880848d8baa8 0000000000000000 ffff880848d8ba98 ffffffff81059a5b
[  133.618695]  0000353100000286 00000000000000b0 ffff88084d959000 0000000000000004
[  133.618698] Call Trace:
[  133.618705]  [<ffffffff81330dc8>] dump_stack+0x4d/0x65
[  133.618709]  [<ffffffff81059a5b>] __warn+0xcb/0xf0
[  133.618712]  [<ffffffff81059adf>] warn_slowpath_fmt+0x5f/0x80
[  133.618716]  [<ffffffff81098635>] ? finish_wait+0x55/0x70
[  133.618719]  [<ffffffff814df7e4>] intel_atomic_commit_tail+0x11d4/0x11e0
[  133.618723]  [<ffffffff81098ac0>] ? wake_atomic_t_function+0x60/0x60
[  133.618726]  [<ffffffff814dfc22>] intel_atomic_commit+0x432/0x560
[  133.618729]  [<ffffffff8145aad7>] ? drm_atomic_check_only+0x187/0x610
[  133.618731]  [<ffffffff8145b6a7>] ? drm_atomic_set_crtc_for_connector+0x97/0x100
[  133.618734]  [<ffffffff8145af97>] drm_atomic_commit+0x37/0x60
[  133.618737]  [<ffffffff81436521>] drm_atomic_helper_set_config+0x81/0xc0
[  133.618740]  [<ffffffff81449625>] drm_mode_set_config_internal+0x65/0x110
[  133.618743]  [<ffffffff8144e37a>] drm_mode_setcrtc+0x43a/0x550
[  133.618745]  [<ffffffff8144008d>] drm_ioctl+0x2cd/0x490
[  133.618748]  [<ffffffff8144df40>] ? drm_mode_setplane+0x1c0/0x1c0
[  133.618752]  [<ffffffff811d6a51>] do_vfs_ioctl+0xa1/0x5a0
[  133.618755]  [<ffffffff81066ef1>] ? __set_task_blocked+0x41/0xa0
[  133.618759]  [<ffffffff810697f6>] ? __set_current_blocked+0x36/0x50
[  133.618761]  [<ffffffff811d6f91>] SyS_ioctl+0x41/0x70
[  133.618764]  [<ffffffff81069aae>] ? SyS_rt_sigprocmask+0x8e/0xc0
[  133.618768]  [<ffffffff817af75f>] entry_SYSCALL_64_fastpath+0x17/0x93
[  133.618770] ---[ end trace b2783a69c5e45157 ]---
[  143.618784] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[  143.766774] ------------[ cut here ]------------
[  143.766791] WARNING: CPU: 2 PID: 1739 at drivers/gpu/drm/drm_irq.c:1227 drm_wait_one_vblank+0x1ba/0x1c0
[  143.766795] vblank wait timed out on crtc 1
[  143.766797] Modules linked in: drbg ansi_cprng ctr ccm bnep binfmt_misc arc4 nls_iso8859_1 snd_hda_codec_hdmi iwlmvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel x86_pkg_temp_thermal snd_hda_codec mac80211 snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi intel_powerclamp coretemp 8250_dw kvm_intel snd_seq kvm iwlwifi snd_seq_device snd_timer btusb hci_uart btrtl irqbypass crct10dif_pclmul crc32_pclmul snd ghash_clmulni_intel idma64 btbcm virt_dma btqca btintel cfg80211 aesni_intel aes_x86_64 bluetooth soundcore lrw gf128mul glue_helper ablk_helper cryptd intel_pch_thermal intel_lpss_pci intel_lpss_acpi intel_lpss mfd_core acpi_als acpi_pad kfifo_buf industrialio autofs4 dm_mirror dm_region_hash dm_log ahci sdhci_pci sdhci libahci i2c_hid hid
[  143.766883] CPU: 2 PID: 1739 Comm: Xorg Tainted: G        W       4.7.0-rc5+ #23
[  143.766887] Hardware name:                  /NUC6i3SYB, BIOS SYSKLi35.86A.0024.2015.1027.2142 10/27/2015
[  143.766890]  0000000000000286 00000000814872b6 ffff880848d8b8d0 ffffffff81330dc8
[  143.766896]  ffff880848d8b920 0000000000000000 ffff880848d8b910 ffffffff81059a5b
[  143.766902]  000004cb00003246 ffff880845960000 0000000000000001 0000000000000000
[  143.766908] Call Trace:
[  143.766919]  [<ffffffff81330dc8>] dump_stack+0x4d/0x65
[  143.766927]  [<ffffffff81059a5b>] __warn+0xcb/0xf0
[  143.766933]  [<ffffffff81059adf>] warn_slowpath_fmt+0x5f/0x80
[  143.766941]  [<ffffffff81098635>] ? finish_wait+0x55/0x70
[  143.766947]  [<ffffffff81441faa>] drm_wait_one_vblank+0x1ba/0x1c0
[  143.766954]  [<ffffffff81098ac0>] ? wake_atomic_t_function+0x60/0x60
[  143.766965]  [<ffffffff814717ad>] skl_update_wm+0x53d/0x560
[  143.766972]  [<ffffffff81477f3e>] intel_update_watermarks+0x1e/0x30
[  143.766980]  [<ffffffff814de10a>] intel_pre_plane_update+0x18a/0x1c0
[  143.766987]  [<ffffffff814de900>] intel_atomic_commit_tail+0x2f0/0x11e0
[  143.766992]  [<ffffffff811a4b69>] ? kmem_cache_alloc_trace+0x1a9/0x1c0
[  143.766999]  [<ffffffff814dfc22>] intel_atomic_commit+0x432/0x560
[  143.767005]  [<ffffffff8145aad7>] ? drm_atomic_check_only+0x187/0x610
[  143.767009]  [<ffffffff8145af97>] drm_atomic_commit+0x37/0x60
[  143.767017]  [<ffffffff8143612c>] drm_atomic_helper_disable_plane+0xac/0xf0
[  143.767022]  [<ffffffff81449b6b>] __setplane_internal+0x17b/0x280
[  143.767027]  [<ffffffff81459eea>] ? drm_modeset_lock_all_ctx+0x9a/0xb0
[  143.767032]  [<ffffffff8144dec8>] drm_mode_setplane+0x148/0x1c0
[  143.767038]  [<ffffffff8144008d>] drm_ioctl+0x2cd/0x490
[  143.767043]  [<ffffffff8144dd80>] ? drm_plane_check_pixel_format+0x50/0x50
[  143.767051]  [<ffffffff811e3f64>] ? mntput+0x24/0x40
[  143.767058]  [<ffffffff811d6a51>] do_vfs_ioctl+0xa1/0x5a0
[  143.767064]  [<ffffffff81066ef1>] ? __set_task_blocked+0x41/0xa0
[  143.767071]  [<ffffffff810697f6>] ? __set_current_blocked+0x36/0x50
[  143.767077]  [<ffffffff811d6f91>] SyS_ioctl+0x41/0x70
[  143.767083]  [<ffffffff81069a94>] ? SyS_rt_sigprocmask+0x74/0xc0
[  143.767092]  [<ffffffff817af75f>] entry_SYSCALL_64_fastpath+0x17/0x93
[  143.767096] ---[ end trace b2783a69c5e45158 ]---
[  143.838665] ------------[ cut here ]------------
[  143.838682] WARNING: CPU: 2 PID: 1739 at drivers/gpu/drm/i915/intel_display.c:13617 intel_atomic_commit_tail+0x11d4/0x11e0
[  143.838685] pipe B vblank wait timed out
[  143.838688] Modules linked in: drbg ansi_cprng ctr ccm bnep binfmt_misc arc4 nls_iso8859_1 snd_hda_codec_hdmi iwlmvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel x86_pkg_temp_thermal snd_hda_codec mac80211 snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi intel_powerclamp coretemp 8250_dw kvm_intel snd_seq kvm iwlwifi snd_seq_device snd_timer btusb hci_uart btrtl irqbypass crct10dif_pclmul crc32_pclmul snd ghash_clmulni_intel idma64 btbcm virt_dma btqca btintel cfg80211 aesni_intel aes_x86_64 bluetooth soundcore lrw gf128mul glue_helper ablk_helper cryptd intel_pch_thermal intel_lpss_pci intel_lpss_acpi intel_lpss mfd_core acpi_als acpi_pad kfifo_buf industrialio autofs4 dm_mirror dm_region_hash dm_log ahci sdhci_pci sdhci libahci i2c_hid hid
[  143.838773] CPU: 2 PID: 1739 Comm: Xorg Tainted: G        W       4.7.0-rc5+ #23
[  143.838777] Hardware name:                  /NUC6i3SYB, BIOS SYSKLi35.86A.0024.2015.1027.2142 10/27/2015
[  143.838780]  0000000000000286 00000000814872b6 ffff880848d8ba28 ffffffff81330dc8
[  143.838787]  ffff880848d8ba78 0000000000000000 ffff880848d8ba68 ffffffff81059a5b
[  143.838792]  0000353100000286 00000000000000b0 ffff88084d959000 0000000000000004
[  143.838798] Call Trace:
[  143.838811]  [<ffffffff81330dc8>] dump_stack+0x4d/0x65
[  143.838819]  [<ffffffff81059a5b>] __warn+0xcb/0xf0
[  143.838825]  [<ffffffff81059adf>] warn_slowpath_fmt+0x5f/0x80
[  143.838833]  [<ffffffff81098635>] ? finish_wait+0x55/0x70
[  143.838839]  [<ffffffff814df7e4>] intel_atomic_commit_tail+0x11d4/0x11e0
[  143.838846]  [<ffffffff81098ac0>] ? wake_atomic_t_function+0x60/0x60
[  143.838852]  [<ffffffff814dfc22>] intel_atomic_commit+0x432/0x560
[  143.838859]  [<ffffffff8145aad7>] ? drm_atomic_check_only+0x187/0x610
[  143.838864]  [<ffffffff8145af97>] drm_atomic_commit+0x37/0x60
[  143.838872]  [<ffffffff8143612c>] drm_atomic_helper_disable_plane+0xac/0xf0
[  143.838877]  [<ffffffff81449b6b>] __setplane_internal+0x17b/0x280
[  143.838882]  [<ffffffff81459eea>] ? drm_modeset_lock_all_ctx+0x9a/0xb0
[  143.838888]  [<ffffffff8144dec8>] drm_mode_setplane+0x148/0x1c0
[  143.838893]  [<ffffffff8144008d>] drm_ioctl+0x2cd/0x490
[  143.838899]  [<ffffffff8144dd80>] ? drm_plane_check_pixel_format+0x50/0x50
[  143.838907]  [<ffffffff811e3f64>] ? mntput+0x24/0x40
[  143.838914]  [<ffffffff811d6a51>] do_vfs_ioctl+0xa1/0x5a0
[  143.838921]  [<ffffffff81066ef1>] ? __set_task_blocked+0x41/0xa0
[  143.838928]  [<ffffffff810697f6>] ? __set_current_blocked+0x36/0x50
[  143.838933]  [<ffffffff811d6f91>] SyS_ioctl+0x41/0x70
[  143.838941]  [<ffffffff817af75f>] entry_SYSCALL_64_fastpath+0x17/0x93
[  143.838946] ---[ end trace b2783a69c5e45159 ]---
[  153.838833] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[  167.022595] [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
[  179.786793] BUG: unable to handle kernel NULL pointer dereference at           (null)
[  179.786840] IP: [<ffffffff810983eb>] __wake_up_common+0x2b/0x90
[  179.786875] PGD 0 
[  179.786889] Oops: 0000 [#1] SMP
[  179.786906] Modules linked in: drbg ansi_cprng ctr ccm bnep binfmt_misc arc4 nls_iso8859_1 snd_hda_codec_hdmi iwlmvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel x86_pkg_temp_thermal snd_hda_codec mac80211 snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi intel_powerclamp coretemp 8250_dw kvm_intel snd_seq kvm iwlwifi snd_seq_device snd_timer btusb hci_uart btrtl irqbypass crct10dif_pclmul crc32_pclmul snd ghash_clmulni_intel idma64 btbcm virt_dma btqca btintel cfg80211 aesni_intel aes_x86_64 bluetooth soundcore lrw gf128mul glue_helper ablk_helper cryptd intel_pch_thermal intel_lpss_pci intel_lpss_acpi intel_lpss mfd_core acpi_als acpi_pad kfifo_buf industrialio autofs4 dm_mirror dm_region_hash dm_log ahci sdhci_pci sdhci libahci i2c_hid hid
[  179.787353] CPU: 2 PID: 1780 Comm: Xorg Tainted: G        W       4.7.0-rc5+ #23
[  179.787387] Hardware name:                  /NUC6i3SYB, BIOS SYSKLi35.86A.0024.2015.1027.2142 10/27/2015
[  179.787429] task: ffff880845b1ab80 ti: ffff880848d88000 task.ti: ffff880848d88000
[  179.787463] RIP: 0010:[<ffffffff810983eb>]  [<ffffffff810983eb>] __wake_up_common+0x2b/0x90
[  179.787505] RSP: 0018:ffff880848d8b978  EFLAGS: 00010082
[  179.787530] RAX: 0000000000000082 RBX: ffff8808450dd0d8 RCX: 0000000000000000
[  179.787563] RDX: 0000000000000000 RSI: 0000000000000003 RDI: ffff8808450dd0d8
[  179.787595] RBP: ffff880848d8b9b0 R08: 0000000000000000 R09: ffff880848d8b910
[  179.787627] R10: 0000000000000005 R11: ffff880848d8b97c R12: ffff8808450dd0e0
[  179.787660] R13: 0000000000000082 R14: ffff88084513e700 R15: 0000000000000003
[  179.787692] FS:  00007f4c095d0a00(0000) GS:ffff880871d00000(0000) knlGS:0000000000000000
[  179.787729] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  179.787756] CR2: 0000000000000000 CR3: 000000084cdab000 CR4: 00000000001406e0
[  179.787788] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  179.787821] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  179.787853] Stack:
[  179.787864]  0000000081441293 0000000000000000 ffff8808450dd0d8 ffff8808450dd0d0
[  179.787905]  0000000000000082 ffff88084513e700 0000000000000001 ffff880848d8b9c0
[  179.787945]  ffffffff810984b3 ffff880848d8b9e8 ffffffff81098f57 ffff88084528a900
[  179.787986] Call Trace:
[  179.788002]  [<ffffffff810984b3>] __wake_up_locked+0x13/0x20
[  179.788030]  [<ffffffff81098f57>] complete_all+0x37/0x50
[  179.788057]  [<ffffffff8143e628>] drm_send_event_locked+0x28/0xe0
[  179.788087]  [<ffffffff81440dfe>] send_vblank_event+0x3e/0xa0
[  179.788115]  [<ffffffff81441ad9>] drm_vblank_off+0x179/0x240
[  179.788142]  [<ffffffff81441bb7>] drm_crtc_vblank_off+0x17/0x20
[  179.788172]  [<ffffffff814d7338>] haswell_crtc_disable+0x78/0x2b0
[  179.788201]  [<ffffffff814de785>] intel_atomic_commit_tail+0x175/0x11e0
[  179.788234]  [<ffffffff814a3af5>] ? __i915_wait_request+0x3c5/0x4f0
[  179.788265]  [<ffffffff81098ac0>] ? wake_atomic_t_function+0x60/0x60
[  179.788296]  [<ffffffff814dfc22>] intel_atomic_commit+0x432/0x560
[  179.788326]  [<ffffffff8145aad7>] ? drm_atomic_check_only+0x187/0x610
[  179.788356]  [<ffffffff8145b28c>] ? drm_atomic_set_mode_prop_for_crtc+0x2c/0x110
[  179.788391]  [<ffffffff8145af97>] drm_atomic_commit+0x37/0x60
[  179.788429]  [<ffffffff81436521>] drm_atomic_helper_set_config+0x81/0xc0
[  179.788461]  [<ffffffff81449625>] drm_mode_set_config_internal+0x65/0x110
[  179.789635]  [<ffffffff8144e37a>] drm_mode_setcrtc+0x43a/0x550
[  179.790787]  [<ffffffff8144008d>] drm_ioctl+0x2cd/0x490
[  179.791919]  [<ffffffff8144df40>] ? drm_mode_setplane+0x1c0/0x1c0
[  179.793053]  [<ffffffff811d6a51>] do_vfs_ioctl+0xa1/0x5a0
[  179.794374]  [<ffffffff8106a4f4>] ? restore_altstack+0x24/0x40
[  179.795703]  [<ffffffff811d6f91>] SyS_ioctl+0x41/0x70
[  179.796576]  [<ffffffff817af75f>] entry_SYSCALL_64_fastpath+0x17/0x93
[  179.797425] Code: 0f 1f 44 00 00 55 48 89 e5 41 57 41 56 41 55 41 54 4c 8d 67 08 53 41 89 f7 48 83 ec 10 89 55 cc 48 8b 57 08 4c 89 45 d0 49 39 d4 <48> 8b 32 74 45 41 89 ce 48 8d 42 e8 4c 8d 6e e8 eb 03 49 89 d5 
[  179.799245] RIP  [<ffffffff810983eb>] __wake_up_common+0x2b/0x90
[  179.800084]  RSP <ffff880848d8b978>
[  179.800851] CR2: 0000000000000000
[  179.804542] ---[ end trace b2783a69c5e4515a ]---
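For anyone comparing logs across kernels, the repeated flip_done errors above can be tallied per CRTC with a short script. This is only a minimal sketch: the function name `count_flip_done_timeouts` and the regular expression are illustrative assumptions derived from the message format seen in this report, not part of any kernel or DRM tooling.

```python
import re
from collections import Counter

# Assumed message format, taken from the log lines in this report, e.g.
#   [drm:drm_atomic_helper_commit_cleanup_done] *ERROR* [CRTC:30:pipe B] flip_done timed out
# The optional " [module]" group covers modular builds, where the kernel
# prints "[drm:... [drm_kms_helper]]" instead.
FLIP_DONE_RE = re.compile(
    r"\[drm:drm_atomic_helper_commit_cleanup_done(?: \[\w+\])?\] "
    r"\*ERROR\* \[CRTC:(\d+):pipe ([A-Z])\] flip_done timed out"
)


def count_flip_done_timeouts(log_text):
    """Return a Counter keyed by (crtc_id, pipe) for flip_done timeouts."""
    counts = Counter()
    for line in log_text.splitlines():
        match = FLIP_DONE_RE.search(line)
        if match:
            counts[(int(match.group(1)), match.group(2))] += 1
    return counts
```

Feeding it the text of a saved dmesg file (for example `count_flip_done_timeouts(open("dmesg.txt").read())`, where the file name is hypothetical) gives a quick way to see whether a patched kernel still hits the timeout and on which pipe.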
Comment 1 Rami 2016-10-20 15:21:22 UTC
Hi Chris,
what are the steps to reproduce this bug?
Comment 2 Jiri Slaby 2016-10-23 08:41:37 UTC
Different path in the warn+crash (via drm_mode_getconnector), but this seems to have the same symptoms:
https://apibugzilla.suse.com/show_bug.cgi?id=1006392
Comment 3 michael 2016-10-23 15:20:38 UTC
I reported http://bugzilla.suse.com/show_bug.cgi?id=1006392

I have in my laptop:
00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (primary) (rev 03)
00:02.1 Display controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (secondary) (rev 03)

Workaround: with i915.semaphores=1 added as a kernel parameter, I can boot kernel 4.8.3.

The system is a bit slow in some situations, though. E.g. when switching to/from the text console, some backtraces are still logged:

------------[ cut here ]------------
WARNING: CPU: 0 PID: 2025 at ../drivers/gpu/drm/drm_irq.c:1224 drm_wait_one_vblank+0x17d/0x190 [drm]
vblank wait timed out on crtc 0
Modules linked in: ipt_REJECT nf_reject_ipv4 tun bridge stp llc fuse ebtable_filter ebtables ip6table_filter ip6_tables af_packet ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack xt_tcpudp nf_log_ipv4 nf_log_common xt_LOG xt_limit iptable_filter ip_tables x_tables snd_hda_codec_realtek snd_hda_codec_generic iTCO_wdt iTCO_vendor_support arc4 ppdev coretemp kvm_intel kvm irqbypass pcspkr joydev iwl4965 iwlegacy i2c_i801 i2c_smbus mac80211 snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep lpc_ich mfd_core snd_pcm cfg80211 smsc_ircc2 sky2 rfkill irda i915 parport_pc snd_timer parport drm_kms_helper snd battery drm thermal fb_sys_fops syscopyarea sysfillrect sysimgblt i2c_algo_bit fjes shpchp soundcore acpi_cpufreq button fujitsu_laptop tpm_tis tpm_tis_core video ac tpm dm_crypt algif_skcipher af_alg hid_generic usbhid sr_mod cdrom ata_generic pcmcia ata_piix serio_raw sdhci_pci sdhci yenta_socket pcmcia_rsrc pcmcia_core mmc_core uhci_hcd ehci_pci ehci_hcd usbcore usb_common dm_mod sg
CPU: 0 PID: 2025 Comm: X Tainted: G     U  W       4.8.3-1-default #1
Hardware name: FUJITSU SIEMENS LIFEBOOK E8310/FJNB1CE, BIOS Version 1.14  08/20/2008
 0000000000000000 ffffffffb53a3e62 ffff957168e6b8c8 0000000000000000
 ffffffffb507ddde ffff957169210000 ffff957168e6b918 0000000000000000
 000000000c000006 ffff9571698e0e08 ffff9571424e3c00 ffffffffb507de4f
Call Trace:
 [<ffffffffb502eefe>] dump_trace+0x5e/0x310
 [<ffffffffb502f2cb>] show_stack_log_lvl+0x11b/0x1a0
 [<ffffffffb5030001>] show_stack+0x21/0x40
 [<ffffffffb53a3e62>] dump_stack+0x5c/0x7a
 [<ffffffffb507ddde>] __warn+0xbe/0xe0
 [<ffffffffb507de4f>] warn_slowpath_fmt+0x4f/0x60
 [<ffffffffc054b4ed>] drm_wait_one_vblank+0x17d/0x190 [drm]
 [<ffffffffc06c69e7>] intel_pre_plane_update+0x157/0x180 [i915]
 [<ffffffffc06c6d39>] intel_atomic_commit_tail+0x129/0x1060 [i915]
 [<ffffffffc06c807c>] intel_atomic_commit+0x40c/0x510 [i915]
 [<ffffffffc06cd6cf>] intel_release_load_detect_pipe+0x1f/0x80 [i915]
 [<ffffffffc0706f2a>] intel_tv_detect+0x33a/0x5c0 [i915]
 [<ffffffffc05f6dfd>] drm_helper_probe_single_connector_modes+0x26d/0x510 [drm_kms_helper]
 [<ffffffffc0556314>] drm_mode_getconnector+0x324/0x360 [drm]
 [<ffffffffc0549913>] drm_ioctl+0x1b3/0x440 [drm]
 [<ffffffffb522c31f>] do_vfs_ioctl+0x8f/0x5d0
 [<ffffffffb522c8d4>] SyS_ioctl+0x74/0x80
 [<ffffffffb56d43f6>] entry_SYSCALL_64_fastpath+0x1e/0xa8
DWARF2 unwinder stuck at entry_SYSCALL_64_fastpath+0x1e/0xa8
Leftover inexact backtrace:
---[ end trace 2b5094a97699f467 ]---
Comment 4 José Jorge 2016-10-28 08:53:37 UTC
(In reply to Rami from comment #1)
> Hi Chris,
> what are the steps to reproduce this bug?

I have the same bug. The steps are simply to boot the system with a 4.8.x kernel.
With a 4.7.5 kernel I don't have this bug. Same hardware:

00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller

I am using Mageia Cauldron (future Mageia 6).
Comment 5 José Jorge 2016-10-28 08:56:26 UTC
Created attachment 127582
dmesg of a system with this bug
Comment 6 willma 2016-11-03 22:27:34 UTC
I have the same error on Fedora with the current 4.8.4 kernel. It is also present in 4.9-rc3.

[drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out

This leads to a system freeze after exiting a full screen game.

lspci -v

00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (primary) (rev 0c) (prog-if 00 [VGA controller])
	Subsystem: Dell Latitude D630
	Flags: bus master, fast devsel, latency 0, IRQ 28
	Memory at f6e00000 (64-bit, non-prefetchable) [size=1M]
	Memory at e0000000 (64-bit, prefetchable) [size=256M]
	I/O ports at eff8 [size=8]
	[virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
	Capabilities: <access denied>
	Kernel driver in use: i915
	Kernel modules: i915
Comment 7 willma 2016-11-03 22:29:00 UTC
Created attachment 127741
dmesg log with drm.debug enabled
Comment 8 willma 2016-11-09 15:54:45 UTC
I did a kernel bisect and the flip_done timeout error came with commit ea0000f0d369a59c2086fe9c489e0a2a86e080ba - drm/i915: Roll out the helper nonblock tracking. 

The last good commit was 1f7528c4dbea46bd266798d3c374a961b1228055 - drm/i915: Signal drm events for atomic

The vblank timeout bug has been in the kernel since 4.4 or so, and there is already a report here: https://bugs.freedesktop.org/show_bug.cgi?id=93782
I don't know if they are somehow related.

The vblank bug has no visible effects on my laptop other than an error message in dmesg, but the flip_done timeout bug is more serious. It makes the boot hang for about 10 seconds with a blank screen and crashes the computer when switching back to the desktop from a full screen game.
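The bisect described above is, on a linear history, a binary search between a known-good and a known-bad commit. The following is a minimal sketch of that idea only; `first_bad_commit` and `is_bad` are hypothetical names, and the real `git bisect` additionally handles branching history and skipped commits.

```python
def first_bad_commit(commits, is_bad):
    """Find the first bad commit by binary search.

    Assumptions (hypothetical helper, not git's implementation):
    - commits is ordered oldest to newest,
    - commits[0] is known good and commits[-1] is known bad,
    - once a commit is bad, every later commit is bad too.
    is_bad(commit) stands for "build, boot, and check for the timeout".
    """
    good, bad = 0, len(commits) - 1
    while bad - good > 1:
        mid = (good + bad) // 2
        if is_bad(commits[mid]):
            bad = mid    # regression is at mid or earlier
        else:
            good = mid   # regression is after mid
    return commits[bad]
```

Each call to `is_bad` corresponds to one build-and-boot cycle, so the number of kernels that need testing grows only logarithmically with the number of commits in the range.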
Comment 9 Jim Rees 2016-11-20 14:11:07 UTC
Created attachment 128085
revert ea0000f0d369a59c2086fe9c489e0a2a86e080ba for 4.8.9

fix for https://bugs.freedesktop.org/show_bug.cgi?id=96781
patch adapted from https://bugs.freedesktop.org/show_bug.cgi?id=97529
Comment 10 Jim Rees 2016-11-20 14:15:01 UTC
I'm not sure what happened to my comment on the attachment I just sent. This bug causes a flip_done timeout and crash when I exit Xorg on my Thinkpad x220 with i915. The attached patch fixes it. As this is a regression, this should go upstream and to stable. What's the procedure for pushing it up? Has there been any discussion on lkml?
Comment 11 Jani Nikula 2016-11-23 15:25:00 UTC
(In reply to willma from comment #8)
> I did a kernel bisect and the flip_done timeout error came with commit
> ea0000f0d369a59c2086fe9c489e0a2a86e080ba - drm/i915: Roll out the helper
> nonblock tracking. 

Bad commit:

commit ea0000f0d369a59c2086fe9c489e0a2a86e080ba
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Mon Jun 13 16:13:46 2016 +0200

    drm/i915: Roll out the helper nonblock tracking
Comment 12 Maarten Lankhorst 2016-12-12 13:07:20 UTC
*** Bug 98554 has been marked as a duplicate of this bug. ***
Comment 13 Maarten Lankhorst 2016-12-12 13:09:07 UTC
*** Bug 95165 has been marked as a duplicate of this bug. ***
Comment 14 Jim Rees 2016-12-13 02:19:22 UTC
I sent a patch to i915 maintainers and list but got no response.

I wonder if commit e411072d57 "drm/i915: drop the struct_mutex when wedged or trying to reset" fixes the flip_done timeout. I'm going to do some testing.
Comment 15 Jim Rees 2016-12-13 19:38:46 UTC
Commit e411072d57 did not help. In fact it seems to have made things worse. I now get the flip_done timeout both on X startup and shutdown. Kernel 4.9 went out with this bug unfixed.
Comment 16 info.artur 2016-12-17 21:23:43 UTC
4.8.13-1-ARCH

VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (primary) (rev 0c) (prog-if 00 [VGA controller])
	Subsystem: Dell Latitude D630
	Flags: bus master, fast devsel, latency 0, IRQ 28
	Memory at f6e00000 (64-bit, non-prefetchable) [size=1M]
	Memory at e0000000 (64-bit, prefetchable) [size=256M]
	I/O ports at eff8 [size=8]
	[virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
	Capabilities: <access denied>
	Kernel driver in use: i915
	Kernel modules: i915

[drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:29:pipe B] flip_done timed out

It causes occasional Xorg freezes, often when switching from console mode.

So it seems that the same/similar hardware is affected :/
Comment 17 Jani Nikula 2016-12-19 10:50:40 UTC
(In reply to info.artur from comment #16)
> 4.8.13-1-ARCH

If you're chiming in "me too", please try to use the latest kernels. Preferably the drm-tip branch of https://cgit.freedesktop.org/drm-tip. Thanks.
Comment 18 Jim Rees 2016-12-20 16:50:18 UTC
I have just finished testing drm-tip, and it fixes the flip_done timeout for me (Thinkpad x220 i915). I have not bisected so I do not know what the fix is. Just to repeat, kernel 4.9 is broken, so it would be nice if either the ea0000f revert could be pushed to -stable, or the fix isolated and pushed to -stable. Also it would be good if others could test drm-tip.

I still have not seen any comments on the ea0000f revert that I sent to the drm mailing list. This seems like a bug that affects a lot of people and several kernel versions, and is a regression from previous kernels. Should this be escalated? How?
Comment 19 Daniel Vetter 2016-12-21 12:23:23 UTC
Typed up a patch to avoid the oops and make the nonblocking helpers more robust:

https://patchwork.freedesktop.org/patch/128918/

There will still be warnings in dmesg (and timeouts), but the driver should at least survive.

For the vblank timeout issue itself I think the best approach is to bisect what fixed it, and then backport that.
Comment 20 Jim Rees 2016-12-21 12:29:44 UTC
Created attachment 128603
[PATCH] drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking"

This is the patch I sent to the drm mailing list. It is the same as my previous patch but based on 4.9 and with summary and signoff.
Comment 21 Jani Nikula 2016-12-21 13:15:16 UTC
(In reply to Jim Rees from comment #20)
> Created attachment 128603
> [PATCH] drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking"
> 
> This is the patch I sent to the drm mailing list. It is the same as my
> previous patch but based on 4.9 and with summary and signoff.

Please try the patch from comment #19.
Comment 22 Daniel Vetter 2016-12-21 14:56:51 UTC
(In reply to Jani Nikula from comment #21)
> (In reply to Jim Rees from comment #20)
> > Created attachment 128603
> > [PATCH] drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking"
> > 
> > This is the patch I sent to the drm mailing list. It is the same as my
> > previous patch but based on 4.9 and with summary and signoff.
> 
> Please try the patch from comment #19.

Yeah, as soon as that has a tested-by (and note that it only fixes the hard crash/oops, there will still be timeouts) I can apply it and stuff it into the stable kernel queue.
Comment 23 Jim Rees 2016-12-27 18:07:52 UTC
If there will still be timeouts, what am I testing for? Is the flip_done timeout a separate bug, and should I open a separate bug report for it?
Comment 24 erroneous@gmail.com 2016-12-29 14:55:45 UTC
Created attachment 128686 [details]
linux-tip build with patch and drm.debug enabled dmesg output

dmesg output for drm-tip kernel with patch 128603 on Arch Linux. Still get the crash in addition to timeouts.
Comment 25 Jim Rees 2016-12-29 15:04:57 UTC
Is there some reason not to revert ea0000f0 while we debug this? It's clearly causing problems, is a regression from previous working behavior, and we have a fix that works. ea0000f0 was applied before it was ready, has broken previously working configurations, and should be reverted.
Comment 26 Jani Nikula 2016-12-30 10:12:26 UTC
(In reply to Jim Rees from comment #25)
> Is there some reason not to revert ea0000f0 while we debug this? It's
> clearly causing problems, is a regression from previous working behavior,
> and we have a fix that works. ea0000f0 was applied before it was ready, has
> broken previously working configurations, and should be reverted.

We can only *backport* commits to stable. We can't apply commits to stable kernels unless the commits are present in Linus' master. So we need to have this debugged and fixed upstream first. Unfortunately, the revert working for you in v4.9 does not get us any closer to having this fixed upstream. The commit also doesn't cleanly revert on v4.10-rc1 anymore.

Can you please try v4.10-rc1 and/or drm-tip branch of https://cgit.freedesktop.org/drm-tip, plus https://patchwork.freedesktop.org/patch/128918/ on top?
Comment 27 Daniel Vetter 2016-12-30 13:02:19 UTC
(In reply to erroneous@gmail.com from comment #24)
> Created attachment 128686 [details]
> linux-tip build with patch and drm.debug enabled dmesg output
> 
> dmesg output for drm-tip kernel with patch 128603 on Arch Linux. Still get
> the crash in addition to timeouts.

This patch will _not_ fix the WARNING backtrace, but it should fix the Oops/hard-hangs of the driver. Looking at dmesg, only the WARNING is left. Can you pls confirm that the hard hangs (not the long delays when vt switching or similar) are gone?
Comment 28 erroneous@gmail.com 2016-12-30 13:44:15 UTC
(In reply to Daniel Vetter from comment #27)
> (In reply to erroneous@gmail.com from comment #24)
> > Created attachment 128686 [details]
> > linux-tip build with patch and drm.debug enabled dmesg output
> > 
> > dmesg output for drm-tip kernel with patch 128603 on Arch Linux. Still get
> > the crash in addition to timeouts.
> 
> This patch will _not_ fix the WARNING backtrace, but it should fix the
> Oops/hard-hangs of the driver. Looking at dmesg, only the WARNING is left.
> Can you pls confirm that the hard hangs (not the long delays when vt
> switching or similar) are gone?

Sorry, didn't realize it was a warning BT. The system continues on without an oops, but now that I realize that the BT is just a warning I realize that it never crashed in the first place for my hardware with the 4.8.13 kernel. It only shows the same warning, not the same Oops. Please disregard my tests then since I couldn't reproduce the same Oops.
Comment 29 Jim Rees 2016-12-30 15:31:38 UTC
I don't get the hard hang or the oops. Do you still want me to test?

So should I file a new bug report for the timeout?
Comment 30 willma 2016-12-30 19:45:22 UTC
(In reply to Daniel Vetter from comment #27)

> This patch will _not_ fix the WARNING backtrace, but it should fix the
> Oops/hard-hangs of the driver. Looking at dmesg, only the WARNING is left.
> Can you pls confirm that the hard hangs (not the long delays when vt
> switching or similar) are gone?

I am currently using drm-tip with your patch applied and my system hangs a couple seconds while booting and when switching modes but I did not experience any hard hangs or crashes as I did with kernel 4.8.

I still have this line in my dmesg though:

[drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:34:pipe B] flip_done timed out

Is this what you mean by WARNING?
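
For anyone sorting through a saved log, the three kinds of kernel message discussed in this thread can be told apart mechanically: a WARNING backtrace, a DRM *ERROR* line such as the flip_done timeout above, and an Oops/BUG. The following is a minimal Python sketch, not part of any patch. The patterns are copied from log lines quoted in this bug, and the names PATTERNS, classify and summarize are made up for illustration.

```python
import re

# Patterns copied from messages quoted in this bug report.
# The category names are hypothetical labels, not kernel terminology.
PATTERNS = {
    "warning": re.compile(r"WARNING: CPU: \d+ PID: \d+ at "),
    "flip_done_error": re.compile(
        r"\*ERROR\* \[CRTC:\d+:pipe [A-Z]\] flip_done timed out"
    ),
    "oops": re.compile(r"Oops: [0-9a-f]{4} \[#\d+\]|BUG: unable to handle kernel"),
}


def classify(line):
    """Return the category of one log line, or None if nothing matches."""
    for name, pattern in PATTERNS.items():
        if pattern.search(line):
            return name
    return None


def summarize(lines):
    """Count how many lines fall into each category."""
    counts = {name: 0 for name in PATTERNS}
    for line in lines:
        kind = classify(line)
        if kind is not None:
            counts[kind] += 1
    return counts
```

Feeding it the output of dmesg or journalctl -k line by line gives a quick count per category. By this classification the flip_done line above is an *ERROR* print rather than a WARNING backtrace; whether that is what comment #27 meant is for the developers to confirm.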
Comment 31 Jani Nikula 2017-01-02 08:44:40 UTC
(In reply to Jim Rees from comment #29)
> I don't get the hard hang or the oops. Do you still want me to test?

On which kernel do you not get the hard hang or the oops?
Comment 32 Chris Murphy 2017-01-11 04:49:10 UTC
Created attachment 128882 [details]
journalctl 4.8.17

I'm hitting this rarely when hitting a key to "wake up" the display; system is not in suspend but gnome-shell has dimmed the display. Hitting a key is what happened 2 seconds before this crash. Including the entire journal output. Two call traces are there, maybe the first one caused the instability leading to the second. I was able to log in remotely with ssh, but the keyboard and mouse were unresponsive after this oops, and I couldn't get to a VT.

4.8.17-200.fc24.x86_64

Parameters i915.enable_guc_loading=-1 i915.enable_guc_submission=-1 are used for this event.

[12446.222314] f25h kernel: WARNING: CPU: 3 PID: 1549 at drivers/gpu/drm/i915/intel_display.c:13714 intel_atomic_commit_tail+0x1043/0x1050 [i915]
[12446.222320] f25h kernel: pipe A vblank wait timed out


and then


[12534.964553] f25h kernel: BUG: unable to handle kernel paging request at 00007fb63e44c94b
[12534.964737] f25h kernel: IP: [<ffffffffa80e46eb>] __wake_up_common+0x2b/0x80
[12534.964865] f25h kernel: PGD 2b2190067 PUD 0 
[12534.964946] f25h kernel: Oops: 0000 [#1] SMP
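
The two excerpts above carry seconds-since-boot timestamps, so the delay between the vblank WARNING and the later BUG can be computed directly. This is a small Python sketch for illustration only; the helper name seconds_since_boot is made up, and the two input lines are copied from the excerpts above.

```python
import re

# Leading "[  123.456789]" timestamp as printed in the excerpts above.
TIMESTAMP = re.compile(r"^\[\s*(\d+\.\d+)\]")


def seconds_since_boot(line):
    """Return the bracketed timestamp as a float, or None if the line has none."""
    match = TIMESTAMP.match(line)
    return float(match.group(1)) if match else None


warning = ("[12446.222314] f25h kernel: WARNING: CPU: 3 PID: 1549 at "
           "drivers/gpu/drm/i915/intel_display.c:13714 "
           "intel_atomic_commit_tail+0x1043/0x1050 [i915]")
bug = ("[12534.964553] f25h kernel: BUG: unable to handle kernel paging "
       "request at 00007fb63e44c94b")

gap = seconds_since_boot(bug) - seconds_since_boot(warning)
print(f"{gap:.1f} s between the vblank WARNING and the BUG")
```

For these two lines the gap is roughly 88.7 seconds. That says nothing by itself about causation, only how far apart the two events were.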
Comment 33 Kadir 2017-01-19 07:12:28 UTC
Hi,

I have the same issue on Fedora 25 with kernel 4.9.3-200.fc25.x86_64. I have reported it here: https://bugzilla.redhat.com/show_bug.cgi?id=1409228

The system hard freezes when logging out of Gnome and does not show the GDM login screen. Switching TTYs is also either slow or hangs. I am on:

00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)

Dell latitude e6320 with Sandy Bridge graphics, HD3000.

I get the following logs:

Jan 18 18:36:57 latitude kernel:  [<ffffffff950a202b>] __warn+0xcb/0xf0
Jan 18 18:36:57 latitude kernel:  [<ffffffff950a20af>] warn_slowpath_fmt+0x5f/0x80
Jan 18 18:36:57 latitude kernel:  [<ffffffff950e7054>] ? finish_wait+0x54/0x70
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04cda6a>] drm_wait_one_vblank+0x1aa/0x1b0 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffff950e7270>] ? prepare_to_wait_event+0x100/0x100
Jan 18 18:36:57 latitude kernel:  [<ffffffffc09aa1f9>] ironlake_crtc_enable+0x779/0xbe0 [i915]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc09a6420>] intel_update_crtc+0x50/0xe0 [i915]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc09a6516>] intel_update_crtcs+0x66/0x80 [i915]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc09a6c1e>] intel_atomic_commit_tail+0x33e/0xff0 [i915]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc09a7c23>] intel_atomic_commit+0x353/0x4c0 [i915]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04e055a>] ? drm_atomic_check_only+0x30a/0x590 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04e0ae3>] ? drm_atomic_set_mode_prop_for_crtc+0x103/0x110 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04e0829>] drm_atomic_commit+0x49/0x50 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc0649c3d>] drm_atomic_helper_set_config+0x7d/0xb0 [drm_kms_helper]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04d3895>] drm_mode_set_config_internal+0x65/0x110 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04d507d>] drm_mode_setcrtc+0x3fd/0x4f0 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04cbdcb>] drm_ioctl+0x21b/0x4c0 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04d4c80>] ? drm_mode_getcrtc+0x140/0x140 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffff9526db43>] do_vfs_ioctl+0xa3/0x5f0
Jan 18 18:36:57 latitude kernel:  [<ffffffff9526e109>] SyS_ioctl+0x79/0x90
Jan 18 18:36:57 latitude kernel:  [<ffffffff9581bbf7>] entry_SYSCALL_64_fastpath+0x1a/0xa9
Jan 18 18:36:57 latitude kernel: ---[ end trace 337cd55a01ebeea6 ]---
Jan 18 18:36:57 latitude kernel: ------------[ cut here ]------------
Jan 18 18:36:57 latitude kernel: WARNING: CPU: 3 PID: 886 at drivers/gpu/drm/i915/intel_display.c:14191 intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 18 18:36:57 latitude kernel: pipe A vblank wait timed out
Jan 18 18:36:57 latitude kernel: Modules linked in: binfmt_misc fuse uas usb_storage ccm snd_hda_codec_hdmi snd_hda_codec_idt snd_hda_codec_generic intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel iTCO_wdt i915 mei_wdt i
Jan 18 18:36:57 latitude kernel: CPU: 3 PID: 886 Comm: Xorg Tainted: G        W       4.9.3-200.fc25.x86_64 #1
Jan 18 18:36:57 latitude kernel: Hardware name: Dell Inc. Latitude E6320/09PHH9, BIOS A19 11/14/2013
Jan 18 18:36:57 latitude kernel:  ffff98e5414afa60 ffffffff953f3ddd ffff98e5414afab0 0000000000000000
Jan 18 18:36:57 latitude kernel:  ffff98e5414afaa0 ffffffff950a202b 0000376fd279f000 0000000000000000
Jan 18 18:36:57 latitude kernel:  0000000000000000 0000000000000000 0000000000000001 ffff8b4ae0fda000
Jan 18 18:36:57 latitude kernel: Call Trace:
Jan 18 18:36:57 latitude kernel:  [<ffffffff953f3ddd>] dump_stack+0x63/0x86
Jan 18 18:36:57 latitude kernel:  [<ffffffff950a202b>] __warn+0xcb/0xf0
Jan 18 18:36:57 latitude kernel:  [<ffffffff950a20af>] warn_slowpath_fmt+0x5f/0x80
Jan 18 18:36:57 latitude kernel:  [<ffffffff950e7054>] ? finish_wait+0x54/0x70
Jan 18 18:36:57 latitude kernel:  [<ffffffffc09a78b0>] intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 18 18:36:57 latitude kernel:  [<ffffffff950e7270>] ? prepare_to_wait_event+0x100/0x100
Jan 18 18:36:57 latitude kernel:  [<ffffffffc09a7c23>] intel_atomic_commit+0x353/0x4c0 [i915]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04e055a>] ? drm_atomic_check_only+0x30a/0x590 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04e0ae3>] ? drm_atomic_set_mode_prop_for_crtc+0x103/0x110 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04e0829>] drm_atomic_commit+0x49/0x50 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc0649c3d>] drm_atomic_helper_set_config+0x7d/0xb0 [drm_kms_helper]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04d3895>] drm_mode_set_config_internal+0x65/0x110 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04d507d>] drm_mode_setcrtc+0x3fd/0x4f0 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04cbdcb>] drm_ioctl+0x21b/0x4c0 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffffc04d4c80>] ? drm_mode_getcrtc+0x140/0x140 [drm]
Jan 18 18:36:57 latitude kernel:  [<ffffffff9526db43>] do_vfs_ioctl+0xa3/0x5f0
Jan 18 18:36:57 latitude kernel:  [<ffffffff9526e109>] SyS_ioctl+0x79/0x90
Jan 18 18:36:57 latitude kernel:  [<ffffffff9581bbf7>] entry_SYSCALL_64_fastpath+0x1a/0xa9
Jan 18 18:36:57 latitude kernel: ---[ end trace 337cd55a01ebeea7 ]---
Jan 18 18:37:07 latitude /usr/libexec/gdm-x-session[884]: (EE) intel(0): sna_mode_shutdown_crtc: invalid state found on pipe 1, disabling CRTC:30
Jan 18 18:37:07 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Jan 18 18:37:17 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Jan 18 18:37:28 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out


Jan 18 18:41:39 latitude kernel: ------------[ cut here ]------------
Jan 18 18:41:39 latitude kernel: WARNING: CPU: 0 PID: 1127 at drivers/gpu/drm/i915/intel_display.c:14191 intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 18 18:41:39 latitude kernel: pipe A vblank wait timed out
Jan 18 18:41:39 latitude kernel: Modules linked in: ccm snd_hda_codec_hdmi snd_hda_codec_idt snd_hda_codec_generic intel_rapl snd_hda_intel x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_codec kvm_intel iTCO_wdt arc4 kvm mei_wdt iT
Jan 18 18:41:39 latitude kernel: CPU: 0 PID: 1127 Comm: Xorg Not tainted 4.9.3-200.fc25.x86_64 #1
Jan 18 18:41:39 latitude kernel: Hardware name: Dell Inc. Latitude E6320/09PHH9, BIOS A19 11/14/2013
Jan 18 18:41:39 latitude kernel:  ffffac8e417e77e0 ffffffff8d3f3ddd ffffac8e417e7830 0000000000000000
Jan 18 18:41:39 latitude kernel:  ffffac8e417e7820 ffffffff8d0a202b 0000376f4de99800 0000000000000000
Jan 18 18:41:39 latitude kernel:  0000000000000000 0000000000000000 0000000000000001 ffff9f216190f000
Jan 18 18:41:39 latitude kernel: Call Trace:
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d3f3ddd>] dump_stack+0x63/0x86
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d0a202b>] __warn+0xcb/0xf0
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d0a20af>] warn_slowpath_fmt+0x5f/0x80
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d0e7054>] ? finish_wait+0x54/0x70
Jan 18 18:41:39 latitude kernel:  [<ffffffffc07768b0>] intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d0e7270>] ? prepare_to_wait_event+0x100/0x100
Jan 18 18:41:39 latitude kernel:  [<ffffffffc0776c23>] intel_atomic_commit+0x353/0x4c0 [i915]
Jan 18 18:41:39 latitude kernel:  [<ffffffffc04d255a>] ? drm_atomic_check_only+0x30a/0x590 [drm]
Jan 18 18:41:39 latitude kernel:  [<ffffffffc04d2d21>] ? drm_atomic_add_affected_connectors+0x61/0xf0 [drm]
Jan 18 18:41:39 latitude kernel:  [<ffffffffc04d2829>] drm_atomic_commit+0x49/0x50 [drm]
Jan 18 18:41:39 latitude kernel:  [<ffffffffc051bbdc>] restore_fbdev_mode+0x14c/0x270 [drm_kms_helper]
Jan 18 18:41:39 latitude kernel:  [<ffffffffc051d7b4>] drm_fb_helper_restore_fbdev_mode_unlocked+0x34/0x80 [drm_kms_helper]
Jan 18 18:41:39 latitude kernel:  [<ffffffffc051d82d>] drm_fb_helper_set_par+0x2d/0x60 [drm_kms_helper]
Jan 18 18:41:39 latitude kernel:  [<ffffffffc0790cd8>] intel_fbdev_set_par+0x18/0x70 [i915]
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d4774f6>] fb_set_var+0x236/0x460
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d406559>] ? flex_array_get_ptr+0x9/0x20
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d38caa6>] ? type_attribute_bounds_av+0x46/0x1e0
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d1c5567>] ? find_get_entries+0x177/0x2b0
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d22fac1>] ? __slab_free+0xa1/0x2a0
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d46d31f>] fbcon_blank+0x30f/0x350
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d501592>] do_unblank_screen+0xd2/0x1a0
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d4f7277>] vt_ioctl+0x507/0x12a0
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d4eb795>] tty_ioctl+0x355/0xc40
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d37aa58>] ? selinux_inode_free_security+0x58/0x70
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d29e271>] ? fsnotify_destroy_marks+0x61/0x80
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d10d45d>] ? call_rcu_sched+0x1d/0x20
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d1df53d>] ? shmem_destroy_inode+0x2d/0x40
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d26db43>] do_vfs_ioctl+0xa3/0x5f0
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d26e109>] SyS_ioctl+0x79/0x90
Jan 18 18:41:39 latitude kernel:  [<ffffffff8d81bbf7>] entry_SYSCALL_64_fastpath+0x1a/0xa9
Jan 18 18:41:39 latitude kernel: ---[ end trace 819c87644d1c8c2a ]---
Jan 18 18:41:39 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Jan 18 18:41:39 latitude /usr/libexec/gdm-x-session[1125]: (II) Server terminated successfully (0). Closing log file.

Jan 18 18:44:59 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Jan 18 18:44:59 latitude kernel: ------------[ cut here ]------------
Jan 18 18:44:59 latitude kernel: WARNING: CPU: 0 PID: 853 at drivers/gpu/drm/i915/intel_display.c:14191 intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 18 18:44:59 latitude kernel: pipe A vblank wait timed out
Jan 18 18:44:59 latitude kernel: Modules linked in: ccm snd_hda_codec_hdmi snd_hda_codec_idt snd_hda_codec_generic i915 mei_wdt intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel arc4 iTCO_wdt iTCO_vendor_support ppdev snd
Jan 18 18:44:59 latitude kernel: CPU: 0 PID: 853 Comm: Xorg Tainted: G        W       4.9.3-200.fc25.x86_64 #1
Jan 18 18:44:59 latitude kernel: Hardware name: Dell Inc. Latitude E6320/09PHH9, BIOS A19 11/14/2013
Jan 18 18:44:59 latitude kernel:  ffffb14e4136fa60 ffffffff8b3f3ddd ffffb14e4136fab0 0000000000000000
Jan 18 18:44:59 latitude kernel:  ffffb14e4136faa0 ffffffff8b0a202b 0000376f1ff22b40 0000000000000000
Jan 18 18:44:59 latitude kernel:  0000000000000000 0000000000000000 0000000000000001 ffff89341ff91000
Jan 18 18:44:59 latitude kernel: Call Trace:
Jan 18 18:44:59 latitude kernel:  [<ffffffff8b3f3ddd>] dump_stack+0x63/0x86
Jan 18 18:44:59 latitude kernel:  [<ffffffff8b0a202b>] __warn+0xcb/0xf0
Jan 18 18:44:59 latitude kernel:  [<ffffffff8b0a20af>] warn_slowpath_fmt+0x5f/0x80
Jan 18 18:44:59 latitude kernel:  [<ffffffff8b0e7054>] ? finish_wait+0x54/0x70
Jan 18 18:44:59 latitude kernel:  [<ffffffffc07168b0>] intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 18 18:44:59 latitude kernel:  [<ffffffff8b0e7270>] ? prepare_to_wait_event+0x100/0x100
Jan 18 18:44:59 latitude kernel:  [<ffffffffc0716c23>] intel_atomic_commit+0x353/0x4c0 [i915]
Jan 18 18:44:59 latitude kernel:  [<ffffffffc019855a>] ? drm_atomic_check_only+0x30a/0x590 [drm]
Jan 18 18:44:59 latitude kernel:  [<ffffffffc0197fa0>] ? drm_atomic_set_crtc_for_connector+0xc0/0xf0 [drm]
Jan 18 18:44:59 latitude kernel:  [<ffffffffc0198829>] drm_atomic_commit+0x49/0x50 [drm]
Jan 18 18:44:59 latitude kernel:  [<ffffffffc02d2c3d>] drm_atomic_helper_set_config+0x7d/0xb0 [drm_kms_helper]
Jan 18 18:44:59 latitude kernel:  [<ffffffffc018b895>] drm_mode_set_config_internal+0x65/0x110 [drm]
Jan 18 18:44:59 latitude kernel:  [<ffffffffc018d07d>] drm_mode_setcrtc+0x3fd/0x4f0 [drm]
Jan 18 18:44:59 latitude kernel:  [<ffffffffc0183dcb>] drm_ioctl+0x21b/0x4c0 [drm]
Jan 18 18:44:59 latitude kernel:  [<ffffffffc018cc80>] ? drm_mode_getcrtc+0x140/0x140 [drm]
Jan 18 18:44:59 latitude kernel:  [<ffffffff8b26db43>] do_vfs_ioctl+0xa3/0x5f0
Jan 18 18:44:59 latitude kernel:  [<ffffffff8b26e109>] SyS_ioctl+0x79/0x90
Jan 18 18:44:59 latitude kernel:  [<ffffffff8b81bbf7>] entry_SYSCALL_64_fastpath+0x1a/0xa9
Jan 18 18:44:59 latitude kernel: ---[ end trace 65ae7d3243347a21 ]---
Jan 18 18:45:09 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Comment 34 willma 2017-01-25 20:44:28 UTC
I just discovered that the workaround described in bug 93782, comment 40 also solves the flip_done timeout on my machine (kernel 4.9.5-100.fc24.x86_64). All hangs are now gone and dmesg is clean.

Just add

video=SVIDEO-1:d

to the kernel command line and give it a try.
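
A minimal Python sketch of applying and checking this workaround follows. It only manipulates the command-line string and reads /proc/cmdline; where the string is stored (for example a GRUB_CMDLINE_LINUX entry) depends on the bootloader and is an assumption left to the reader. The function names add_kernel_param and param_active are hypothetical, and the connector name SVIDEO-1 is the one from this thread, so it may differ on other machines (the names under /sys/class/drm show which connectors exist).

```python
def add_kernel_param(cmdline, param="video=SVIDEO-1:d"):
    """Return cmdline with param appended, unless that exact token is already there."""
    tokens = cmdline.split()
    if param not in tokens:
        tokens.append(param)
    return " ".join(tokens)


def param_active(param="video=SVIDEO-1:d", path="/proc/cmdline"):
    """Check whether the running kernel was booted with param."""
    with open(path) as f:
        return param in f.read().split()


# Example with a made-up existing command line:
print(add_kernel_param("quiet rhgb"))
```

After rebooting with the edited command line, param_active() should return True if the parameter actually reached the kernel.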
Comment 35 Ferenc Wágner 2017-01-27 20:59:13 UTC
I can confirm that the patch in comment #19 gets rid of the crashes (leaving the timeouts) and that the workaround in comment #34 gets rid of the timeouts as well.

I'll test drm-tip eventually.

00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (primary) (rev 03)
00:02.1 Display controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (secondary) (rev 03)
Comment 36 Kadir 2017-01-27 22:17:29 UTC
(In reply to willma from comment #34)
> I just discovered that the workaround described in bug 93782, comment 40
> also solves the flip_done timout on my machine (kernel
> 4.9.5-100.fc24.x86_64). All hangs are now gone and dmesg is clean.
> 
> Just add
> 
> video=SVIDEO-1:d
> 
> to the kernel command line and give it a try.

So far so good, haven't had any hangs/messages or slowdowns when switching TTYs yet. So it seems adding video=SVIDEO-1:d helps. Will keep an eye on it for a couple of days of uptime before I can tell if my GNU+Linux is great again :)

Let's hope a definitive fix is coming soon.
Comment 37 José JORGE 2017-01-28 08:32:27 UTC
(In reply to willma from comment #34)
> video=SVIDEO-1:d

Good catch here also. So the SVIDEO is wrongly enabled since kernels 4.8 ...
Comment 38 Kadir 2017-01-29 09:50:21 UTC
The workaround described in comment #34 does not fully solve the problem.

Today I did some further testing, based on some of the hangs/freezes I got earlier without the mentioned workaround. 

First I did:

systemctl isolate multi-user.target

and then I tried starting the GUI (X11) by running startx

And I tried logging out and thereby going back to the TTY.

I did this in total 3 times. The third time the system locked up just as before with the following log:

Jan 29 10:39:14 latitude kernel: ------------[ cut here ]------------
Jan 29 10:39:14 latitude kernel: WARNING: CPU: 2 PID: 7178 at drivers/gpu/drm/i915/intel_display.c:14180 intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 29 10:39:14 latitude kernel: pipe A vblank wait timed out
Jan 29 10:39:14 latitude kernel: Modules linked in: ccm snd_hda_codec_hdmi intel_rapl snd_hda_codec_idt snd_hda_codec_generic x86_pkg_temp_thermal inte
Jan 29 10:39:14 latitude kernel: CPU: 2 PID: 7178 Comm: Xorg Not tainted 4.9.5-200.fc25.x86_64 #1
Jan 29 10:39:14 latitude kernel: Hardware name: Dell Inc. Latitude E6320/09PHH9, BIOS A19 11/14/2013
Jan 29 10:39:14 latitude kernel:  ffffb2ca80d6f7e0 ffffffff933f40bd ffffb2ca80d6f830 0000000000000000
Jan 29 10:39:14 latitude kernel:  ffffb2ca80d6f820 ffffffff930a202b 0000376407463bc0 0000000000000000
Jan 29 10:39:14 latitude kernel:  0000000000000000 0000000000000000 0000000000000001 ffff97fc2a885000
Jan 29 10:39:14 latitude kernel: Call Trace:
Jan 29 10:39:14 latitude kernel:  [<ffffffff933f40bd>] dump_stack+0x63/0x86
Jan 29 10:39:14 latitude kernel:  [<ffffffff930a202b>] __warn+0xcb/0xf0
Jan 29 10:39:14 latitude kernel:  [<ffffffff930a20af>] warn_slowpath_fmt+0x5f/0x80
Jan 29 10:39:14 latitude kernel:  [<ffffffff930e7054>] ? finish_wait+0x54/0x70
Jan 29 10:39:14 latitude kernel:  [<ffffffffc0767920>] intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 29 10:39:14 latitude kernel:  [<ffffffff930e7270>] ? prepare_to_wait_event+0x100/0x100
Jan 29 10:39:14 latitude kernel:  [<ffffffffc0767c93>] intel_atomic_commit+0x353/0x4c0 [i915]
Jan 29 10:39:14 latitude kernel:  [<ffffffffc04f156a>] ? drm_atomic_check_only+0x30a/0x590 [drm]
Jan 29 10:39:14 latitude kernel:  [<ffffffffc04f1d31>] ? drm_atomic_add_affected_connectors+0x61/0xf0 [drm]
Jan 29 10:39:14 latitude kernel:  [<ffffffffc04f1839>] drm_atomic_commit+0x49/0x50 [drm]
Jan 29 10:39:14 latitude kernel:  [<ffffffffc053abec>] restore_fbdev_mode+0x14c/0x270 [drm_kms_helper]
Jan 29 10:39:14 latitude kernel:  [<ffffffffc053c7c4>] drm_fb_helper_restore_fbdev_mode_unlocked+0x34/0x80 [drm_kms_helper]
Jan 29 10:39:14 latitude kernel:  [<ffffffffc053c83d>] drm_fb_helper_set_par+0x2d/0x60 [drm_kms_helper]
Jan 29 10:39:14 latitude kernel:  [<ffffffffc0781d68>] intel_fbdev_set_par+0x18/0x70 [i915]
Jan 29 10:39:14 latitude kernel:  [<ffffffff93478816>] fb_set_var+0x236/0x460
Jan 29 10:39:14 latitude kernel:  [<ffffffff9322f795>] ? kmem_cache_alloc+0x195/0x1b0
Jan 29 10:39:14 latitude kernel:  [<ffffffff933750a7>] ? avc_alloc_node+0x27/0x120
Jan 29 10:39:14 latitude kernel:  [<ffffffff93406839>] ? flex_array_get_ptr+0x9/0x20
Jan 29 10:39:14 latitude kernel:  [<ffffffff9338cdb6>] ? type_attribute_bounds_av+0x46/0x1e0
Jan 29 10:39:14 latitude kernel:  [<ffffffff9346e63f>] fbcon_blank+0x30f/0x350
Jan 29 10:39:14 latitude kernel:  [<ffffffff935028d2>] do_unblank_screen+0xd2/0x1a0
Jan 29 10:39:14 latitude kernel:  [<ffffffff934f85b7>] vt_ioctl+0x507/0x12a0
Jan 29 10:39:14 latitude kernel:  [<ffffffff934ecad5>] tty_ioctl+0x355/0xc40
Jan 29 10:39:14 latitude kernel:  [<ffffffff9337ad68>] ? selinux_inode_free_security+0x58/0x70
Jan 29 10:39:14 latitude kernel:  [<ffffffff9329e581>] ? fsnotify_destroy_marks+0x61/0x80
Jan 29 10:39:14 latitude kernel:  [<ffffffff9326de03>] do_vfs_ioctl+0xa3/0x5f0
Jan 29 10:39:14 latitude kernel:  [<ffffffff9326e3c9>] SyS_ioctl+0x79/0x90
Jan 29 10:39:14 latitude kernel:  [<ffffffff9381cc77>] entry_SYSCALL_64_fastpath+0x1a/0xa9
Jan 29 10:39:14 latitude kernel: ---[ end trace 0022451fc8e72435 ]---
Jan 29 10:39:14 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Jan 29 10:39:24 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Jan 29 10:39:43 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Jan 29 10:39:43 latitude pkexec[7945]: pam_systemd(polkit-1:session): Cannot create session: Already running in a session
Jan 29 10:39:43 latitude audit[7945]: USER_START pid=7945 uid=1000 auid=1000 ses=6 subj=unconfined_u:unconfined_r:xserver_t:s0-s0:c0.c1023 msg='op=PAM:
Jan 29 10:39:43 latitude pkexec[7945]: pam_unix(polkit-1:session): session opened for user root by (uid=1000)
Jan 29 10:39:43 latitude pkexec[7945]: Kadir: Executing command [USER=root] [TTY=unknown] [CWD=/home/Kadir] [COMMAND=/usr/libexec/xf86-video-intel-back
Jan 29 10:39:47 latitude chronyd[739]: Selected source 188.166.57.207
Jan 29 10:39:53 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Jan 29 10:39:53 latitude kernel: ------------[ cut here ]------------
Jan 29 10:39:53 latitude kernel: WARNING: CPU: 2 PID: 7940 at drivers/gpu/drm/i915/intel_display.c:14180 intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 29 10:39:53 latitude kernel: pipe A vblank wait timed out
Jan 29 10:39:53 latitude kernel: Modules linked in: ccm snd_hda_codec_hdmi intel_rapl snd_hda_codec_idt snd_hda_codec_generic x86_pkg_temp_thermal inte
Jan 29 10:39:53 latitude kernel: CPU: 2 PID: 7940 Comm: Xorg Tainted: G        W       4.9.5-200.fc25.x86_64 #1
Jan 29 10:39:53 latitude kernel: Hardware name: Dell Inc. Latitude E6320/09PHH9, BIOS A19 11/14/2013
Jan 29 10:39:53 latitude kernel:  ffffb2ca80f37a60 ffffffff933f40bd ffffb2ca80f37ab0 0000000000000000
Jan 29 10:39:53 latitude kernel:  ffffb2ca80f37aa0 ffffffff930a202b 00003764f6f14540 0000000000000000
Jan 29 10:39:53 latitude kernel:  0000000000000000 0000000000000000 0000000000000001 ffff97fc2a885000
Jan 29 10:39:53 latitude kernel: Call Trace:
Jan 29 10:39:53 latitude kernel:  [<ffffffff933f40bd>] dump_stack+0x63/0x86
Jan 29 10:39:53 latitude kernel:  [<ffffffff930a202b>] __warn+0xcb/0xf0
Jan 29 10:39:53 latitude kernel:  [<ffffffff930a20af>] warn_slowpath_fmt+0x5f/0x80
Jan 29 10:39:53 latitude kernel:  [<ffffffff930e7054>] ? finish_wait+0x54/0x70
Jan 29 10:39:53 latitude kernel:  [<ffffffffc0767920>] intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 29 10:39:53 latitude kernel:  [<ffffffff930e7270>] ? prepare_to_wait_event+0x100/0x100
Jan 29 10:39:53 latitude kernel:  [<ffffffffc0767c93>] intel_atomic_commit+0x353/0x4c0 [i915]
Jan 29 10:39:53 latitude kernel:  [<ffffffffc04f156a>] ? drm_atomic_check_only+0x30a/0x590 [drm]
Jan 29 10:39:53 latitude kernel:  [<ffffffffc04f0fb0>] ? drm_atomic_set_crtc_for_connector+0xc0/0xf0 [drm]
Jan 29 10:39:53 latitude kernel:  [<ffffffffc04f1839>] drm_atomic_commit+0x49/0x50 [drm]
Jan 29 10:39:53 latitude kernel:  [<ffffffffc0538c4d>] drm_atomic_helper_set_config+0x7d/0xb0 [drm_kms_helper]
Jan 29 10:39:53 latitude kernel:  [<ffffffffc04e48a5>] drm_mode_set_config_internal+0x65/0x110 [drm]
Jan 29 10:39:53 latitude kernel:  [<ffffffffc04e608d>] drm_mode_setcrtc+0x3fd/0x4f0 [drm]
Jan 29 10:39:53 latitude kernel:  [<ffffffffc04dcdcb>] drm_ioctl+0x21b/0x4c0 [drm]
Jan 29 10:39:53 latitude kernel:  [<ffffffff931fa3e5>] ? do_wp_page+0x105/0x870
Jan 29 10:39:53 latitude kernel:  [<ffffffffc04e5c90>] ? drm_mode_getcrtc+0x140/0x140 [drm]
Jan 29 10:39:53 latitude kernel:  [<ffffffff9326de03>] do_vfs_ioctl+0xa3/0x5f0
Jan 29 10:39:53 latitude kernel:  [<ffffffff9326e3c9>] SyS_ioctl+0x79/0x90
Jan 29 10:39:53 latitude kernel:  [<ffffffff9381cc77>] entry_SYSCALL_64_fastpath+0x1a/0xa9
Jan 29 10:39:53 latitude kernel: ---[ end trace 0022451fc8e72436 ]---
Jan 29 10:40:03 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out

So the workaround does not fully fix the issue for me.
Comment 39 Kadir 2017-01-29 10:33:06 UTC
I just noticed that after a suspend and resume, a simple logout and back to GDM completely froze the system. I am on Fedora 25 4.9.5-200.fc25.x86_64

Jan 29 11:25:09 latitude kernel:  [<ffffffff870a202b>] __warn+0xcb/0xf0
Jan 29 11:25:09 latitude kernel:  [<ffffffff870a20af>] warn_slowpath_fmt+0x5f/0x80
Jan 29 11:25:09 latitude kernel:  [<ffffffff870e7054>] ? finish_wait+0x54/0x70
Jan 29 11:25:09 latitude kernel:  [<ffffffffc0390920>] intel_atomic_commit_tail+0xfd0/0xff0 [i915]
Jan 29 11:25:09 latitude kernel:  [<ffffffff870e7270>] ? prepare_to_wait_event+0x100/0x100
Jan 29 11:25:09 latitude kernel:  [<ffffffffc0390c93>] intel_atomic_commit+0x353/0x4c0 [i915]
Jan 29 11:25:09 latitude kernel:  [<ffffffffc017756a>] ? drm_atomic_check_only+0x30a/0x590 [drm]
Jan 29 11:25:09 latitude kernel:  [<ffffffffc0177d31>] ? drm_atomic_add_affected_connectors+0x61/0xf0 [drm]
Jan 29 11:25:09 latitude kernel:  [<ffffffffc0177839>] drm_atomic_commit+0x49/0x50 [drm]
Jan 29 11:25:09 latitude kernel:  [<ffffffffc01ffbec>] restore_fbdev_mode+0x14c/0x270 [drm_kms_helper]
Jan 29 11:25:09 latitude kernel:  [<ffffffffc02017c4>] drm_fb_helper_restore_fbdev_mode_unlocked+0x34/0x80 [drm_kms_helper]
Jan 29 11:25:09 latitude kernel:  [<ffffffffc020183d>] drm_fb_helper_set_par+0x2d/0x60 [drm_kms_helper]
Jan 29 11:25:09 latitude kernel:  [<ffffffffc03aad68>] intel_fbdev_set_par+0x18/0x70 [i915]
Jan 29 11:25:09 latitude kernel:  [<ffffffff87478816>] fb_set_var+0x236/0x460
Jan 29 11:25:09 latitude kernel:  [<ffffffff87406839>] ? flex_array_get_ptr+0x9/0x20
Jan 29 11:25:09 latitude kernel:  [<ffffffff8738cdb6>] ? type_attribute_bounds_av+0x46/0x1e0
Jan 29 11:25:09 latitude kernel:  [<ffffffff871c55c7>] ? find_get_entries+0x177/0x2b0
Jan 29 11:25:09 latitude kernel:  [<ffffffff8746e63f>] fbcon_blank+0x30f/0x350
Jan 29 11:25:09 latitude kernel:  [<ffffffff875028d2>] do_unblank_screen+0xd2/0x1a0
Jan 29 11:25:09 latitude kernel:  [<ffffffff874f85b7>] vt_ioctl+0x507/0x12a0
Jan 29 11:25:09 latitude kernel:  [<ffffffff874ecad5>] tty_ioctl+0x355/0xc40
Jan 29 11:25:09 latitude kernel:  [<ffffffff8737ad68>] ? selinux_inode_free_security+0x58/0x70
Jan 29 11:25:09 latitude kernel:  [<ffffffff8729e581>] ? fsnotify_destroy_marks+0x61/0x80
Jan 29 11:25:09 latitude kernel:  [<ffffffff8710d45d>] ? call_rcu_sched+0x1d/0x20
Jan 29 11:25:09 latitude kernel:  [<ffffffff871df69d>] ? shmem_destroy_inode+0x2d/0x40
Jan 29 11:25:09 latitude kernel:  [<ffffffff8726de03>] do_vfs_ioctl+0xa3/0x5f0
Jan 29 11:25:09 latitude kernel:  [<ffffffff8726e3c9>] SyS_ioctl+0x79/0x90
Jan 29 11:25:09 latitude kernel:  [<ffffffff8781cc77>] entry_SYSCALL_64_fastpath+0x1a/0xa9
Jan 29 11:25:09 latitude kernel: ---[ end trace 1506db7661e4884c ]---
Jan 29 11:25:09 latitude kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out
Comment 40 Ferenc Wágner 2017-01-29 15:10:14 UTC
I tested commit 8c3608f from drm-tip. During a plain bootup into lightdm, it gave several WARNING backtraces like:

> [   12.356033] ------------[ cut here ]------------
> [   12.356081] WARNING: CPU: 1 PID: 5 at drivers/gpu/drm/drm_irq.c:1199 drm_wait_one_vblank+0x154/0x1a0 [drm]
> [   12.356090] vblank wait timed out on crtc 0
> [   12.356096] Modules linked in: hid_generic(E) usbhid(E) hid(E) arc4(E) psmouse(E) ahci(E) ata_piix(E) libahci(E) libata(E) scsi_mod(E) ath5k(E) mac80211(E) ath(E) cfg80211(E) rfkill(E) tg3(E) ptp(E) pps_core(E) libphy(E) thermal(E) i915(E) i2c_algo_bit(E) drm_kms_helper(E) ehci_pci(E) uhci_hcd(E) ehci_hcd(E) fjes(E) video(E) button(E) usbcore(E) drm(E)
> [   12.356148] CPU: 1 PID: 5 Comm: kworker/u4:0 Tainted: G            E   4.10.0-rc5+ #1
> [   12.356158] Hardware name: Acer            Aspire 2920                    /Calado                         , BIOS V1.13           02/14/2008
> [   12.356172] Workqueue: events_unbound async_run_entry_fn
> [   12.356179] Call Trace:
> [   12.356189]  ? dump_stack+0x5c/0x77
> [   12.356196]  ? __warn+0xc4/0xe0
> [   12.356202]  ? warn_slowpath_fmt+0x5f/0x80
> [   12.356209]  ? finish_wait+0x3c/0x80
> [   12.356230]  ? drm_wait_one_vblank+0x154/0x1a0 [drm]
> [   12.356236]  ? remove_wait_queue+0x60/0x60
> [   12.356322]  ? intel_get_load_detect_pipe+0x5a8/0x610 [i915]
> [   12.356375]  ? intel_tv_detect+0x156/0x520 [i915]
> [   12.356390]  ? drm_helper_probe_single_connector_modes+0x2bb/0x510 [drm_kms_helper]
> [   12.356407]  ? drm_setup_crtcs+0x7d/0xa10 [drm_kms_helper]
> [   12.356415]  ? check_preempt_wakeup+0xeb/0x200
> [   12.356421]  ? sched_clock_cpu+0x41/0x90
> [   12.356434]  ? drm_fb_helper_initial_config+0x79/0x400 [drm_kms_helper]
> [   12.356441]  ? ttwu_do_wakeup+0x14/0xe0
> [   12.356494]  ? intel_fbdev_initial_config+0x14/0x30 [i915]
> [   12.356501]  ? async_run_entry_fn+0x34/0x160
> [   12.356508]  ? process_one_work+0x15e/0x420
> [   12.356514]  ? worker_thread+0x65/0x4b0
> [   12.356520]  ? rescuer_thread+0x390/0x390
> [   12.356526]  ? kthread+0x104/0x140
> [   12.356532]  ? kthread_park+0x80/0x80
> [   12.356540]  ? ret_from_fork+0x26/0x40
> [   12.356546] ---[ end trace 591a4980f651c4a0 ]---
> [   12.502499] ------------[ cut here ]------------

and also an *ERROR* (followed by some more similar backtraces):

> [  338.652100] [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:29:pipe A] flip_done timed out
> [  338.755173] ------------[ cut here ]------------
> [  338.755231] WARNING: CPU: 1 PID: 1751 at drivers/gpu/drm/drm_irq.c:1199 drm_wait_one_vblank+0x154/0x1a0 [drm]
> [  338.755237] vblank wait timed out on crtc 0
> [  338.755241] Modules linked in: ib_iser(E) rdma_cm(E) iw_cm(E) ib_cm(E) ib_core(E) configfs(E) ext4(E) crc16(E) jbd2(E) fscrypto(E) mbcache(E) uvcvideo(E) videobuf2_vmalloc(E) videobuf2_memops(E) videobuf2_v4l2(E) videobuf2_core(E) videodev(E) media(E) iTCO_wdt(E) iTCO_vendor_support(E) snd_hda_codec_hdmi(E) snd_hda_codec_realtek(E) snd_hda_codec_generic(E) snd_hda_intel(E) snd_hda_codec(E) snd_hda_core(E) snd_hwdep(E) snd_pcm(E) snd_timer(E) coretemp(E) snd(E) i2c_i801(E) joydev(E) soundcore(E) pcspkr(E) lpc_ich(E) mfd_core(E) serio_raw(E) ac(E) battery(E) evdev(E) acpi_cpufreq(E) shpchp(E) tpm_tis(E) tpm_tis_core(E) tpm(E) fuse(E) parport_pc(E) ppdev(E) lp(E) parport(E) autofs4(E) ecb(E) crypto_simd(E) glue_helper(E) cryptd(E) aes_x86_64(E) xts(E) gf128mul(E) algif_skcipher(E) af_alg(E) dm_crypt(E)
> [  338.755355]  xfs(E) crc32c_generic(E) libcrc32c(E) dm_round_robin(E) iscsi_tcp(E) libiscsi_tcp(E) libiscsi(E) scsi_transport_iscsi(E) iscsi_ibft(E) iscsi_boot_sysfs(E) virtio_pci(E) virtio_net(E) virtio_ring(E) virtio(E) ctr(E) ccm(E) dm_service_time(E) dm_multipath(E) nf_conntrack_ipv4(E) nf_defrag_ipv4(E) nf_conntrack(E) dm_mod(E) sg(E) sd_mod(E) sr_mod(E) cdrom(E) ata_generic(E) hid_generic(E) usbhid(E) hid(E) arc4(E) psmouse(E) ahci(E) ata_piix(E) libahci(E) libata(E) scsi_mod(E) ath5k(E) mac80211(E) ath(E) cfg80211(E) rfkill(E) tg3(E) ptp(E) pps_core(E) libphy(E) thermal(E) i915(E) i2c_algo_bit(E) drm_kms_helper(E) ehci_pci(E) uhci_hcd(E) ehci_hcd(E) fjes(E) video(E) button(E) usbcore(E) drm(E)
> [  338.755463] CPU: 1 PID: 1751 Comm: Xorg Tainted: G        W   E   4.10.0-rc5+ #1
> [  338.755468] Hardware name: Acer            Aspire 2920                    /Calado                         , BIOS V1.13           02/14/2008
> [  338.755473] Call Trace:
> [  338.755487]  ? dump_stack+0x5c/0x77
> [  338.755494]  ? __warn+0xc4/0xe0
> [  338.755500]  ? warn_slowpath_fmt+0x5f/0x80
> [  338.755508]  ? finish_wait+0x3c/0x80
> [  338.755543]  ? drm_wait_one_vblank+0x154/0x1a0 [drm]
> [  338.755550]  ? remove_wait_queue+0x60/0x60
> [  338.755633]  ? intel_get_load_detect_pipe+0x5a8/0x610 [i915]
> [  338.755702]  ? intel_tv_detect+0x156/0x520 [i915]
> [  338.755730]  ? drm_helper_probe_single_connector_modes+0x2bb/0x510 [drm_kms_helper]
> [  338.755769]  ? drm_mode_getconnector+0x2f0/0x320 [drm]
> [  338.755804]  ? drm_ioctl+0x200/0x430 [drm]
> [  338.755843]  ? drm_mode_connector_property_set_ioctl+0x60/0x60 [drm]
> [  338.755957]  ? xfs_file_write_iter+0x10b/0x150 [xfs]
> [  338.755966]  ? do_vfs_ioctl+0x9b/0x600
> [  338.755973]  ? vfs_write+0x163/0x1a0
> [  338.755979]  ? SyS_ioctl+0x76/0x90
> [  338.755987]  ? entry_SYSCALL_64_fastpath+0x1e/0xad
> [  338.755993] ---[ end trace 591a4980f651c4a3 ]---
> [  349.148087] [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:29:pipe A] flip_done timed out
> [  349.252053] ------------[ cut here ]------------

So it looks like this regression isn't fixed in drm-tip, although the hardening patch from comment #19 (24835e44) helps to avoid hard crashes.
Please tell me if I can help by providing further information or testing.
Comment 41 Jim Rees 2017-02-01 01:46:55 UTC
Created attachment 129266 [details] [review]
4.10.0-rc6: drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking"

This is the patch to revert ea0000f0 "Roll out the helper nonblock tracking", updated to apply to 4.10.0-rc6.
Comment 42 Jari Tahvanainen 2017-02-03 11:28:18 UTC
Rami - please check if you can reproduce and push this forward with developers.
Comment 43 Chris Down 2017-02-08 23:20:29 UTC
What's the status on pushing out this revert? Can we please aim to get this into 4.10?

I've been running for 2 weeks on 4.9 with the patch in comment #20, and the hard lockups have totally gone. This is certainly way, way better than the current stock experience, which results in my T460s entirely locking up daily.
Comment 44 Chris Down 2017-02-09 12:46:36 UTC
Created attachment 129434 [details]
/sys/class/drm/card0/error

Less than a day after I made that comment I got a hard lockup, typical :-(

/sys/class/drm/card0/error is attached. No messages about flip_done timing out now, instead I get this:

[ 9945.797615] [drm] GPU HANG: ecode 9:0:0xfffffffe, in Xorg [1858], reason: Hang on render ring, action: reset
[...]
[ 9945.797628] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[ 9945.797708] drm/i915: Resetting chip after gpu hang
[ 9945.800332] [drm] RC6 on
[ 9945.813539] [drm] GuC firmware load skipped
Comment 45 Daniel Vetter 2017-02-28 09:50:17 UTC
(In reply to Chris Down from comment #44)
> Created attachment 129434 [details]
> /sys/class/drm/card0/error
> 
> Less than a day after I made that comment I got a hard lockup, typical :-(
> 
> /sys/class/drm/card0/error is attached. No messages about flip_done timing
> out now, instead I get this:
> 
> [ 9945.797615] [drm] GPU HANG: ecode 9:0:0xfffffffe, in Xorg [1858], reason:
> Hang on render ring, action: reset
> [...]
> [ 9945.797628] [drm] GPU crash dump saved to /sys/class/drm/card0/error
> [ 9945.797708] drm/i915: Resetting chip after gpu hang
> [ 9945.800332] [drm] RC6 on
> [ 9945.813539] [drm] GuC firmware load skipped

This is unrelated to the bug here. It might explain your hard lockup though, so please file a new bug report with all the details.
Comment 46 Daniel Vetter 2017-02-28 10:11:32 UTC
Created attachment 129979 [details] [review]
Don't fall over flip_done failures that hard

Another hack on top of the already merged hack. Please make sure you have the referenced patch, so either use drm-tip or apply both patches.

Again, it won't fix the stalls, but it should help with the full freezes.
Comment 47 Ferenc Wágner 2017-03-09 12:49:32 UTC
I've been running with drm-tip+"Don't fall over flip_done failures that hard" (4.10.0+) for a couple of days, and didn't notice any change: there are stalls but no freezes, as advertised.
Comment 48 Jim Rees 2017-03-12 04:33:14 UTC
Created attachment 130168 [details] [review]
4.11.0-rc1: drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking"

This is the patch to revert ea0000f0 "Roll out the helper nonblock tracking", updated to apply to 4.11.0-rc1.
Comment 49 Jari Tahvanainen 2017-03-17 12:42:12 UTC
You can follow this on patchwork: https://patchwork.freedesktop.org/patch/124229/
Comment 50 Jari Tahvanainen 2017-03-17 12:43:01 UTC
link to series https://patchwork.freedesktop.org/series/16022/
Comment 51 Daniel Vetter 2017-03-21 11:03:19 UTC
We can't apply the revert because the entire atomic house will come crashing down on us.

But Maarten fixed another potential oops with nonblocking commits with his atomic iterator patches. Those have all now landed in drm-tip. We need to retest; if that's OK, we can figure out how to backport the entire pile (or apply the revert just to some old stable kernels).
Comment 52 Ferenc Wágner 2017-03-21 12:56:49 UTC
Daniel, what exactly shall we test? Current drm-tip with or without attachment 129979 [details] [review] (Don't fall over flip_done failures that hard)?  I haven't seen oopses for long, but the stalls with long timeouts are there on every reboot or mode switch.
Comment 53 Kadir 2017-03-28 08:36:13 UTC
Just updated to Fedora kernel 4.10.5-200.fc25.x86_64.

Until kernel 4.9.XX it said:

[drm_kms_helper]] *ERROR* [CRTC:26:pipe A] flip_done timed out

now it says:

[drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:31:pipe A] flip_done timed out

I still get the timeouts, journalctl says:

Mar 28 10:25:05 elif kernel: ------------[ cut here ]------------
Mar 28 10:25:05 elif kernel: WARNING: CPU: 2 PID: 839 at drivers/gpu/drm/i915/intel_display.c:14189 intel_atomic_commit_tail+0xf97/0xfc0
Mar 28 10:25:05 elif kernel: pipe A vblank wait timed out
Mar 28 10:25:05 elif kernel: Modules linked in: ccm snd_hda_codec_hdmi snd_hda_codec_idt snd_hda_codec_generic i915 intel_rapl x86_pkg_t
Mar 28 10:25:05 elif kernel: CPU: 2 PID: 839 Comm: Xorg Not tainted 4.10.5-200.fc25.x86_64 #1
Mar 28 10:25:05 elif kernel: Hardware name: Dell Inc. Latitude E6320/09PHH9, BIOS A19 11/14/2013
Mar 28 10:25:05 elif kernel: Call Trace:
Mar 28 10:25:05 elif kernel:  dump_stack+0x63/0x86
Mar 28 10:25:05 elif kernel:  __warn+0xcb/0xf0
Mar 28 10:25:05 elif kernel:  warn_slowpath_fmt+0x5f/0x80
Mar 28 10:25:05 elif kernel:  ? finish_wait+0x67/0x80
Mar 28 10:25:05 elif kernel:  intel_atomic_commit_tail+0xf97/0xfc0 [i915]
Mar 28 10:25:05 elif kernel:  ? __switch_to+0x227/0x460
Mar 28 10:25:05 elif kernel:  ? remove_wait_queue+0x70/0x70
Mar 28 10:25:05 elif kernel:  intel_atomic_commit+0x3cb/0x4f0 [i915]
Mar 28 10:25:05 elif kernel:  drm_atomic_commit+0x4b/0x50 [drm]
Mar 28 10:25:05 elif kernel:  restore_fbdev_mode+0x14c/0x2a0 [drm_kms_helper]
Mar 28 10:25:05 elif kernel:  drm_fb_helper_restore_fbdev_mode_unlocked+0x34/0x80 [drm_kms_helper]
Mar 28 10:25:05 elif kernel:  drm_fb_helper_set_par+0x2d/0x60 [drm_kms_helper]
Mar 28 10:25:05 elif kernel:  intel_fbdev_set_par+0x18/0x70 [i915]
Mar 28 10:25:05 elif kernel:  fb_set_var+0x236/0x460
Mar 28 10:25:05 elif kernel:  ? kmem_cache_alloc+0x195/0x1b0
Mar 28 10:25:05 elif kernel:  ? avc_alloc_node+0x27/0x120
Mar 28 10:25:05 elif kernel:  ? flex_array_get_ptr+0x9/0x20
Mar 28 10:25:05 elif kernel:  ? type_attribute_bounds_av+0x46/0x1e0
Mar 28 10:25:05 elif kernel:  fbcon_blank+0x30f/0x350
Mar 28 10:25:05 elif kernel:  do_unblank_screen+0xd2/0x1a0
Mar 28 10:25:05 elif kernel:  vt_ioctl+0x507/0x12a0
Mar 28 10:25:05 elif kernel:  tty_ioctl+0x355/0xc40
Mar 28 10:25:05 elif kernel:  ? selinux_inode_free_security+0x6d/0x80
Mar 28 10:25:05 elif kernel:  ? fsnotify_destroy_marks+0x61/0x80
Mar 28 10:25:05 elif kernel:  ? call_rcu_sched+0x1d/0x20
Mar 28 10:25:05 elif kernel:  do_vfs_ioctl+0xa3/0x5f0
Mar 28 10:25:05 elif kernel:  SyS_ioctl+0x79/0x90
Mar 28 10:25:05 elif kernel:  do_syscall_64+0x67/0x180
Mar 28 10:25:05 elif kernel:  entry_SYSCALL64_slow_path+0x25/0x25
Mar 28 10:25:05 elif kernel: RIP: 0033:0x7f5ac7f87787
Mar 28 10:25:05 elif kernel: RSP: 002b:00007ffc5e2587b8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Mar 28 10:25:05 elif kernel: RAX: ffffffffffffffda RBX: 000000000082c800 RCX: 00007f5ac7f87787
Mar 28 10:25:05 elif kernel: RDX: 0000000000000000 RSI: 0000000000004b3a RDI: 000000000000000a
Mar 28 10:25:05 elif kernel: RBP: 000000000084b698 R08: 0000000001150a50 R09: 0000000001158e00
Mar 28 10:25:05 elif kernel: R10: 00007ffc5e258750 R11: 0000000000000246 R12: 000000000084b6a0
Mar 28 10:25:05 elif kernel: R13: 000000000084b6d8 R14: 000000000084f818 R15: 0000000000830bd8
Mar 28 10:25:05 elif kernel: ---[ end trace 4ca50fbca84bf134 ]---
Mar 28 10:25:05 elif kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:31:pipe A] flip_done timed out
Mar 28 10:25:25 elif kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:31:pipe A] flip_done timed out
Mar 28 10:25:25 elif kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:31:pipe A] flip_done timed out
Mar 28 10:25:38 elif kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:31:pipe A] flip_done timed out
Comment 54 Daniel Vetter 2017-03-28 08:42:48 UTC
(In reply to Ferenc Wágner from comment #52)
> Daniel, what exactly shall we test? Current drm-tip with or without
> attachment 129979 [details] [review] (Don't fall over flip_done failures
> that hard)?  I haven't seen oopses for long, but the stalls with long
> timeouts are there on every reboot or mode switch.

drm-tip has all current patches.
Comment 55 Ferenc Wágner 2017-03-28 10:31:10 UTC
OK, compiled and booted the 2017y-03m-28d-08h-54m-35s UTC integration manifest. There are several hangs during bootup:

> [   11.996006] [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:30:pipe A] flip_done timed out
> [   12.100021] ------------[ cut here ]------------
> [   12.100021] WARNING: CPU: 0 PID: 5 at drivers/gpu/drm/drm_irq.c:1242 drm_wait_one_vblank+0x154/0x1a0 [drm]
> [   12.100021] vblank wait timed out on crtc 0
> [   12.100021] Modules linked in: hid_generic(E) usbhid(E) hid(E) arc4(E) psmouse(E) ata_piix(E) ahci(E) libahci(E) libata(E) scsi_mod(E) ath5k(E) mac80211(E) ath(E) cfg80211(E) rfkill(E) i915(E) tg3(E) ptp(E) pps_core(E) prime_numbers(E) libphy(E) i2c_algo_bit(E) drm_kms_helper(E) thermal(E) uhci_hcd(E) ehci_pci(E) ehci_hcd(E) video(E) drm(E) button(E) usbcore(E)
> [   12.100021] CPU: 0 PID: 5 Comm: kworker/u4:0 Tainted: G            E   4.11.0-rc4+ #3
> [   12.100021] Hardware name: Acer            Aspire 2920                    /Calado                         , BIOS V1.13           02/14/2008
> [   12.100021] Workqueue: events_unbound async_run_entry_fn
> [   12.100021] Call Trace:
> [   12.100021]  ? dump_stack+0x5c/0x77
> [   12.100021]  ? __warn+0xc4/0xe0
> [   12.100021]  ? warn_slowpath_fmt+0x5f/0x80
> [   12.100021]  ? finish_wait+0x3c/0x80
> [   12.100021]  ? drm_wait_one_vblank+0x154/0x1a0 [drm]
> [   12.100021]  ? remove_wait_queue+0x60/0x60
> [   12.100021]  ? intel_get_load_detect_pipe+0x5ea/0x640 [i915]
> [   12.100021]  ? intel_tv_detect+0x156/0x520 [i915]
> [   12.100021]  ? drm_helper_probe_single_connector_modes+0x2fc/0x550 [drm_kms_helper]
> [   12.100021]  ? drm_setup_crtcs+0x7d/0xa10 [drm_kms_helper]
> [   12.100021]  ? check_preempt_wakeup+0xeb/0x200
> [   12.100021]  ? drm_fb_helper_initial_config+0x79/0x420 [drm_kms_helper]
> [   12.100021]  ? try_to_wake_up+0x54/0x460
> [   12.100021]  ? intel_fbdev_initial_config+0x14/0x30 [i915]
> [   12.100021]  ? async_run_entry_fn+0x34/0x160
> [   12.100021]  ? process_one_work+0x15e/0x420
> [   12.100021]  ? worker_thread+0x65/0x4b0
> [   12.100021]  ? rescuer_thread+0x390/0x390
> [   12.100021]  ? kthread+0x104/0x140
> [   12.100021]  ? kthread_park+0x80/0x80
> [   12.100021]  ? ret_from_fork+0x26/0x40
> [   12.100021] ---[ end trace 4fb69e1dcd9a9df7 ]---
> [   12.235090] ------------[ cut here ]------------
> [   12.235090] WARNING: CPU: 0 PID: 5 at drivers/gpu/drm/drm_irq.c:1242 drm_wait_one_vblank+0x154/0x1a0 [drm]
> [   12.235090] vblank wait timed out on crtc 0
> [   12.235090] Modules linked in: hid_generic(E) usbhid(E) hid(E) arc4(E) psmouse(E) ata_piix(E) ahci(E) libahci(E) libata(E) scsi_mod(E) ath5k(E) mac80211(E) ath(E) cfg80211(E) rfkill(E) i915(E) tg3(E) ptp(E) pps_core(E) prime_numbers(E) libphy(E) i2c_algo_bit(E) drm_kms_helper(E) thermal(E) uhci_hcd(E) ehci_pci(E) ehci_hcd(E) video(E) drm(E) button(E) usbcore(E)
> [   12.235090] CPU: 0 PID: 5 Comm: kworker/u4:0 Tainted: G        W   E   4.11.0-rc4+ #3
> [   12.235090] Hardware name: Acer            Aspire 2920                    /Calado                         , BIOS V1.13           02/14/2008
> [   12.235090] Workqueue: events_unbound async_run_entry_fn
> [   12.235090] Call Trace:
> [   12.235090]  ? dump_stack+0x5c/0x77
> [   12.235090]  ? __warn+0xc4/0xe0
> [   12.235090]  ? warn_slowpath_fmt+0x5f/0x80
> [   12.235090]  ? finish_wait+0x3c/0x80
> [   12.235090]  ? drm_wait_one_vblank+0x154/0x1a0 [drm]
> [   12.235090]  ? remove_wait_queue+0x60/0x60
> [   12.235090]  ? intel_pre_plane_update+0x10c/0x190 [i915]
> [   12.235090]  ? intel_atomic_commit_tail+0x9f/0xed0 [i915]
> [   12.235090]  ? __queue_work+0x13c/0x440
> [   12.235090]  ? intel_atomic_commit+0x452/0x4f0 [i915]
> [   12.235090]  ? intel_release_load_detect_pipe+0x58/0xa0 [i915]
> [   12.235090]  ? intel_tv_detect+0x374/0x520 [i915]
> [   12.235090]  ? drm_helper_probe_single_connector_modes+0x2fc/0x550 [drm_kms_helper]
> [   12.235090]  ? drm_setup_crtcs+0x7d/0xa10 [drm_kms_helper]
> [   12.235090]  ? check_preempt_wakeup+0xeb/0x200
> [   12.235090]  ? drm_fb_helper_initial_config+0x79/0x420 [drm_kms_helper]
> [   12.235090]  ? try_to_wake_up+0x54/0x460
> [   12.235090]  ? intel_fbdev_initial_config+0x14/0x30 [i915]
> [   12.235090]  ? async_run_entry_fn+0x34/0x160
> [   12.235090]  ? process_one_work+0x15e/0x420
> [   12.235090]  ? worker_thread+0x65/0x4b0
> [   12.235090]  ? rescuer_thread+0x390/0x390
> [   12.235090]  ? kthread+0x104/0x140
> [   12.235090]  ? kthread_park+0x80/0x80
> [   12.235090]  ? ret_from_fork+0x26/0x40
> [   12.235090] ---[ end trace 4fb69e1dcd9a9df8 ]---
> [   12.295126] fbcon: inteldrmfb (fb0) is primary device
> [   13.189124] Console: switching to colour frame buffer device 160x50
> [   13.208494] i915 0000:00:02.0: fb0: inteldrmfb frame buffer device

and several more which I don't include here, but each has drm_helper_probe_single_connector_modes and intel_tv_detect in its call trace.
Please tell me if there's anything more I could do here.
Comment 56 Maarten Lankhorst 2017-04-11 14:46:42 UTC
The tv out issue is separate and has its own bug: https://bugs.freedesktop.org/show_bug.cgi?id=93782

Please use the workaround listed there so we can concentrate on this bug here. :)
Comment 57 Jani Saarinen 2017-04-26 06:41:32 UTC
Are the issues still being seen here actually _for this bug_?
Comment 58 Ricardo 2017-05-09 17:16:41 UTC
Adding tag to the "Whiteboard" field: ReadyForDev
The bug is still active
*Status is correct
*Platform is included
*Feature is included
*Priority and Severity correctly set
*Logs included
Comment 59 Jim Rees 2017-05-09 17:49:00 UTC
Created attachment 131280 [details] [review]
4.11.0: drm/i915: Revert ea0000f0 "Roll out the helper nonblock tracking"

This is the patch to revert ea0000f0 "Roll out the helper nonblock tracking", updated to apply to 4.11.0.
Comment 60 Daniel Vetter 2017-05-18 08:55:50 UTC
Ok, the original report was about the Oops, i.e. the backtrace containing:

[  179.786889] Oops: 0000 [#1] SMP

I think those are gone now since the refcount patch was merged. If that's not the case, then please pipe up.
Comment 61 Maarten Lankhorst 2017-05-18 08:59:10 UTC
Can this still be reproduced with the current drm-tip? If so, what are the steps needed to reproduce this?
Comment 62 Maarten Lankhorst 2017-05-18 09:08:26 UTC
This should have been fixed in v4.11

commit 24835e442f289813aa568d142a755672a740503c
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Wed Dec 21 11:23:30 2016 +0100

    drm: reference count event->completion

Since that commit explicitly mentions this bug, I think it's best to close this bug now.
Comment 63 Christopher Chavez 2017-05-22 10:56:49 UTC
(In reply to Maarten Lankhorst from comment #62)
> This should have been fixed in v4.11
> 
> commit 24835e442f289813aa568d142a755672a740503c
> Author: Daniel Vetter <daniel.vetter@ffwll.ch>
> Date:   Wed Dec 21 11:23:30 2016 +0100
> 
>     drm: reference count event->completion
> 
> Since that commit explicitly mentions this bug, I think it's best to close
> this bug now.

I'm still getting "flip_done timed out" in 4.11. I haven't tried drm-tip though. Should I file a new issue or reopen this one?
Comment 64 Christopher Chavez 2017-05-22 11:08:15 UTC
(In reply to Christopher Chavez from comment #63)
> I'm still getting "flip_done timed out" in 4.11. I haven't tried drm-tip
> though. Should I file a new issue or reopen this one?

Forgot to mention steps for reproducing. I can think of several ways: commands like `xrandr --query`, programs such as mate-display-properties, and switching to/from a graphical VT are each enough to trigger it.
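
When comparing how often each reproduction step triggers the message, it may help to tally the timeouts per CRTC from a saved kernel log. The following is only a rough sketch: the function name `count_flip_done_timeouts` is made up for illustration, and the pattern assumes the message body looks like the `[CRTC:31:pipe A] flip_done timed out` lines quoted in this thread.

```python
import re
from collections import Counter

# Assumed message shape, copied from the log lines quoted in this thread:
#   "*ERROR* [CRTC:31:pipe A] flip_done timed out"
FLIP_DONE_RE = re.compile(r"\[CRTC:(\d+):pipe ([A-Z])\] flip_done timed out")


def count_flip_done_timeouts(log_text):
    """Return a Counter keyed by (crtc_id, pipe) with one count per timeout line.

    Hypothetical helper for illustration; it only looks at the text it is given.
    """
    counts = Counter()
    for line in log_text.splitlines():
        match = FLIP_DONE_RE.search(line)
        if match:
            counts[(int(match.group(1)), match.group(2))] += 1
    return counts
```

The input could be, for example, kernel messages saved to a file before and after running one of the steps above; comparing the two tallies shows whether that step added new timeouts.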
Comment 65 Diego Viola 2017-05-31 02:30:55 UTC
Created attachment 131591 [details]
dmesg

I'm having a similar issue here.

OS: Arch Linux (x86_64)

00:02.0 VGA compatible controller: Intel Corporation 4 Series Chipset Integrated Graphics Controller (rev 03)

mesa 17.1.0-1

Linux myhost 4.11.3-1-ARCH #1 SMP PREEMPT Sun May 28 10:40:17 CEST 2017 x86_64 GNU/Linux

I was playing a game (NFSIISE) when I got this. I remember switching the game to windowed mode and then tiling it to the right (I use i3wm); at that point my machine crashed and I had to do a hard reboot.

Please see the dmesg I'm attaching with the information about the crash.

If you think my issue is different, please let me know and I'll open a different bug report.
Comment 66 Diego Viola 2017-05-31 10:41:54 UTC
I also see this in my journal:

May 30 21:00:53 myhost kernel: ------------[ cut here ]------------
May 30 21:00:53 myhost kernel: WARNING: CPU: 1 PID: 362 at drivers/gpu/drm/i915/intel_display.c:14229 intel_atomic_commit_tail+0xfd5/0xfe0 [i915]
May 30 21:00:53 myhost kernel: pipe A vblank wait timed out
May 30 21:00:53 myhost kernel: Modules linked in: uas usb_storage fuse cfg80211 rfkill gpio_ich iTCO_wdt iTCO_vendor_support i915 mousedev coretemp input_leds joydev kvm_intel kvm snd_hda_codec_idt snd_hda_codec_generic video drm_kms_helper drm snd_hda_intel led_class lpc_ich syscopyarea irqbypass snd_hda_codec sysfillrect evdev mac_hid sysimgblt jme snd_hda_core fb_sys_fops mii i2c_algo_bit psmouse pcspkr i2c_i801 rng_core intel_agp thermal intel_gtt shpchp button snd_hwdep snd_pcm snd_timer snd soundcore acpi_cpufreq tpm_tis tpm_tis_core tpm sch_fq_codel ip_tables x_tables ext4 crc16 jbd2 fscrypto mbcache hid_generic usbhid hid sr_mod cdrom sd_mod ata_generic pata_acpi serio_raw atkbd libps2 uhci_hcd i8042 serio ata_piix libata scsi_mod ehci_pci ehci_hcd usbcore usb_common
May 30 21:00:53 myhost kernel: CPU: 1 PID: 362 Comm: Xorg Tainted: G        W       4.11.3-1-ARCH #1
May 30 21:00:53 myhost kernel: Hardware name: Positivo Informatica SA POS-ECIG41BS/POS-ECIG41BS, BIOS 080015  05/14/2010
May 30 21:00:53 myhost kernel: Call Trace:
May 30 21:00:53 myhost kernel:  dump_stack+0x63/0x81
May 30 21:00:53 myhost kernel:  __warn+0xcb/0xf0
May 30 21:00:53 myhost kernel:  warn_slowpath_fmt+0x5a/0x80
May 30 21:00:53 myhost kernel:  intel_atomic_commit_tail+0xfd5/0xfe0 [i915]
May 30 21:00:53 myhost kernel:  ? wake_bit_function+0x60/0x60
May 30 21:00:53 myhost kernel:  intel_atomic_commit+0x360/0x480 [i915]
May 30 21:00:53 myhost kernel:  ? drm_atomic_check_only+0x39e/0x580 [drm]
May 30 21:00:53 myhost kernel:  drm_atomic_commit+0x4b/0x50 [drm]
May 30 21:00:53 myhost kernel:  restore_fbdev_mode+0x222/0x280 [drm_kms_helper]
May 30 21:00:53 myhost kernel:  drm_fb_helper_restore_fbdev_mode_unlocked+0x2e/0x80 [drm_kms_helper]
May 30 21:00:53 myhost kernel:  drm_fb_helper_set_par+0x2d/0x60 [drm_kms_helper]
May 30 21:00:53 myhost kernel:  intel_fbdev_set_par+0x18/0x70 [i915]
May 30 21:00:53 myhost kernel:  fb_set_var+0x193/0x430
May 30 21:00:53 myhost kernel:  ? update_curr+0xf2/0x1e0
May 30 21:00:53 myhost kernel:  ? __enqueue_entity+0x6c/0x70
May 30 21:00:53 myhost kernel:  ? put_prev_entity+0x80/0xc10
May 30 21:00:53 myhost kernel:  ? set_next_entity+0x57/0xdb0
May 30 21:00:53 myhost kernel:  fbcon_blank+0x206/0x390
May 30 21:00:53 myhost kernel:  do_unblank_screen+0xa4/0x190
May 30 21:00:53 myhost kernel:  complete_change_console+0x59/0xe0
May 30 21:00:53 myhost kernel:  vt_ioctl+0x10e7/0x11e0
May 30 21:00:53 myhost kernel:  ? __generic_file_write_iter+0x108/0x1c0
May 30 21:00:53 myhost kernel:  ? __wake_up+0x44/0x50
May 30 21:00:53 myhost kernel:  tty_ioctl+0x229/0xc40
May 30 21:00:53 myhost kernel:  ? n_tty_open+0xd0/0xd0
May 30 21:00:53 myhost kernel:  ? __fget+0x77/0xb0
May 30 21:00:53 myhost kernel:  ? sock_poll+0x68/0x90
May 30 21:00:53 myhost kernel:  do_vfs_ioctl+0xa5/0x600
May 30 21:00:53 myhost kernel:  ? __fget+0x77/0xb0
May 30 21:00:53 myhost kernel:  SyS_ioctl+0x79/0x90
May 30 21:00:53 myhost kernel:  entry_SYSCALL_64_fastpath+0x1a/0xa9
May 30 21:00:53 myhost kernel: RIP: 0033:0x7f79fbea7cb7
May 30 21:00:53 myhost kernel: RSP: 002b:00007fffcc5fe908 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
May 30 21:00:53 myhost kernel: RAX: ffffffffffffffda RBX: 00007f79fc162ae0 RCX: 00007f79fbea7cb7
May 30 21:00:53 myhost kernel: RDX: 0000000000000001 RSI: 0000000000005605 RDI: 000000000000000a
May 30 21:00:53 myhost kernel: RBP: 000000000004b640 R08: 0000000000000000 R09: 0000000000000001
May 30 21:00:53 myhost kernel: R10: 00007fffcc5fe8b0 R11: 0000000000000246 R12: 000000000005b680
May 30 21:00:53 myhost kernel: R13: 00000000034b3980 R14: 000000000084c3c8 R15: 0000000002e2da70
May 30 21:00:53 myhost kernel: ---[ end trace c02755ea47d64b4d ]---
May 30 21:00:53 myhost kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:29:pipe A] flip_done timed out
May 30 21:00:53 myhost kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:29:pipe A] flip_done timed out
May 30 21:00:53 myhost kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:29:pipe A] flip_done timed out
May 30 21:01:03 myhost kernel: [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR* [CRTC:29:pipe A] flip_done timed out
Comment 67 Diego Viola 2017-05-31 10:43:42 UTC
May 30 21:01:03 myhost kernel: ------------[ cut here ]------------
May 30 21:01:03 myhost kernel: WARNING: CPU: 0 PID: 362 at drivers/gpu/drm/i915/intel_display.c:14229 intel_atomic_commit_tail+0xfd5/0xfe0 [i915]
May 30 21:01:03 myhost kernel: pipe A vblank wait timed out
May 30 21:01:03 myhost kernel: Modules linked in: uas usb_storage fuse cfg80211 rfkill gpio_ich iTCO_wdt iTCO_vendor_support i915 mousedev coretemp input_leds joydev kvm_intel kvm snd_hda_codec_idt snd_hda_codec_generic video drm_kms_helper drm snd_hda_intel led_class lpc_ich syscopyarea irqbypass snd_hda_codec sysfillrect evdev mac_hid sysimgblt jme snd_hda_core fb_sys_fops mii i2c_algo_bit psmouse pcspkr i2c_i801 rng_core intel_agp thermal intel_gtt shpchp button snd_hwdep snd_pcm snd_timer snd soundcore acpi_cpufreq tpm_tis tpm_tis_core tpm sch_fq_codel ip_tables x_tables ext4 crc16 jbd2 fscrypto mbcache hid_generic usbhid hid sr_mod cdrom sd_mod ata_generic pata_acpi serio_raw atkbd libps2 uhci_hcd i8042 serio ata_piix libata scsi_mod ehci_pci ehci_hcd usbcore usb_common
May 30 21:01:03 myhost kernel: CPU: 0 PID: 362 Comm: Xorg Tainted: G        W       4.11.3-1-ARCH #1
May 30 21:01:03 myhost kernel: Hardware name: Positivo Informatica SA POS-ECIG41BS/POS-ECIG41BS, BIOS 080015  05/14/2010
May 30 21:01:03 myhost kernel: Call Trace:
May 30 21:01:03 myhost kernel:  dump_stack+0x63/0x81
May 30 21:01:03 myhost kernel:  __warn+0xcb/0xf0
May 30 21:01:03 myhost kernel:  warn_slowpath_fmt+0x5a/0x80
May 30 21:01:03 myhost kernel:  intel_atomic_commit_tail+0xfd5/0xfe0 [i915]
May 30 21:01:03 myhost kernel:  ? wake_bit_function+0x60/0x60
May 30 21:01:03 myhost kernel:  intel_atomic_commit+0x360/0x480 [i915]
May 30 21:01:03 myhost kernel:  ? drm_atomic_check_only+0x39e/0x580 [drm]
May 30 21:01:03 myhost kernel:  drm_atomic_commit+0x4b/0x50 [drm]
May 30 21:01:03 myhost kernel:  drm_atomic_helper_set_config+0x83/0xe0 [drm_kms_helper]
May 30 21:01:03 myhost kernel:  drm_mode_set_config_internal+0x65/0x110 [drm]
May 30 21:01:03 myhost kernel:  drm_mode_setcrtc+0x10c/0x560 [drm]
May 30 21:01:03 myhost kernel:  drm_ioctl+0x212/0x4d0 [drm]
May 30 21:01:03 myhost kernel:  ? drm_mode_getcrtc+0x170/0x170 [drm]
May 30 21:01:03 myhost kernel:  ? __vfs_write+0xe4/0x140
May 30 21:01:03 myhost kernel:  do_vfs_ioctl+0xa5/0x600
May 30 21:01:03 myhost kernel:  ? __fget+0x77/0xb0
May 30 21:01:03 myhost kernel:  SyS_ioctl+0x79/0x90
May 30 21:01:03 myhost kernel:  entry_SYSCALL_64_fastpath+0x1a/0xa9
May 30 21:01:03 myhost kernel: RIP: 0033:0x7f79fbea7cb7
May 30 21:01:03 myhost kernel: RSP: 002b:00007fffcc5fd828 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
May 30 21:01:03 myhost kernel: RAX: ffffffffffffffda RBX: 000000000000001a RCX: 00007f79fbea7cb7
May 30 21:01:03 myhost kernel: RDX: 00007fffcc5fd860 RSI: 00000000c06864a2 RDI: 000000000000000b
May 30 21:01:03 myhost kernel: RBP: 000000000083dbe0 R08: 0000000000000000 R09: 0000000002b60870
May 30 21:01:03 myhost kernel: R10: 00007fffcc5fda00 R11: 0000000000000246 R12: 00007fffcc5fdbe0
May 30 21:01:03 myhost kernel: R13: 0000000000000000 R14: 00007fffcc5fdc80 R15: 00007fffcc5fe994
May 30 21:01:03 myhost kernel: ---[ end trace c02755ea47d64b4e ]---
Comment 68 Jani Nikula 2017-05-31 14:20:31 UTC
If you're still seeing the oops i.e.

[  179.786793] BUG: unable to handle kernel NULL pointer dereference at           (null)
[  179.786840] IP: [<ffffffff810983eb>] __wake_up_common+0x2b/0x90

then please reopen this one.

Do not use this bug to report vblank wait timeouts. File a new bug for those if you're seeing them on or after v4.11.
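
For anyone unsure which case their log falls into, the distinction above can be sketched roughly as follows. This is an illustration only: the function name and the three labels are hypothetical, and the matched strings are simply the ones quoted in this thread, so logs from other kernel versions may be worded differently.

```python
def classify_kernel_log(log_text):
    """Roughly sort a kernel log into one of three hypothetical labels.

    "oops"         -> NULL pointer dereference in __wake_up_common (this bug)
    "timeout-only" -> only vblank wait / flip_done timeouts (file a new bug)
    "clean"        -> none of the quoted signatures found
    """
    # Signature of the original report, as quoted in this thread.
    has_oops = (
        "NULL pointer dereference" in log_text
        and "__wake_up_common" in log_text
    )
    # Covers both "pipe A vblank wait timed out" and
    # "vblank wait timed out on crtc 0", plus the flip_done *ERROR* line.
    has_timeout = (
        "vblank wait timed out" in log_text
        or "flip_done timed out" in log_text
    )
    if has_oops:
        return "oops"
    if has_timeout:
        return "timeout-only"
    return "clean"
```

A log that returns "oops" matches the original report; one that returns "timeout-only" shows only the symptoms that belong in a separate bug.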
Comment 69 Diego Viola 2017-07-05 22:29:15 UTC
My problem was solved here: Bug 101261

Sorry for spamming unrelated information.
Comment 70 Elizabeth 2017-07-07 23:02:03 UTC
(In reply to Diego Viola from comment #69)
> My problem was solved here: Bug 101261
> 
> Sorry for spamming unrelated information.

Thank you for the information. Closing the bug.

