We have a report [1] of the following backtrace with the 4.0-rc1 kernel: WARNING: CPU: 0 PID: 0 at drivers/gpu/drm/i915/intel_display.c:9711 intel_check_page_flip+0xdb/0xf0 [i915]() Kicking stuck page flip: queued at 11492, now 11497 Modules linked in: ccm bnep bluetooth fuse xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ebtable_nat ebtable_broute bridge ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw rtsx_usb_ms memstick intel_rapl iosf_mbi arc4 x86_pkg_temp_thermal rtl8192ce rtl_pci coretemp rtl8192c_common kvm_intel rtlwifi kvm snd_hda_codec_hdmi mac80211 crct10dif_pclmul iTCO_wdt iTCO_vendor_support snd_hda_codec_realtek snd_hda_codec_generic crc32_pclmul sparse_keymap crc32c_intel cfg80211 snd_hda_intel snd_hda_controller ghash_clmulni_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm rfkill snd_timer snd soundcore serio_raw mei_me mei shpchp i2c_i801 lpc_ich tpm_tis tpm wmi nfsd auth_rpcgss nfs_acl lockd grace sunrpc i915 rtsx_usb_sdmmc i2c_algo_bit drm_kms_helper mmc_core drm 8021q garp stp llc mrp r8169 mii rtsx_usb mfd_core video uas usb_storage CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.0.0-0.rc1.git0.1.fc22.x86_64 #1 Hardware name: WB WB-B06211/WB-B0621, BIOS EB062IWB V1.0 12/12/2013 0000000000000000 e29f8732215d5724 ffff88011f203cf8 ffffffff8177fc48 0000000000000000 ffff88011f203d50 ffff88011f203d38 ffffffff8109c7ba ffff88011f203d68 ffff8800367f3800 ffff8800365a5000 0000000000000000 Call Trace: <IRQ> [<ffffffff8177fc48>] dump_stack+0x45/0x57 [<ffffffff8109c7ba>] warn_slowpath_common+0x8a/0xc0 [<ffffffff8109c845>] warn_slowpath_fmt+0x55/0x70 [<ffffffffa019900b>] intel_check_page_flip+0xdb/0xf0 [i915] [<ffffffffa016515c>] ironlake_irq_handler+0x42c/0x1000 [i915] [<ffffffff811069dd>] ? hrtimer_get_next_event+0x3d/0x90 [<ffffffff814a9fdd>] ? credit_entropy_bits+0x26d/0x2d0 [<ffffffff8110594a>] ? get_next_timer_interrupt+0x1fa/0x270 [<ffffffffa0593384>] ? _rtl_pci_interrupt+0x1b4/0x410 [rtl_pci] [<ffffffff810f4be7>] handle_irq_event_percpu+0x77/0x1a0 [<ffffffff810f4d4b>] handle_irq_event+0x3b/0x60 [<ffffffff810f7f3e>] handle_edge_irq+0x6e/0x120 [<ffffffff81017534>] handle_irq+0x74/0x140 [<ffffffff810a00b1>] ? _local_bh_enable+0x21/0x50 [<ffffffff81788dcf>] do_IRQ+0x4f/0xf0 [<ffffffff81786b2d>] common_interrupt+0x6d/0x6d <EOI> [<ffffffff81616cbd>] ? cpuidle_enter_state+0x5d/0x170 [<ffffffff81616cad>] ? cpuidle_enter_state+0x4d/0x170 [<ffffffff81616fb7>] cpuidle_enter+0x17/0x20 [<ffffffff810df9d9>] cpu_startup_entry+0x349/0x3e0 [<ffffffff81776547>] rest_init+0x77/0x80 [<ffffffff81d3803f>] start_kernel+0x4a0/0x4c1 [<ffffffff81d37120>] ? early_idt_handlers+0x120/0x120 [<ffffffff81d37339>] x86_64_start_reservations+0x2a/0x2c [<ffffffff81d3749c>] x86_64_start_kernel+0x161/0x184 [1] https://bugzilla.redhat.com/show_bug.cgi?id=1195870
I got what seems to be the same bug while running the 4.1-rc1 kernel with Ubuntu 14.04. At least comparing the source files https://github.com/torvalds/linux/blob/v4.1-rc1/drivers/gpu/drm/i915/intel_display.c (line 10098) https://github.com/torvalds/linux/blob/v4.0-rc1/drivers/gpu/drm/i915/intel_display.c (line 9711) it is exactly the same check that fails. The symptoms are that the screen gets stuck with only the cursor moving. I was able to recover by CTRL+ALT+F1 and restarting unity from the terminal. The problem occured while I was watching a video on youtube. The machine is a Lenovo T420s with a sandy bridge CPU (i5-2540M). I updated to the 4.1-rc1 kernel from Ubuntu stock kernel 3.16.0-34 to check if it fixes this bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1384342 I don't know if they have anything in common. At least both have to do with page flips. Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522245] ------------[ cut here ]------------ Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522292] WARNING: CPU: 0 PID: 1670 at /home/kernel/COD/linux/drivers/gpu/drm/i915/intel_display.c:10098 intel_check_page_flip+0xe6/0xf0 [i915]() Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522296] Kicking stuck page flip: queued at 877877, now 889273 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522298] Modules linked in: ctr ccm joydev serio_raw rfcomm bnep intel_rapl iosf_mbi x86_pkg_temp_thermal intel_powerclamp snd_hda_codec_hdmi coretemp snd_hda_codec_conexant snd_hda_codec_generic kvm i915 crct10dif_pclmul snd_hda_intel crc32_pclmul ghash_clmulni_intel arc4 snd_hda_controller aesni_intel iwldvm aes_x86_64 lrw mac80211 gf128mul snd_hda_codec glue_helper uvcvideo snd_hda_core ablk_helper cryptd snd_hwdep snd_seq_midi btusb snd_seq_midi_event btbcm snd_pcm videobuf2_vmalloc snd_rawmidi btintel thinkpad_acpi videobuf2_memops drm_kms_helper iwlwifi videobuf2_core cdc_mbim snd_seq drm v4l2_common cdc_ncm snd_seq_device videodev cfg80211 usbnet bluetooth snd_timer media mii nvram cdc_acm cdc_wdm i2c_algo_bit lpc_ich mei_me snd parport_pc mei wmi soundcore ppdev shpchp video lp mac_hid parport e1000e psmouse ahci sdhci_pci ptp libahci sdhci pps_core Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522379] CPU: 0 PID: 1670 Comm: dbus-daemon Not tainted 4.1.0-040100rc1-generic #201504270235 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522382] Hardware name: LENOVO 4174NEG/4174NEG, BIOS 8CET50WW (1.30 ) 11/01/2011 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522385] 0000000000002772 ffff88021e203d28 ffffffff817f9785 0000000000000007 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522389] ffff88021e203d78 ffff88021e203d68 ffffffff8107d0a7 01000000000039fe Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522393] ffff8802120e9000 ffff88021202b1a8 ffff88021202b000 0000000000000000 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522397] Call Trace: Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522400] <IRQ> [<ffffffff817f9785>] dump_stack+0x45/0x57 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522411] [<ffffffff8107d0a7>] warn_slowpath_common+0x97/0xe0 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522415] [<ffffffff8107d1a6>] warn_slowpath_fmt+0x46/0x50 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522448] [<ffffffffc0734206>] intel_check_page_flip+0xe6/0xf0 [i915] Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522474] [<ffffffffc06f7635>] ilk_display_irq_handler+0xd5/0x230 [i915] Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522498] [<ffffffffc06f9775>] ironlake_irq_handler+0x275/0x2b0 [i915] Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522503] [<ffffffff810d7ded>] handle_irq_event_percpu+0x5d/0x1e0 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522509] [<ffffffff810fb5c0>] ? tick_nohz_stop_sched_tick+0x290/0x300 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522512] [<ffffffff810d7fb8>] handle_irq_event+0x48/0x70 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522516] [<ffffffff810db21f>] handle_edge_irq+0x7f/0x120 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522520] [<ffffffff81018862>] handle_irq+0x22/0x40 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522525] [<ffffffff8180919c>] do_IRQ+0x5c/0x110 Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522529] [<ffffffff8180702e>] common_interrupt+0x6e/0x6e Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522531] <EOI> Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522534] ---[ end trace 81dd908546d849c6 ]---
I have no idea what hardware (GPU) either of you have. Please attach your full dmesg.
Created attachment 115522 [details] dmesg output
(In reply to Chris Wilson from comment #2) > I have no idea what hardware (GPU) either of you have. Please attach your > full dmesg. Ok, I have attached the dmesg output taken after the bug occured. Please let me know if you need more info.
The original report is on IVB (there's a dmesg in Redhat's bugzilla). And the attached dmesg was taken from an SNB machine.
Created attachment 116341 [details] dmesg output
Hi, I'm having same problem with kernel 4.0.5 : [ 4230.405612] ------------[ cut here ]------------ [ 4230.405648] WARNING: CPU: 0 PID: 0 at drivers/gpu/drm/i915/intel_display.c:9756 intel_check_page_flip+0xd1/0xe0 [i915]() [ 4230.405650] Kicking stuck page flip: queued at 253043, now 253048 [ 4230.405652] Modules linked in: nvram(E) msr(E) pci_stub(E) vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) vmw_vsock_vmci_transport(E) vsock(E) vmw_vmci(E) btusb(E) uvcvideo(E) videobuf2_vmalloc(E) videobuf2_memops(E) videobuf2_core(E) v4l2_common(E) videodev(E) media(E) snd_hda_codec_hdmi(E) snd_hda_codec_realtek(E) snd_hda_codec_generic(E) snd_hda_intel(E) snd_hda_controller(E) snd_hda_codec(E) snd_hwdep(E) snd_pcm(E) intel_rapl(E) iosf_mbi(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) kvm_intel(E) kvm(E) crct10dif_pclmul(E) crc32_pclmul(E) ghash_clmulni_intel(E) cryptd(E) arc4(E) snd_seq_midi(E) snd_seq_midi_event(E) snd_rawmidi(E) joydev(E) serio_raw(E) snd_seq(E) iwlmvm(E) mac80211(E) bnep(E) rfcomm(E) bluetooth(E) iwlwifi(E) rtsx_pci_ms(E) memstick(E) cfg80211(E) lpc_ich(E) wmi(E) snd_seq_device(E) snd_timer(E) i915(E) snd(E) mei_me(E) binfmt_misc(E) mei(E) drm_kms_helper(E) tpm_infineon(E) drm(E) shpchp(E) i2c_algo_bit(E) dell_smo8800(E) ie31200_edac(E) soundcore(E) video(E) mac_hid(E) edac_core(E) parport_pc(E) ppdev(E) lp(E) parport(E) rtsx_pci_sdmmc(E) psmouse(E) ahci(E) libahci(E) r8169(E) mii(E) rtsx_pci(E) [ 4230.405695] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G OE 4.0.5-tbl-patched #2 [ 4230.405697] Hardware name: Notebook W54_55SU1,SUW/W54_55SU1,SUW, BIOS 4.6.5 11/26/2013 [ 4230.405699] ffffffffc029cee8 ffff88011fa03d38 ffffffff817b47a9 0000000000000000 [ 4230.405701] ffff88011fa03d88 ffff88011fa03d78 ffffffff81075cca ffff88011fa03d88 [ 4230.405703] ffff8800d58ed000 ffff8800d5e4b800 ffff8800d5e4b9a8 0000000000000000 [ 4230.405706] Call Trace: [ 4230.405708] <IRQ> [<ffffffff817b47a9>] dump_stack+0x45/0x57 [ 4230.405718] [<ffffffff81075cca>] warn_slowpath_common+0x8a/0xc0 [ 4230.405720] [<ffffffff81075d46>] warn_slowpath_fmt+0x46/0x50 [ 4230.405741] [<ffffffffc0245391>] intel_check_page_flip+0xd1/0xe0 [i915] [ 4230.405758] [<ffffffffc0211878>] ironlake_irq_handler+0x2e8/0xfc0 [i915] [ 4230.405762] [<ffffffff813ab300>] ? timerqueue_add+0x60/0xb0 [ 4230.405765] [<ffffffff810e0b09>] ? enqueue_hrtimer+0x29/0x90 [ 4230.405769] [<ffffffff810ce33e>] handle_irq_event_percpu+0x3e/0x1a0 [ 4230.405772] [<ffffffff810ce4e1>] handle_irq_event+0x41/0x70 [ 4230.405775] [<ffffffff810d148f>] handle_edge_irq+0x7f/0x120 [ 4230.405778] [<ffffffff81017732>] handle_irq+0x22/0x40 [ 4230.405782] [<ffffffff817bf1e1>] do_IRQ+0x51/0xf0 [ 4230.405785] [<ffffffff817bcfad>] common_interrupt+0x6d/0x6d [ 4230.405786] <EOI> [<ffffffff81653b40>] ? cpuidle_enter_state+0x70/0x170 [ 4230.405791] [<ffffffff81653b2d>] ? cpuidle_enter_state+0x5d/0x170 [ 4230.405793] [<ffffffff81653c77>] cpuidle_enter+0x17/0x20 [ 4230.405796] [<ffffffff810b73ff>] cpu_startup_entry+0x3bf/0x460 [ 4230.405799] [<ffffffff817a6477>] rest_init+0x77/0x80 [ 4230.405803] [<ffffffff81d3710b>] start_kernel+0x494/0x4a1 [ 4230.405805] [<ffffffff81d36a67>] ? set_init_arg+0x55/0x55 [ 4230.405808] [<ffffffff81d36120>] ? early_idt_handlers+0x120/0x120 [ 4230.405810] [<ffffffff81d365ee>] x86_64_start_reservations+0x2a/0x2c [ 4230.405812] [<ffffffff81d36742>] x86_64_start_kernel+0x152/0x161 [ 4230.405814] ---[ end trace b330d780ffc1f7f5 ]--- This bug was also reported at https://bugzilla.kernel.org/show_bug.cgi?id=91221 Thierry.
Created attachment 117278 [details] dmesg from LG NC1100
I saw the same oops on Fedora 22 w/ kernel 4.0.5-300.fc22.i686. I didn't save the oops output, but I attached the dmesg from that device. When I see the oops again, I'll provide a stacktrace.
Finally an oops after several days of testing: Jul 23 15:03:55 00110F0311C1 kernel: ------------[ cut here ]------------ Jul 23 15:03:55 00110F0311C1 kernel: WARNING: CPU: 1 PID: 1864 at drivers/gpu/drm/i915/intel_display.c:9756 intel_check_page_flip+0xc6/0xd0 [i915]() Jul 23 15:03:55 00110F0311C1 kernel: Kicking stuck page flip: queued at 12788104, now 12861367 Jul 23 15:03:55 00110F0311C1 kernel: Modules linked in: tun iTCO_wdt iTCO_vendor_support ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ebtable_nat ebtable_broute bridge ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw coretemp ppdev kvm_intel snd_hda_codec_hdmi kvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel i2c_i801 snd_hda_controller tpm_tis snd_hda_codec tpm wmi snd_hwdep snd_seq snd_seq_device parport_pc snd_pcm parport snd_timer mei_me snd soundcore acpi_cpufreq mei lpc_ich 8021q garp stp llc mrp i915 i2c_algo_bit drm_kms_helper e1000e drm serio_raw ata_generic ptp pps_core Jul 23 15:03:55 00110F0311C1 kernel: pata_acpi video Jul 23 15:03:55 00110F0311C1 kernel: CPU: 1 PID: 1864 Comm: lgutil Not tainted 4.0.7-300.fc22.i686 #1 Jul 23 15:03:55 00110F0311C1 kernel: Hardware name: LG Electronics Inc. NC1100/NC1000, BIOS LGDSSF01 12/08/2011 Jul 23 15:03:55 00110F0311C1 kernel: c0d1d947 cd0eb5a1 00000000 f4567ec4 c0a730de f4567f04 f4567ef4 c0458127 Jul 23 15:03:55 00110F0311C1 kernel: f8240064 f4567f24 00000748 f823d42c 0000261c f81ec486 f81ec486 f3f1a400 Jul 23 15:03:55 00110F0311C1 kernel: f3cf6800 00000000 f4567f10 c045819e 00000009 f4567f04 f8240064 f4567f24 Jul 23 15:03:55 00110F0311C1 kernel: Call Trace: Jul 23 15:03:55 00110F0311C1 kernel: [<c0a730de>] dump_stack+0x41/0x52 Jul 23 15:03:55 00110F0311C1 kernel: [<c0458127>] warn_slowpath_common+0x87/0xc0 Jul 23 15:03:55 00110F0311C1 kernel: [<f81ec486>] ? intel_check_page_flip+0xc6/0xd0 [i915] Jul 23 15:03:55 00110F0311C1 kernel: [<f81ec486>] ? intel_check_page_flip+0xc6/0xd0 [i915] Jul 23 15:03:55 00110F0311C1 kernel: [<c045819e>] warn_slowpath_fmt+0x3e/0x60 Jul 23 15:03:55 00110F0311C1 kernel: [<f81ec486>] intel_check_page_flip+0xc6/0xd0 [i915] Jul 23 15:03:55 00110F0311C1 kernel: [<f81bc22b>] ironlake_irq_handler+0x37b/0xe60 [i915] Jul 23 15:03:55 00110F0311C1 kernel: [<c084f4e8>] ? ata_bmdma_port_intr+0x28/0x100 Jul 23 15:03:55 00110F0311C1 kernel: [<c084c483>] ? ata_bmdma_status+0x13/0x20 Jul 23 15:03:55 00110F0311C1 kernel: [<c084f72c>] ? ata_bmdma_interrupt+0x16c/0x1f0 Jul 23 15:03:55 00110F0311C1 kernel: [<c04a7656>] handle_irq_event_percpu+0x76/0x190 Jul 23 15:03:55 00110F0311C1 kernel: [<c04a779a>] handle_irq_event+0x2a/0x50 Jul 23 15:03:55 00110F0311C1 kernel: [<c04aa2d0>] ? handle_level_irq+0xc0/0xc0 Jul 23 15:03:55 00110F0311C1 kernel: [<c04aa32d>] handle_edge_irq+0x5d/0xf0 Jul 23 15:03:55 00110F0311C1 kernel: [<c0405906>] handle_irq+0x96/0xd0 Jul 23 15:03:55 00110F0311C1 kernel: <IRQ> [<c040512c>] do_IRQ+0x3c/0xd0 Jul 23 15:03:55 00110F0311C1 kernel: [<c04500d8>] ? aes_enc_blk+0x748/0xb64 Jul 23 15:03:55 00110F0311C1 kernel: [<c067d612>] ? security_file_permission+0x22/0xb0 Jul 23 15:03:55 00110F0311C1 kernel: [<c0a78cf3>] common_interrupt+0x33/0x38 Jul 23 15:03:55 00110F0311C1 kernel: [<c07a007b>] ? dbg_pnp_show_option+0x18b/0x3f0 Jul 23 15:03:55 00110F0311C1 kernel: [<c0595fb4>] ? vfs_write+0x74/0x200 Jul 23 15:03:55 00110F0311C1 kernel: [<c0437f88>] ? smp_apic_timer_interrupt+0x38/0x50 Jul 23 15:03:55 00110F0311C1 kernel: [<c0596791>] SyS_write+0x51/0xb0 Jul 23 15:03:55 00110F0311C1 kernel: [<c0a7869f>] sysenter_do_call+0x12/0x12 Jul 23 15:03:55 00110F0311C1 kernel: [<c0a70000>] ? klist_put+0x60/0x80 Jul 23 15:03:55 00110F0311C1 kernel: ---[ end trace 6b0a49607c1db783 ]---
Note that this is 4.0.7-300.fc22.i686 and not 4.0.5-300.fc22.i686. But we have seen this on all kernel versions in FC22: 4.0.4-301, 4.0.4-303, 4.0.5-300, 4.0.6-300, and 4.0.7-300.
Created attachment 117474 [details] dmesg from Nexcom NDiS 163
Just had the same issue happen on another device, a Nexcom NDiS 163. I attached the dmesg. Since this is now affecting more than one device model on our end, our in-field software upgrade is blocked. It would be great if someone could give any sort of indication on the status of this bug...
Created attachment 117546 [details] dmesg for Lenovo ThinkCentre M70e Another oops, this time on a Lenovo ThinkCentre M70e.
Sometimes the oops isn't triggered until I reinitialize X. It would help if I could detect that the video has locked up and restart the box. Is there a liveliness function/syscall/etc. that I can consume?
Timeout. I'm going to presume we've fixed these; haven't seen new reports in a while. Please reopen if the problem persists with latest kernels.
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.