Summary: | [SNB/IVB] stuck page flips | |
---|---|---|---
Product: | DRI | Reporter: | Josh Boyer <jwboyer>
Component: | DRM/Intel | Assignee: | Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: | CLOSED FIXED | QA Contact: | Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: | normal | |
Priority: | medium | CC: | intel-gfx-bugs, tbl0605, tvanhala
Version: | DRI git | |
Hardware: | Other | |
OS: | All | |
Whiteboard: | | |
i915 platform: | IVB, SNB | i915 features: | display/Other
Description
Josh Boyer
2015-02-24 18:50:44 UTC
I got what seems to be the same bug while running the 4.1-rc1 kernel with Ubuntu 14.04. At least, comparing the source files
https://github.com/torvalds/linux/blob/v4.1-rc1/drivers/gpu/drm/i915/intel_display.c (line 10098)
https://github.com/torvalds/linux/blob/v4.0-rc1/drivers/gpu/drm/i915/intel_display.c (line 9711)
it is exactly the same check that fails.

The symptoms are that the screen gets stuck with only the cursor moving. I was able to recover by pressing CTRL+ALT+F1 and restarting unity from the terminal. The problem occurred while I was watching a video on youtube. The machine is a Lenovo T420s with a Sandy Bridge CPU (i5-2540M).

I updated to the 4.1-rc1 kernel from the Ubuntu stock kernel 3.16.0-34 to check whether it fixes this bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1384342
I don't know if they have anything in common. At least both have to do with page flips.

Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522245] ------------[ cut here ]------------
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522292] WARNING: CPU: 0 PID: 1670 at /home/kernel/COD/linux/drivers/gpu/drm/i915/intel_display.c:10098 intel_check_page_flip+0xe6/0xf0 [i915]()
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522296] Kicking stuck page flip: queued at 877877, now 889273
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522298] Modules linked in: ctr ccm joydev serio_raw rfcomm bnep intel_rapl iosf_mbi x86_pkg_temp_thermal intel_powerclamp snd_hda_codec_hdmi coretemp snd_hda_codec_conexant snd_hda_codec_generic kvm i915 crct10dif_pclmul snd_hda_intel crc32_pclmul ghash_clmulni_intel arc4 snd_hda_controller aesni_intel iwldvm aes_x86_64 lrw mac80211 gf128mul snd_hda_codec glue_helper uvcvideo snd_hda_core ablk_helper cryptd snd_hwdep snd_seq_midi btusb snd_seq_midi_event btbcm snd_pcm videobuf2_vmalloc snd_rawmidi btintel thinkpad_acpi videobuf2_memops drm_kms_helper iwlwifi videobuf2_core cdc_mbim snd_seq drm v4l2_common cdc_ncm snd_seq_device videodev cfg80211 usbnet bluetooth snd_timer media mii nvram cdc_acm cdc_wdm i2c_algo_bit lpc_ich mei_me snd parport_pc mei wmi soundcore ppdev shpchp video lp mac_hid parport e1000e psmouse ahci sdhci_pci ptp libahci sdhci pps_core
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522379] CPU: 0 PID: 1670 Comm: dbus-daemon Not tainted 4.1.0-040100rc1-generic #201504270235
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522382] Hardware name: LENOVO 4174NEG/4174NEG, BIOS 8CET50WW (1.30 ) 11/01/2011
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522385] 0000000000002772 ffff88021e203d28 ffffffff817f9785 0000000000000007
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522389] ffff88021e203d78 ffff88021e203d68 ffffffff8107d0a7 01000000000039fe
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522393] ffff8802120e9000 ffff88021202b1a8 ffff88021202b000 0000000000000000
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522397] Call Trace:
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522400] <IRQ> [<ffffffff817f9785>] dump_stack+0x45/0x57
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522411] [<ffffffff8107d0a7>] warn_slowpath_common+0x97/0xe0
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522415] [<ffffffff8107d1a6>] warn_slowpath_fmt+0x46/0x50
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522448] [<ffffffffc0734206>] intel_check_page_flip+0xe6/0xf0 [i915]
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522474] [<ffffffffc06f7635>] ilk_display_irq_handler+0xd5/0x230 [i915]
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522498] [<ffffffffc06f9775>] ironlake_irq_handler+0x275/0x2b0 [i915]
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522503] [<ffffffff810d7ded>] handle_irq_event_percpu+0x5d/0x1e0
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522509] [<ffffffff810fb5c0>] ? tick_nohz_stop_sched_tick+0x290/0x300
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522512] [<ffffffff810d7fb8>] handle_irq_event+0x48/0x70
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522516] [<ffffffff810db21f>] handle_edge_irq+0x7f/0x120
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522520] [<ffffffff81018862>] handle_irq+0x22/0x40
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522525] [<ffffffff8180919c>] do_IRQ+0x5c/0x110
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522529] [<ffffffff8180702e>] common_interrupt+0x6e/0x6e
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522531] <EOI>
Apr 30 22:33:08 tuomas-thinkpad kernel: [14842.522534] ---[ end trace 81dd908546d849c6 ]---

I have no idea what hardware (GPU) either of you have. Please attach your full dmesg.

Created attachment 115522 [details]
dmesg output
(In reply to Chris Wilson from comment #2)
> I have no idea what hardware (GPU) either of you have. Please attach your
> full dmesg.

Ok, I have attached the dmesg output taken after the bug occurred. Please let me know if you need more info.

The original report is on IVB (there's a dmesg in Red Hat's bugzilla). And the attached dmesg was taken from an SNB machine.

Created attachment 116341 [details]
dmesg output
Hi, I'm having the same problem with kernel 4.0.5:

[ 4230.405612] ------------[ cut here ]------------
[ 4230.405648] WARNING: CPU: 0 PID: 0 at drivers/gpu/drm/i915/intel_display.c:9756 intel_check_page_flip+0xd1/0xe0 [i915]()
[ 4230.405650] Kicking stuck page flip: queued at 253043, now 253048
[ 4230.405652] Modules linked in: nvram(E) msr(E) pci_stub(E) vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) vmw_vsock_vmci_transport(E) vsock(E) vmw_vmci(E) btusb(E) uvcvideo(E) videobuf2_vmalloc(E) videobuf2_memops(E) videobuf2_core(E) v4l2_common(E) videodev(E) media(E) snd_hda_codec_hdmi(E) snd_hda_codec_realtek(E) snd_hda_codec_generic(E) snd_hda_intel(E) snd_hda_controller(E) snd_hda_codec(E) snd_hwdep(E) snd_pcm(E) intel_rapl(E) iosf_mbi(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) kvm_intel(E) kvm(E) crct10dif_pclmul(E) crc32_pclmul(E) ghash_clmulni_intel(E) cryptd(E) arc4(E) snd_seq_midi(E) snd_seq_midi_event(E) snd_rawmidi(E) joydev(E) serio_raw(E) snd_seq(E) iwlmvm(E) mac80211(E) bnep(E) rfcomm(E) bluetooth(E) iwlwifi(E) rtsx_pci_ms(E) memstick(E) cfg80211(E) lpc_ich(E) wmi(E) snd_seq_device(E) snd_timer(E) i915(E) snd(E) mei_me(E) binfmt_misc(E) mei(E) drm_kms_helper(E) tpm_infineon(E) drm(E) shpchp(E) i2c_algo_bit(E) dell_smo8800(E) ie31200_edac(E) soundcore(E) video(E) mac_hid(E) edac_core(E) parport_pc(E) ppdev(E) lp(E) parport(E) rtsx_pci_sdmmc(E) psmouse(E) ahci(E) libahci(E) r8169(E) mii(E) rtsx_pci(E)
[ 4230.405695] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G OE 4.0.5-tbl-patched #2
[ 4230.405697] Hardware name: Notebook W54_55SU1,SUW/W54_55SU1,SUW, BIOS 4.6.5 11/26/2013
[ 4230.405699] ffffffffc029cee8 ffff88011fa03d38 ffffffff817b47a9 0000000000000000
[ 4230.405701] ffff88011fa03d88 ffff88011fa03d78 ffffffff81075cca ffff88011fa03d88
[ 4230.405703] ffff8800d58ed000 ffff8800d5e4b800 ffff8800d5e4b9a8 0000000000000000
[ 4230.405706] Call Trace:
[ 4230.405708] <IRQ> [<ffffffff817b47a9>] dump_stack+0x45/0x57
[ 4230.405718] [<ffffffff81075cca>] warn_slowpath_common+0x8a/0xc0
[ 4230.405720] [<ffffffff81075d46>] warn_slowpath_fmt+0x46/0x50
[ 4230.405741] [<ffffffffc0245391>] intel_check_page_flip+0xd1/0xe0 [i915]
[ 4230.405758] [<ffffffffc0211878>] ironlake_irq_handler+0x2e8/0xfc0 [i915]
[ 4230.405762] [<ffffffff813ab300>] ? timerqueue_add+0x60/0xb0
[ 4230.405765] [<ffffffff810e0b09>] ? enqueue_hrtimer+0x29/0x90
[ 4230.405769] [<ffffffff810ce33e>] handle_irq_event_percpu+0x3e/0x1a0
[ 4230.405772] [<ffffffff810ce4e1>] handle_irq_event+0x41/0x70
[ 4230.405775] [<ffffffff810d148f>] handle_edge_irq+0x7f/0x120
[ 4230.405778] [<ffffffff81017732>] handle_irq+0x22/0x40
[ 4230.405782] [<ffffffff817bf1e1>] do_IRQ+0x51/0xf0
[ 4230.405785] [<ffffffff817bcfad>] common_interrupt+0x6d/0x6d
[ 4230.405786] <EOI> [<ffffffff81653b40>] ? cpuidle_enter_state+0x70/0x170
[ 4230.405791] [<ffffffff81653b2d>] ? cpuidle_enter_state+0x5d/0x170
[ 4230.405793] [<ffffffff81653c77>] cpuidle_enter+0x17/0x20
[ 4230.405796] [<ffffffff810b73ff>] cpu_startup_entry+0x3bf/0x460
[ 4230.405799] [<ffffffff817a6477>] rest_init+0x77/0x80
[ 4230.405803] [<ffffffff81d3710b>] start_kernel+0x494/0x4a1
[ 4230.405805] [<ffffffff81d36a67>] ? set_init_arg+0x55/0x55
[ 4230.405808] [<ffffffff81d36120>] ? early_idt_handlers+0x120/0x120
[ 4230.405810] [<ffffffff81d365ee>] x86_64_start_reservations+0x2a/0x2c
[ 4230.405812] [<ffffffff81d36742>] x86_64_start_kernel+0x152/0x161
[ 4230.405814] ---[ end trace b330d780ffc1f7f5 ]---

This bug was also reported at https://bugzilla.kernel.org/show_bug.cgi?id=91221

Thierry.

Created attachment 117278 [details]
dmesg from LG NC1100
I saw the same oops on Fedora 22 w/ kernel 4.0.5-300.fc22.i686. I didn't save the oops output, but I attached the dmesg from that device. When I see the oops again, I'll provide a stacktrace.

Finally an oops after several days of testing:

Jul 23 15:03:55 00110F0311C1 kernel: ------------[ cut here ]------------
Jul 23 15:03:55 00110F0311C1 kernel: WARNING: CPU: 1 PID: 1864 at drivers/gpu/drm/i915/intel_display.c:9756 intel_check_page_flip+0xc6/0xd0 [i915]()
Jul 23 15:03:55 00110F0311C1 kernel: Kicking stuck page flip: queued at 12788104, now 12861367
Jul 23 15:03:55 00110F0311C1 kernel: Modules linked in: tun iTCO_wdt iTCO_vendor_support ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ebtable_nat ebtable_broute bridge ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw coretemp ppdev kvm_intel snd_hda_codec_hdmi kvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel i2c_i801 snd_hda_controller tpm_tis snd_hda_codec tpm wmi snd_hwdep snd_seq snd_seq_device parport_pc snd_pcm parport snd_timer mei_me snd soundcore acpi_cpufreq mei lpc_ich 8021q garp stp llc mrp i915 i2c_algo_bit drm_kms_helper e1000e drm serio_raw ata_generic ptp pps_core
Jul 23 15:03:55 00110F0311C1 kernel: pata_acpi video
Jul 23 15:03:55 00110F0311C1 kernel: CPU: 1 PID: 1864 Comm: lgutil Not tainted 4.0.7-300.fc22.i686 #1
Jul 23 15:03:55 00110F0311C1 kernel: Hardware name: LG Electronics Inc. NC1100/NC1000, BIOS LGDSSF01 12/08/2011
Jul 23 15:03:55 00110F0311C1 kernel: c0d1d947 cd0eb5a1 00000000 f4567ec4 c0a730de f4567f04 f4567ef4 c0458127
Jul 23 15:03:55 00110F0311C1 kernel: f8240064 f4567f24 00000748 f823d42c 0000261c f81ec486 f81ec486 f3f1a400
Jul 23 15:03:55 00110F0311C1 kernel: f3cf6800 00000000 f4567f10 c045819e 00000009 f4567f04 f8240064 f4567f24
Jul 23 15:03:55 00110F0311C1 kernel: Call Trace:
Jul 23 15:03:55 00110F0311C1 kernel: [<c0a730de>] dump_stack+0x41/0x52
Jul 23 15:03:55 00110F0311C1 kernel: [<c0458127>] warn_slowpath_common+0x87/0xc0
Jul 23 15:03:55 00110F0311C1 kernel: [<f81ec486>] ? intel_check_page_flip+0xc6/0xd0 [i915]
Jul 23 15:03:55 00110F0311C1 kernel: [<f81ec486>] ? intel_check_page_flip+0xc6/0xd0 [i915]
Jul 23 15:03:55 00110F0311C1 kernel: [<c045819e>] warn_slowpath_fmt+0x3e/0x60
Jul 23 15:03:55 00110F0311C1 kernel: [<f81ec486>] intel_check_page_flip+0xc6/0xd0 [i915]
Jul 23 15:03:55 00110F0311C1 kernel: [<f81bc22b>] ironlake_irq_handler+0x37b/0xe60 [i915]
Jul 23 15:03:55 00110F0311C1 kernel: [<c084f4e8>] ? ata_bmdma_port_intr+0x28/0x100
Jul 23 15:03:55 00110F0311C1 kernel: [<c084c483>] ? ata_bmdma_status+0x13/0x20
Jul 23 15:03:55 00110F0311C1 kernel: [<c084f72c>] ? ata_bmdma_interrupt+0x16c/0x1f0
Jul 23 15:03:55 00110F0311C1 kernel: [<c04a7656>] handle_irq_event_percpu+0x76/0x190
Jul 23 15:03:55 00110F0311C1 kernel: [<c04a779a>] handle_irq_event+0x2a/0x50
Jul 23 15:03:55 00110F0311C1 kernel: [<c04aa2d0>] ? handle_level_irq+0xc0/0xc0
Jul 23 15:03:55 00110F0311C1 kernel: [<c04aa32d>] handle_edge_irq+0x5d/0xf0
Jul 23 15:03:55 00110F0311C1 kernel: [<c0405906>] handle_irq+0x96/0xd0
Jul 23 15:03:55 00110F0311C1 kernel: <IRQ> [<c040512c>] do_IRQ+0x3c/0xd0
Jul 23 15:03:55 00110F0311C1 kernel: [<c04500d8>] ? aes_enc_blk+0x748/0xb64
Jul 23 15:03:55 00110F0311C1 kernel: [<c067d612>] ? security_file_permission+0x22/0xb0
Jul 23 15:03:55 00110F0311C1 kernel: [<c0a78cf3>] common_interrupt+0x33/0x38
Jul 23 15:03:55 00110F0311C1 kernel: [<c07a007b>] ? dbg_pnp_show_option+0x18b/0x3f0
Jul 23 15:03:55 00110F0311C1 kernel: [<c0595fb4>] ? vfs_write+0x74/0x200
Jul 23 15:03:55 00110F0311C1 kernel: [<c0437f88>] ? smp_apic_timer_interrupt+0x38/0x50
Jul 23 15:03:55 00110F0311C1 kernel: [<c0596791>] SyS_write+0x51/0xb0
Jul 23 15:03:55 00110F0311C1 kernel: [<c0a7869f>] sysenter_do_call+0x12/0x12
Jul 23 15:03:55 00110F0311C1 kernel: [<c0a70000>] ? klist_put+0x60/0x80
Jul 23 15:03:55 00110F0311C1 kernel: ---[ end trace 6b0a49607c1db783 ]---

Note that this is 4.0.7-300.fc22.i686 and not 4.0.5-300.fc22.i686. But we have seen this on all kernel versions in FC22: 4.0.4-301, 4.0.4-303, 4.0.5-300, 4.0.6-300, and 4.0.7-300.

Created attachment 117474 [details]
dmesg from Nexcom NDiS 163
Just had the same issue happen on another device, a Nexcom NDiS 163. I attached the dmesg.

Since this is now affecting more than one device model on our end, our in-field software upgrade is blocked. It would be great if someone could give any sort of indication on the status of this bug...

Created attachment 117546 [details]
dmesg for Lenovo ThinkCentre M70e
Another oops, this time on a Lenovo ThinkCentre M70e.
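The three warnings quoted in this thread share one message format, so the two counters can be pulled out and compared mechanically. The following is a minimal Python sketch, not part of any report here; the function name `stuck_flip_deltas` is hypothetical, and the thread does not state what unit the "queued at" and "now" counters use, so the difference is reported as a raw number only.

```python
import re

# Message text exactly as it appears in the warnings quoted in this thread.
STUCK_FLIP = re.compile(r"Kicking stuck page flip: queued at (\d+), now (\d+)")


def stuck_flip_deltas(log_text):
    """Return (queued, now, now - queued) for each stuck-flip message found.

    Hypothetical helper: the counters are treated as plain integers because
    the thread does not say what they count.
    """
    return [
        (int(queued), int(now), int(now) - int(queued))
        for queued, now in STUCK_FLIP.findall(log_text)
    ]


# The three messages reported in this thread (log prefixes shortened).
sample = """\
[14842.522296] Kicking stuck page flip: queued at 877877, now 889273
[ 4230.405650] Kicking stuck page flip: queued at 253043, now 253048
kernel: Kicking stuck page flip: queued at 12788104, now 12861367
"""

for queued, now, delta in stuck_flip_deltas(sample):
    print(f"queued={queued} now={now} delta={delta}")
```

On the values above, the gaps come out as 11396, 5, and 73263, which shows how widely the interval between queueing and the warning varies across the three reports.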
Sometimes the oops isn't triggered until I reinitialize X. It would help if I could detect that the video has locked up and restart the box. Is there a liveness function/syscall/etc. that I can consume?

Timeout. I'm going to presume we've fixed these; haven't seen new reports in a while. Please reopen if the problem persists with latest kernels.
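On the question of detecting the lock-up: nothing in this thread names a kernel liveness interface, so the only signal available from the reports is the warning text itself. A rough Python sketch of a log watcher follows, under several assumptions: the function names are hypothetical, `journalctl` is assumed to be installed and readable by the caller, and the recovery action is left to the caller. As noted above, the warning sometimes does not appear until X is reinitialized, so this cannot catch every hang.

```python
import subprocess

# Marker string taken from the warnings quoted in this thread.
MARKER = "Kicking stuck page flip"


def first_stuck_flip(lines):
    """Return the first line containing the marker, or None if none is seen."""
    for line in lines:
        if MARKER in line:
            return line.rstrip("\n")
    return None


def follow_kernel_log():
    """Yield new kernel log lines as they arrive.

    Assumption: systemd's journalctl is available. This generator blocks
    until lines arrive and is not exercised by the demo below.
    """
    proc = subprocess.Popen(
        ["journalctl", "--dmesg", "--follow", "--lines=0"],
        stdout=subprocess.PIPE,
        text=True,
    )
    yield from proc.stdout


# Demo on a static sample instead of the live log.
sample = [
    "kernel: ------------[ cut here ]------------\n",
    "kernel: Kicking stuck page flip: queued at 12788104, now 12861367\n",
]
print(first_stuck_flip(sample))
```

In a deployment, `first_stuck_flip(follow_kernel_log())` would block until the warning shows up, at which point the caller could restart X or the machine. That choice of action is an assumption about the reporter's setup, not something the thread specifies.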