Bug 102138 - [BAT] pipe A vblank wait timed out
Summary: [BAT] pipe A vblank wait timed out
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: Other All
: high critical
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2017-08-10 08:57 UTC by Martin Peres
Modified: 2017-11-01 13:22 UTC (History)
1 user (show)

See Also:
i915 platform: SKL
i915 features: display/atomic


Attachments

Description Martin Peres 2017-08-10 08:57:27 UTC
When running on CI_DRM_2942, the machine fi-skl-6600u got the following WARNING when executing igt@gem_exec_suspend@basic-s3.

[  203.758766] pipe A vblank wait timed out
[  203.758783] ------------[ cut here ]------------
[  203.758808] WARNING: CPU: 3 PID: 3069 at drivers/gpu/drm/i915/intel_display.c:11907 intel_atomic_commit_tail+0xed5/0xf00 [i915]
[  203.758809] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic asix usbnet mii x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm i915 mei_me mei prime_numbers i2c_hid pinctrl_sunrisepoint pinctrl_intel
[  203.758831] CPU: 3 PID: 3069 Comm: kworker/u8:38 Tainted: G     U  W       4.13.0-rc4-CI-CI_DRM_2942+ #1
[  203.758831] Hardware name: Dell Inc. XPS 13 9350/      , BIOS 1.4.12 11/30/2016
[  203.758834] Workqueue: events_unbound async_run_entry_fn
[  203.758835] task: ffff88026ac32880 task.stack: ffffc9000092c000
[  203.758855] RIP: 0010:intel_atomic_commit_tail+0xed5/0xf00 [i915]
[  203.758856] RSP: 0000:ffffc9000092faa0 EFLAGS: 00010286
[  203.758857] RAX: 000000000000001c RBX: 0000000000000000 RCX: 0000000000000000
[  203.758858] RDX: 0000000080000001 RSI: 0000000000000001 RDI: ffffffff810ef316
[  203.758859] RBP: ffffc9000092fb48 R08: 0000000000000001 R09: 0000000000000000
[  203.758859] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88026c250000
[  203.758860] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000001
[  203.758861] FS:  0000000000000000(0000) GS:ffff88027dd80000(0000) knlGS:0000000000000000
[  203.758861] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  203.758862] CR2: 0000000000000000 CR3: 0000000003e0f000 CR4: 00000000003406e0
[  203.758863] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  203.758863] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  203.758864] Call Trace:
[  203.758868]  ? wait_woken+0x90/0x90
[  203.758887]  intel_atomic_commit+0x22a/0x2d0 [i915]
[  203.758890]  drm_atomic_commit+0x4b/0x50
[  203.758891]  drm_atomic_helper_commit_duplicated_state+0xe4/0x110
[  203.758911]  __intel_display_resume+0x86/0xd0 [i915]
[  203.758929]  intel_display_resume+0xbf/0xf0 [i915]
[  203.758943]  i915_pm_restore+0xf4/0x190 [i915]
[  203.758956]  i915_pm_resume+0xe/0x10 [i915]
[  203.758958]  pci_pm_resume+0x64/0xa0
[  203.758960]  dpm_run_callback+0x6f/0x310
[  203.758961]  ? pci_pm_freeze+0xf0/0xf0
[  203.758962]  device_resume+0xb1/0x1e0
[  203.758964]  ? dpm_watchdog_set+0x70/0x70
[  203.758966]  async_resume+0x1d/0x50
[  203.758967]  async_run_entry_fn+0x38/0x160
[  203.758969]  process_one_work+0x224/0x650
[  203.758971]  worker_thread+0x4e/0x3b0
[  203.758973]  kthread+0x114/0x150
[  203.758974]  ? process_one_work+0x650/0x650
[  203.758975]  ? kthread_create_on_node+0x40/0x40
[  203.758977]  ret_from_fork+0x27/0x40
[  203.758979] Code: ff ff ff 48 83 c7 08 e8 1a 42 fb e0 4c 8b 85 78 ff ff ff 4d 85 c0 0f 85 d3 fd ff ff 8d 73 41 48 c7 c7 b0 a5 1c a0 e8 cc 87 fd e0 <0f> ff e9 bd fd ff ff 8d 70 41 48 c7 c7 80 a5 1c a0 e8 b6 87 fd 
[  203.759006] ---[ end trace b2cd5421efccda7b ]---
[  203.762050] hpet1: lost 21 rtc interrupts

The last line sounds quite interesting.

Full logs: https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_2942/fi-skl-6600u/igt@gem_exec_suspend@basic-s3.html
Comment 1 Jani Saarinen 2017-09-01 08:10:41 UTC
Seen only once. Dropping priority
Comment 2 Piotr Kołaczkowski 2017-09-17 16:04:19 UTC
I have the same problem. 
Occurs very often on newly released 4.14.rc1 kernel on Dell Precision 5520.
Setup: Dell Precision 5520 + external Dell U3011 display connected through the WD15 dock and miniDP. 

Here is one such encounter followed by a total crash / kernel freeze.
This one happened after suspending, then disconnecting the laptop from the dock and resuming:

Sep 17 17:39:21 p5520 kernel: [12129.400105] pipe B vblank wait timed out
Sep 17 17:39:21 p5520 kernel: [12129.400154] ------------[ cut here ]------------
Sep 17 17:39:21 p5520 kernel: [12129.400209] WARNING: CPU: 6 PID: 16693 at /home/kernel/COD/linux/drivers/gpu/drm/i915/intel_display.c:12172 intel_atomic_commit_tail+0xf6e/0xf80 [i915]
Sep 17 17:39:21 p5520 kernel: [12129.400209] Modules linked in: usblp thunderbolt rfcomm ccm xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack libcrc32c ipt_REJECT nf_reject_ipv4 cmac xt_tcpudp bridge stp llc cdc_ether usbnet r8152 mii snd_usb_audio snd_usbmidi_lib ebtable_filter ebtables msr ip6table_filter ip6_tables iptable_filter uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core videodev media btusb btrtl ipmi_devintf ipmi_msghandler bnep arc4 nls_iso8859_1 snd_hda_codec_hdmi joydev iwlmvm hid_multitouch mac80211 iwlwifi snd_hda_codec_realtek snd_hda_codec_generic cfg80211 dell_rbtn rtsx_pci_ms memstick dell_laptop dell_smm_hwmon intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm
Sep 17 17:39:21 p5520 kernel: [12129.400232]  snd_hda_intel irqbypass snd_hda_codec crct10dif_pclmul snd_hda_core snd_hwdep crc32_pclmul ghash_clmulni_intel pcbc snd_pcm aesni_intel aes_x86_64 snd_seq_midi crypto_simd glue_helper snd_seq_midi_event cryptd snd_rawmidi intel_cstate intel_rapl_perf snd_seq snd_seq_device snd_timer idma64 virt_dma input_leds hci_uart snd dell_wmi serio_raw btbcm dell_smbios mei_me serdev dcdbas processor_thermal_device btqca wmi_bmof soundcore intel_lpss_pci shpchp intel_pch_thermal mei intel_soc_dts_iosf btintel ie31200_edac bluetooth ecdh_generic int3403_thermal int340x_thermal_zone intel_lpss_acpi intel_lpss dell_smo8800 intel_hid int3400_thermal acpi_thermal_rel sparse_keymap acpi_als acpi_pad mac_hid kfifo_buf industrialio parport_pc ppdev lp parport ip_tables x_tables autofs4 btrfs xor zstd_decompress
Sep 17 17:39:21 p5520 kernel: [12129.458861]  zstd_compress xxhash raid6_pq dm_mirror dm_region_hash dm_log hid_generic usbhid rtsx_pci_sdmmc nouveau i915 mxm_wmi ttm i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops nvme drm rtsx_pci nvme_core ahci libahci i2c_hid hid wmi pinctrl_sunrisepoint video pinctrl_intel
Sep 17 17:39:21 p5520 kernel: [12129.458875] CPU: 6 PID: 16693 Comm: kworker/u16:60 Tainted: G     U  W       4.14.0-041400rc1-generic #201709162031
Sep 17 17:39:21 p5520 kernel: [12129.458876] Hardware name: Dell Inc. Precision 5520/06X96V, BIOS 1.5.0 08/30/2017
Sep 17 17:39:21 p5520 kernel: [12129.458896] Workqueue: events_unbound intel_atomic_commit_work [i915]
Sep 17 17:39:21 p5520 kernel: [12129.458898] task: ffff98786c6cde00 task.stack: ffffb5ac87ac0000
Sep 17 17:39:21 p5520 kernel: [12129.458914] RIP: 0010:intel_atomic_commit_tail+0xf6e/0xf80 [i915]
Sep 17 17:39:21 p5520 kernel: [12129.458915] RSP: 0018:ffffb5ac87ac3d88 EFLAGS: 00010282
Sep 17 17:39:21 p5520 kernel: [12129.458916] RAX: 000000000000001c RBX: 0000000000000001 RCX: 0000000000000000
Sep 17 17:39:21 p5520 kernel: [12129.458917] RDX: 0000000000000000 RSI: ffff98787f58dc78 RDI: ffff98787f58dc78
Sep 17 17:39:21 p5520 kernel: [12129.458917] RBP: ffffb5ac87ac3e40 R08: 0000000000000001 R09: 0000000000000ab0
Sep 17 17:39:21 p5520 kernel: [12129.458918] R10: ffffb5ac87ac3d88 R11: 0000000000000000 R12: 00000000000ac202
Sep 17 17:39:21 p5520 kernel: [12129.458919] R13: ffff987862560000 R14: ffff98786c99a000 R15: 0000000000000002
Sep 17 17:39:21 p5520 kernel: [12129.458920] FS:  0000000000000000(0000) GS:ffff98787f580000(0000) knlGS:0000000000000000
Sep 17 17:39:21 p5520 kernel: [12129.458920] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 17 17:39:21 p5520 kernel: [12129.458921] CR2: 00007ff0c0000010 CR3: 0000000419273005 CR4: 00000000003606e0
Sep 17 17:39:21 p5520 kernel: [12129.458922] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Sep 17 17:39:21 p5520 kernel: [12129.458922] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Sep 17 17:39:21 p5520 kernel: [12129.458923] Call Trace:
Sep 17 17:39:21 p5520 kernel: [12129.458928]  ? dequeue_task_fair+0x4f8/0x650
Sep 17 17:39:21 p5520 kernel: [12129.458930]  ? wait_woken+0x80/0x80
Sep 17 17:39:21 p5520 kernel: [12129.458954]  intel_atomic_commit_work+0x12/0x20 [i915]
Sep 17 17:39:21 p5520 kernel: [12129.458960]  process_one_work+0x1e7/0x410
Sep 17 17:39:21 p5520 kernel: [12129.458962]  worker_thread+0x4a/0x410
Sep 17 17:39:21 p5520 kernel: [12129.458966]  kthread+0x125/0x140
Sep 17 17:39:21 p5520 kernel: [12129.458968]  ? process_one_work+0x410/0x410
Sep 17 17:39:21 p5520 kernel: [12129.458970]  ? kthread_create_on_node+0x70/0x70
Sep 17 17:39:21 p5520 kernel: [12129.458973]  ret_from_fork+0x25/0x30
Sep 17 17:39:21 p5520 kernel: [12129.458975] Code: ff ff ff 48 83 c7 08 e8 91 98 01 fe 4c 8b 85 70 ff ff ff 4d 85 c0 0f 85 7b fa ff ff 8d 73 41 48 c7 c7 a8 14 35 c0 e8 73 4e 03 fe <0f> ff e9 65 fa ff ff 90 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 
Sep 17 17:39:21 p5520 kernel: [12129.459001] ---[ end trace 14ce5c393944d276 ]---
Sep 17 17:39:21 p5520 kernel: [12129.555015] pci_bus 0000:07: Allocating resources
Sep 17 17:39:21 p5520 kernel: [12129.555080] pcieport 0000:07:02.0: bridge window [mem 0x00100000-0x000fffff 64bit pref] to [bus 3e] add_size 200000 add_align 100000
Sep 17 17:39:21 p5520 kernel: [12129.555103] pcieport 0000:07:02.0: BAR 15: no space for [mem size 0x00200000 64bit pref]
Sep 17 17:39:21 p5520 kernel: [12129.555104] pcieport 0000:07:02.0: BAR 15: failed to assign [mem size 0x00200000 64bit pref]
Sep 17 17:39:21 p5520 kernel: [12129.555105] pcieport 0000:07:02.0: BAR 15: no space for [mem size 0x00200000 64bit pref]
Sep 17 17:39:21 p5520 kernel: [12129.555106] pcieport 0000:07:02.0: BAR 15: failed to assign [mem size 0x00200000 64bit pref]
Sep 17 17:39:21 p5520 kernel: [12129.711876] cups-browsed[1298]: segfault at 0 ip 00007f17e4e92711 sp 00007fffe92f09d8 error 4 in libc-2.24.so[7f17e4d45000+1be000]
Sep 17 17:39:24 p5520 kernel: [12132.490755] usblp1: removed
Sep 17 17:39:24 p5520 kernel: [12132.492122] usblp 3-1.6:1.1: usblp1: USB Bidirectional printer dev 4 if 1 alt 0 proto 2 vid 0x03F0 pid 0x342A
Sep 17 17:39:26 p5520 kernel: [12134.230857] pci_bus 0000:07: Allocating resources
Sep 17 17:39:26 p5520 kernel: [12134.230900] pcieport 0000:07:02.0: bridge window [mem 0x00100000-0x000fffff 64bit pref] to [bus 3e] add_size 200000 add_align 100000
Sep 17 17:39:26 p5520 kernel: [12134.230912] pcieport 0000:07:02.0: BAR 15: no space for [mem size 0x00200000 64bit pref]
Sep 17 17:39:26 p5520 kernel: [12134.230913] pcieport 0000:07:02.0: BAR 15: failed to assign [mem size 0x00200000 64bit pref]
Sep 17 17:39:26 p5520 kernel: [12134.230914] pcieport 0000:07:02.0: BAR 15: no space for [mem size 0x00200000 64bit pref]
Sep 17 17:39:26 p5520 kernel: [12134.230915] pcieport 0000:07:02.0: BAR 15: failed to assign [mem size 0x00200000 64bit pref]
Sep 17 17:39:59 p5520 kernel: [12167.456363] wlp2s0: deauthenticating from 90:5c:44:98:88:59 by local choice (Reason: 3=DEAUTH_LEAVING)
Sep 17 17:39:59 p5520 kernel: [12167.461582] wlp2s0: failed to remove key (1, ff:ff:ff:ff:ff:ff) from hardware (-22)
Sep 17 17:39:59 p5520 kernel: [12167.483789] IPv6: ADDRCONF(NETDEV_UP): wlp2s0: link is not ready
^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@
Comment 3 Martin Peres 2017-11-01 13:22:42 UTC
Never seen again. Closing


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.