Bug 105571 - [SKL] Occasional display hang on DPMS resume
Summary: [SKL] Occasional display hang on DPMS resume
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-03-17 14:31 UTC by Paul Donohue
Modified: 2018-05-17 08:56 UTC (History)
1 user (show)

See Also:
i915 platform: SKL
i915 features: power/Other


Attachments
Xorg.0.log (45.48 KB, text/x-log)
2018-03-17 14:37 UTC, Paul Donohue
no flags Details
dmesg from second failure (78.10 KB, text/plain)
2018-04-11 17:52 UTC, Paul Donohue
no flags Details
Xorg.0.log from second failure (30.57 KB, text/x-log)
2018-04-11 18:03 UTC, Paul Donohue
no flags Details
xrandr --verbose after second failure (4.48 KB, text/plain)
2018-04-11 18:06 UTC, Paul Donohue
no flags Details

Description Paul Donohue 2018-03-17 14:31:34 UTC
Since I last booted my laptop on 02/11/2018, I've toggled DPMS between "off" and on states many times (at least several times daily).  I don't hibernate or suspend, and I don't use DPMS standby or suspend modes.

When I opened the screen this morning, the backlight didn't come on for ~5 seconds, and the screen remained black even after it did come on.  The system was otherwise responsive (I was able to get in via network), but when I tried a graceful `poweroff`, it hung at some point during the shutdown and never completed it.  /sys/class/drm/card0/error said "No error state collected".  Xorg.0.log is attached, relevant dmesg output is below.

I've seen this behavior a few times before and the kernel stack trace looks familiar, but google doesn't find any similar stack traces, so I figured I should probably file a bug report about it, at least for posterity's sake.

Laptop is a Dell Precision 7510, CM236 chipset, Xeon E3-1545Mv5 processor, Intel P580 graphics, and an nVidia Quadro M1000M discrete video card (which I haven't used since the last reboot, although the nouveau driver is loaded to ensure the card is powered down when not in use and so it can be dynamically enabled via PRIME when needed).  No external monitors were connected when it crashed, although I do use DP and HDMI regularly.

I'm running Ubuntu artful (17.10) with kernel 4.13.0-37-generic, DMC 1.26.

Mar 17 09:13:42 [2906797.343265] [drm:intel_dp_aux_ch [i915]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Mar 17 09:13:42 [2906797.363249] [drm:intel_dp_aux_ch [i915]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Mar 17 09:13:42 [2906797.383251] [drm:intel_dp_aux_ch [i915]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Mar 17 09:13:42 [2906797.403248] [drm:intel_dp_aux_ch [i915]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Mar 17 09:13:42 [2906797.423248] [drm:intel_dp_aux_ch [i915]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Mar 17 09:13:42 [2906797.423265] [drm:intel_dp_aux_ch [i915]] *ERROR* dp_aux_ch not done status 0xad4003ff
Mar 17 09:13:42 [2906797.459228] dp_aux_ch not started status 0xad4003ff
Mar 17 09:13:42 [2906797.459257] ------------[ cut here ]------------
Mar 17 09:13:42 [2906797.459275] WARNING: CPU: 5 PID: 1985 at /build/linux-UKCsxy/linux-4.13.0/drivers/gpu/drm/i915/intel_dp.c:1088 intel_dp_aux_ch+0x6e8/0x720 [i915]
Mar 17 09:13:42 [2906797.459275] Modules linked in: nfnetlink_queue nfnetlink_log btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs algif_skcipher af_alg ccm pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) bnep xt_hashlimit xt_connlimit xt_limit nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack libcrc32c ip6t_REJECT nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_tcpudp nf_log_ipv6 nf_log_ipv4 nf_log_common xt_LOG ip6t_rpfilter ipt_rpfilter ip6table_filter iptable_filter ip6table_mangle ip6_tables iptable_mangle arc4 ip_set nfnetlink snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp kvm_intel iwlmvm kvm mac80211 snd_hda_intel irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc
Mar 17 09:13:42 [2906797.459298]  snd_hda_codec snd_hda_core snd_hwdep dell_laptop aesni_intel dell_smm_hwmon btusb aes_x86_64 iwlwifi crypto_simd btrtl glue_helper cryptd snd_pcm intel_cstate snd_seq_midi snd_seq_midi_event intel_rapl_perf snd_rawmidi dell_wmi joydev input_leds dell_smbios dcdbas serio_raw snd_seq sparse_keymap wmi_bmof cfg80211 snd_seq_device rtsx_pci_ms snd_timer memstick snd mei_me soundcore mei processor_thermal_device intel_pch_thermal shpchp ie31200_edac intel_soc_dts_iosf hci_uart btbcm serdev btqca btintel bluetooth int3403_thermal dell_smo8800 ecdh_generic intel_lpss_acpi intel_lpss int3402_thermal int340x_thermal_zone mac_hid int3400_thermal dell_rbtn acpi_thermal_rel acpi_als kfifo_buf acpi_pad industrialio coretemp firewire_sbp2 firewire_core crc_itu_t parport_pc ppdev lp parport dm_crypt
Mar 17 09:13:42 [2906797.459320]  ip_tables x_tables autofs4 hid_generic usbhid nouveau i915 rtsx_pci_sdmmc ttm mxm_wmi i2c_algo_bit drm_kms_helper syscopyarea e1000e sysfillrect sysimgblt fb_sys_fops ptp nvme psmouse drm pps_core ahci nvme_core rtsx_pci libahci i2c_hid wmi pinctrl_sunrisepoint video hid pinctrl_intel
Mar 17 09:13:42 [2906797.459333] CPU: 5 PID: 1985 Comm: Xorg Tainted: G           OE   4.13.0-32-generic #35-Ubuntu
Mar 17 09:13:42 [2906797.459333] Hardware name: Dell Inc. Precision 7510/0HV79W, BIOS 1.14.4 07/28/2017
Mar 17 09:13:42 [2906797.459334] task: ffff88ba82fbaf80 task.stack: ffff95930882c000
Mar 17 09:13:42 [2906797.459351] RIP: 0010:intel_dp_aux_ch+0x6e8/0x720 [i915]
Mar 17 09:13:42 [2906797.459351] RSP: 0018:ffff95930882f6e0 EFLAGS: 00010282
Mar 17 09:13:42 [2906797.459352] RAX: 0000000000000027 RBX: 00000000ad4003ff RCX: 0000000000000000
Mar 17 09:13:42 [2906797.459353] RDX: 0000000000000000 RSI: ffff88bacdd56578 RDI: ffff88bacdd56578
Mar 17 09:13:42 [2906797.459353] RBP: ffff95930882f788 R08: 0000000000000001 R09: 000000000003e4af
Mar 17 09:13:42 [2906797.459354] R10: ffff95930882f6c8 R11: 0000000000000000 R12: ffff88ba7ca38100
Mar 17 09:13:42 [2906797.459354] R13: 0000000000000005 R14: 0000000000000005 R15: ffff88ba7b030000
Mar 17 09:13:42 [2906797.459355] FS:  00007f615b85c500(0000) GS:ffff88bacdd40000(0000) knlGS:0000000000000000
Mar 17 09:13:42 [2906797.459356] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 17 09:13:42 [2906797.459357] CR2: 00007f9121c4e000 CR3: 000000104398a006 CR4: 00000000003606e0
Mar 17 09:13:42 [2906797.459357] Call Trace:
Mar 17 09:13:42 [2906797.459361]  ? hrtimer_try_to_cancel+0x2a/0x110
Mar 17 09:13:42 [2906797.459363]  ? schedule_hrtimeout_range_clock+0xc5/0x190
Mar 17 09:13:42 [2906797.459379]  intel_dp_aux_transfer+0x170/0x240 [i915]
Mar 17 09:13:42 [2906797.459381]  ? schedule_hrtimeout_range+0x13/0x20
Mar 17 09:13:42 [2906797.459386]  drm_dp_dpcd_access+0x71/0x110 [drm_kms_helper]
Mar 17 09:13:42 [2906797.459389]  drm_dp_dpcd_write+0x1b/0x20 [drm_kms_helper]
Mar 17 09:13:42 [2906797.459404]  intel_dp_sink_dpms+0x54/0xf0 [i915]
Mar 17 09:13:42 [2906797.459420]  intel_ddi_pre_enable+0x31a/0x4c0 [i915]
Mar 17 09:13:42 [2906797.459437]  intel_encoders_pre_enable.isra.95+0x63/0x80 [i915]
Mar 17 09:13:42 [2906797.459454]  haswell_crtc_enable+0x196/0x820 [i915]
Mar 17 09:13:42 [2906797.459470]  intel_update_crtc+0x4b/0xe0 [i915]
Mar 17 09:13:42 [2906797.459484]  skl_update_crtcs+0x268/0x290 [i915]
Mar 17 09:13:42 [2906797.459498]  intel_atomic_commit_tail+0x254/0xf90 [i915]
Mar 17 09:13:42 [2906797.459500]  ? __schedule+0x293/0x880
Mar 17 09:13:42 [2906797.459513]  intel_atomic_commit+0x3d5/0x490 [i915]
Mar 17 09:13:42 [2906797.459523]  ? drm_atomic_check_only+0x37b/0x540 [drm]
Mar 17 09:13:42 [2906797.459529]  drm_atomic_commit+0x4b/0x50 [drm]
Mar 17 09:13:42 [2906797.459533]  drm_atomic_helper_set_config+0x68/0x90 [drm_kms_helper]
Mar 17 09:13:42 [2906797.459540]  __drm_mode_set_config_internal+0x65/0x110 [drm]
Mar 17 09:13:42 [2906797.459546]  drm_mode_setcrtc+0x479/0x630 [drm]
Mar 17 09:13:42 [2906797.459553]  ? drm_mode_cursor_ioctl+0x4a/0x60 [drm]
Mar 17 09:13:42 [2906797.459558]  ? drm_mode_getcrtc+0x180/0x180 [drm]
Mar 17 09:13:42 [2906797.459564]  drm_ioctl_kernel+0x5d/0xb0 [drm]
Mar 17 09:13:42 [2906797.459569]  drm_ioctl+0x31b/0x3d0 [drm]
Mar 17 09:13:42 [2906797.459574]  ? drm_mode_getcrtc+0x180/0x180 [drm]
Mar 17 09:13:42 [2906797.459576]  ? new_sync_read+0xde/0x130
Mar 17 09:13:42 [2906797.459578]  do_vfs_ioctl+0xa5/0x610
Mar 17 09:13:42 [2906797.459579]  ? vfs_read+0x115/0x130
Mar 17 09:13:42 [2906797.459580]  SyS_ioctl+0x79/0x90
Mar 17 09:13:42 [2906797.459582]  entry_SYSCALL_64_fastpath+0x33/0xa3
Mar 17 09:13:42 [2906797.459582] RIP: 0033:0x7f6158cc2ef7
Mar 17 09:13:42 [2906797.459583] RSP: 002b:00007ffd56a4bca8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Mar 17 09:13:42 [2906797.459584] RAX: ffffffffffffffda RBX: 000055e19071ea20 RCX: 00007f6158cc2ef7
Mar 17 09:13:42 [2906797.459584] RDX: 00007ffd56a4bda0 RSI: 00000000c06864a2 RDI: 000000000000000d
Mar 17 09:13:42 [2906797.459585] RBP: 00007ffd56a4bda0 R08: 0000000000000001 R09: 000055e19071e6b0
Mar 17 09:13:42 [2906797.459585] R10: 000055e1910995a0 R11: 0000000000000246 R12: 00000000c06864a2
Mar 17 09:13:42 [2906797.459586] R13: 000000000000000d R14: 00007f615b7c2000 R15: 000055e19071e7a0
Mar 17 09:13:42 [2906797.459587] Code: 7f 2d dd ff be dc 05 00 00 bf e8 03 00 00 bb f0 ff ff ff e8 fb 32 95 ee e9 1c fa ff ff 89 de 48 c7 c7 38 3a 66 c0 e8 39 ad 12 ee <0f> ff 89 1d 88 f4 0b 00 bb f0 ff ff ff e9 fc f9 ff ff e8 21 a2 
Mar 17 09:13:42 [2906797.459605] ---[ end trace cf5e2d572dbb9493 ]---
Mar 17 09:13:49 [2906804.335250] [drm:intel_dp_start_link_train [i915]] *ERROR* failed to enable link training
Mar 17 09:13:54 [2906808.747331] DC6 already programmed to be enabled.
Mar 17 09:13:54 [2906808.747361] ------------[ cut here ]------------
Mar 17 09:13:54 [2906808.747383] WARNING: CPU: 5 PID: 20647 at /build/linux-UKCsxy/linux-4.13.0/drivers/gpu/drm/i915/intel_runtime_pm.c:725 skl_enable_dc6+0xa4/0xb0 [i915]
Mar 17 09:13:54 [2906808.747383] Modules linked in: nfnetlink_queue nfnetlink_log btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs algif_skcipher af_alg ccm pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) bnep xt_hashlimit xt_connlimit xt_limit nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack libcrc32c ip6t_REJECT nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_tcpudp nf_log_ipv6 nf_log_ipv4 nf_log_common xt_LOG ip6t_rpfilter ipt_rpfilter ip6table_filter iptable_filter ip6table_mangle ip6_tables iptable_mangle arc4 ip_set nfnetlink snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp kvm_intel iwlmvm kvm mac80211 snd_hda_intel irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc
Mar 17 09:13:54 [2906808.747402]  snd_hda_codec snd_hda_core snd_hwdep dell_laptop aesni_intel dell_smm_hwmon btusb aes_x86_64 iwlwifi crypto_simd btrtl glue_helper cryptd snd_pcm intel_cstate snd_seq_midi snd_seq_midi_event intel_rapl_perf snd_rawmidi dell_wmi joydev input_leds dell_smbios dcdbas serio_raw snd_seq sparse_keymap wmi_bmof cfg80211 snd_seq_device rtsx_pci_ms snd_timer memstick snd mei_me soundcore mei processor_thermal_device intel_pch_thermal shpchp ie31200_edac intel_soc_dts_iosf hci_uart btbcm serdev btqca btintel bluetooth int3403_thermal dell_smo8800 ecdh_generic intel_lpss_acpi intel_lpss int3402_thermal int340x_thermal_zone mac_hid int3400_thermal dell_rbtn acpi_thermal_rel acpi_als kfifo_buf acpi_pad industrialio coretemp firewire_sbp2 firewire_core crc_itu_t parport_pc ppdev lp parport dm_crypt
Mar 17 09:13:54 [2906808.747434]  ip_tables x_tables autofs4 hid_generic usbhid nouveau i915 rtsx_pci_sdmmc ttm mxm_wmi i2c_algo_bit drm_kms_helper syscopyarea e1000e sysfillrect sysimgblt fb_sys_fops ptp nvme psmouse drm pps_core ahci nvme_core rtsx_pci libahci i2c_hid wmi pinctrl_sunrisepoint video hid pinctrl_intel
Mar 17 09:13:54 [2906808.747445] CPU: 5 PID: 20647 Comm: kworker/5:1 Tainted: G        W  OE   4.13.0-32-generic #35-Ubuntu
Mar 17 09:13:54 [2906808.747446] Hardware name: Dell Inc. Precision 7510/0HV79W, BIOS 1.14.4 07/28/2017
Mar 17 09:13:54 [2906808.747462] Workqueue: events edp_panel_vdd_work [i915]
Mar 17 09:13:54 [2906808.747463] task: ffff88ba8380af80 task.stack: ffff959308de0000
Mar 17 09:13:54 [2906808.747475] RIP: 0010:skl_enable_dc6+0xa4/0xb0 [i915]
Mar 17 09:13:54 [2906808.747476] RSP: 0018:ffff959308de3dc8 EFLAGS: 00010282
Mar 17 09:13:54 [2906808.747477] RAX: 0000000000000025 RBX: ffff88ba7b030000 RCX: 0000000000000000
Mar 17 09:13:54 [2906808.747477] RDX: 0000000000000000 RSI: ffff88bacdd56578 RDI: ffff88bacdd56578
Mar 17 09:13:54 [2906808.747478] RBP: ffff959308de3dd0 R08: 0000000000000001 R09: 000000000003e4eb
Mar 17 09:13:54 [2906808.747478] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88ba7b030000
Mar 17 09:13:54 [2906808.747479] R13: ffff88ba7b034b80 R14: ffffffffc06540e0 R15: 0000000010000000
Mar 17 09:13:54 [2906808.747479] FS:  0000000000000000(0000) GS:ffff88bacdd40000(0000) knlGS:0000000000000000
Mar 17 09:13:54 [2906808.747480] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 17 09:13:54 [2906808.747480] CR2: 00007f9121c51000 CR3: 0000000657e0a005 CR4: 00000000003606e0
Mar 17 09:13:54 [2906808.747481] Call Trace:
Mar 17 09:13:54 [2906808.747494]  gen9_dc_off_power_well_disable+0x2a/0x40 [i915]
Mar 17 09:13:54 [2906808.747505]  intel_power_well_disable+0x3e/0x50 [i915]
Mar 17 09:13:54 [2906808.747516]  intel_display_power_put+0xb9/0x120 [i915]
Mar 17 09:13:54 [2906808.747531]  edp_panel_vdd_work+0x5f/0x70 [i915]
Mar 17 09:13:54 [2906808.747534]  process_one_work+0x1e7/0x410
Mar 17 09:13:54 [2906808.747535]  worker_thread+0x4b/0x420
Mar 17 09:13:54 [2906808.747536]  kthread+0x125/0x140
Mar 17 09:13:54 [2906808.747537]  ? process_one_work+0x410/0x410
Mar 17 09:13:54 [2906808.747538]  ? kthread_create_on_node+0x70/0x70
Mar 17 09:13:54 [2906808.747539]  ? SyS_exit+0x17/0x20
Mar 17 09:13:54 [2906808.747541]  ret_from_fork+0x1f/0x30
Mar 17 09:13:54 [2906808.747542] Code: 05 ce 21 15 00 01 e8 4d 2f 1b ee 0f ff eb 99 80 3d bd 21 15 00 00 75 a7 48 c7 c7 48 3e 65 c0 c6 05 ad 21 15 00 01 e8 2d 2f 1b ee <0f> ff eb 90 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 83 bf 38 
Mar 17 09:13:54 [2906808.747556] ---[ end trace cf5e2d572dbb9494 ]---
Mar 17 09:13:54 [2906808.747569] [drm:gen9_set_dc_state [i915]] *ERROR* DC state mismatch (0x0 -> 0x2)
Comment 1 Paul Donohue 2018-03-17 14:37:17 UTC
Created attachment 138172 [details]
Xorg.0.log
Comment 2 Jani Saarinen 2018-03-29 07:10:21 UTC
First of all. Sorry about spam.
This is mass update for our bugs. 

Sorry if you feel this annoying but with this trying to understand if bug still valid or not.
If bug investigation still in progress, please ignore this and I apologize!

If you think this is not anymore valid, please comment to the bug that can be closed.
If you haven't tested with our latest pre-upstream tree(drm-tip), can you do that also to see if issue is valid there still and if you cannot see issue there, please comment to the bug.
Comment 3 Paul Donohue 2018-04-11 17:50:45 UTC
This happened again today.

Laptop was last booted on 04/01/2018.  I was working on my laptop on battery (last time this happened it was on A/C power), closed my laptop lid, opened it again about a minute later, the backlight did not immediately come on like it usually does, the backlight came on after a few seconds, but the screen remained black after the backlight came on.  I closed and re-opened the lid again, and the backlight turned off and never came back on.

I plugged in an external monitor and managed to enable it, and everything seems to be working fine on that external monitor.

Same kernel and DMC as before: kernel 4.13.0-37-generic, DMC 1.26

Stack trace in dmesg looks the same as before (I will attach it).  Xorg.0.log has nothing interesting in it.  /sys/class/drm/card0/error says "No error state collected".  xrandr says "xrandr: Output eDP1 is not disconnected but has no modes".  xrandr --verbose says the eDP1 backlight is on, but it definitely is not.  The brightness buttons on the keyboard don't seem to do anything.
Comment 4 Paul Donohue 2018-04-11 17:52:44 UTC
Created attachment 138766 [details]
dmesg from second failure
Comment 5 Paul Donohue 2018-04-11 18:03:41 UTC
Created attachment 138767 [details]
Xorg.0.log from second failure
Comment 6 Paul Donohue 2018-04-11 18:06:21 UTC
Created attachment 138768 [details]
xrandr --verbose after second failure
Comment 7 Jani Saarinen 2018-04-12 05:22:01 UTC
Please try with drm-tip and with latest 1.27 DMC.

drm-tip: https://cgit.freedesktop.org/drm-tip
dmc: https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915
Comment 8 Jani Saarinen 2018-04-20 19:03:42 UTC
ping.
Comment 9 Paul Donohue 2018-04-20 19:56:39 UTC
I'm testing with drm-tip and DMC 1.27 ... but given the rarity of this, it will probably be several weeks before I can say for sure whether this is still happening or not.
Comment 10 Jani Saarinen 2018-04-20 20:08:09 UTC
OK, will follow again in few weeks, thanks.
Comment 11 Jani Saarinen 2018-05-04 12:24:30 UTC
friendly ping.
Comment 12 Jani Saarinen 2018-05-17 08:56:19 UTC
Let's resolve, please re-open if still issues.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.