Bug 104633 - Signal repeatedly lost with USB-C connection to monitor
Summary: Signal repeatedly lost with USB-C connection to monitor
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: Other All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-01-15 08:29 UTC by nico-freedesktop.org
Modified: 2018-09-10 16:11 UTC (History)
2 users (show)

See Also:
i915 platform: KBL
i915 features: display/atomic


Attachments
dmesg.txt (208.23 KB, text/plain)
2018-01-17 14:51 UTC, nico-freedesktop.org
no flags Details
dmesg post crash (500.81 KB, text/plain)
2018-01-18 14:57 UTC, nico-freedesktop.org
no flags Details

Description nico-freedesktop.org 2018-01-15 08:29:21 UTC
I am connecting a Dell Latitude 5285 to a Samsung S27H850 by a USB-C cable (Hama USB 3.1, "full featured").

It turns out to be stable working until the first time I suspend.

After suspending, the monitor displays the picture for about a second, turns black and after about a minute it repeats the same behaviour.

While the screen is black, xrandr does not show a signal.

I checked the monitor + cable with a macbook that works fine with the setup.

I originally reported the bug on https://bugzilla.kernel.org/show_bug.cgi?id=198469 

My packages/versions are:


[9:20] manager:~% pacman -Q | grep -e xorg -e xf86 -e ^linux; uname -r
libxxf86dga 1.1.4-1
libxxf86vm 1.1.4-1
linux 4.14.13-1
linux-api-headers 4.14.8-1
linux-firmware 20171206.fdee922-1
xf86-input-libinput 0.26.0-1
xf86-input-wacom 0.35.0-1
xf86-video-intel 1:2.99.917+802+gaf6d8e9e-1
xf86dgaproto 2.1-3
xf86vidmodeproto 2.3.1-3
xorg-appres 1.0.4-1
xorg-bdftopcf 1.1-1
xorg-font-util 1.3.1-1
xorg-font-utils 7.6-4
xorg-fonts-alias 1.0.3-1
xorg-fonts-encodings 1.0.4-4
xorg-fonts-misc 1.0.3-5
xorg-luit 1.1.1-2
xorg-mkfontdir 1.0.7-8
xorg-mkfontscale 1.1.2-1
xorg-server 1.19.6-2
xorg-server-common 1.19.6-2
xorg-server-xvfb 1.19.6-2
xorg-setxkbmap 1.3.1-1
xorg-xauth 1.0.10-1
xorg-xbacklight 1.2.1-1
xorg-xdpyinfo 1.3.2-1
xorg-xev 1.2.2-1
xorg-xinit 1.3.4-4
xorg-xinput 1.6.2-1
xorg-xkbcomp 1.4.0-1
xorg-xkill 1.0.4-1
xorg-xmodmap 1.0.9-1
xorg-xrandr 1.5.0-1
xorg-xrdb 1.1.0-2
xorg-xset 1.2.3-1
xorg-xwd 1.0.6-1
xorg-xwininfo 1.1.3-1
4.14.13-1-ARCH
[9:20] manager:~% 

Digging through the logs around the time the problem occured, I see the following (not sure if related or even the reason):

Jan 14 12:51:35 manager kernel: RSP: 002b:00007fff80429728 EFLAGS: 00000246
Jan 14 12:51:35 manager kernel: Failed to release pages: bind_count=1, pages_pin_count=1, pin_display=0
Jan 14 12:51:35 manager kernel: ------------[ cut here ]------------
Jan 14 12:51:35 manager kernel: WARNING: CPU: 1 PID: 151 at drivers/gpu/drm/i915/i915_gem_userptr.c:89 cancel_userptr+0xdc/0xe0 [i915]
Jan 14 12:51:35 manager kernel: Modules linked in: cmac md4 nls_utf8 cifs dns_resolver fscache fuse tun ccm snd_hda_codec_hdmi nls_iso8859_1 nls_cp437 mei_wdt vfat fat snd_hda_codec_realtek snd_hda_codec_generic btusb btrtl hid_sensor_magn_3d hid_sensor_incl_3d hid_sensor_rotation hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer hid_sensor_iio_common joydev mousedev hid_sensor_hub msr wmi_bmof dell_wmi dell_smbios iTCO_wdt iTCO_vendor_support wacom intel_ishtp_hid ppdev snd_soc_skl arc4 snd_soc_skl_ipc dcdbas snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_ext_core dell_smm_hwmon snd_soc_sst_match tpm_crb intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel iwlmvm kvm mac80211 irqbypass intel_cstate intel_rapl_perf evdev input_leds led_class mac_hid iwlwifi
Jan 14 12:51:35 manager kernel:  snd_soc_core snd_compress snd_pcm_dmaengine ac97_bus cfg80211 rtsx_pci_ms memstick snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer i2c_i801 snd soundcore mei_me mei shpchp wmi i915 i2c_hid hci_uart btbcm btqca btintel int3403_thermal intel_vbtn bluetooth soc_button_array parport_pc parport i2c_algo_bit drm_kms_helper ov5693(C) v4l2_common ecdh_generic drm rfkill video videodev pinctrl_sunrisepoint pinctrl_intel intel_gtt agpgart tpm_tis intel_lpss_acpi tpm_tis_core syscopyarea sysfillrect intel_hid media int3402_thermal tpm sysimgblt battery intel_ish_ipc int3400_thermal button fb_sys_fops sparse_keymap intel_ishtp acpi_thermal_rel idma64 ac acpi_pad processor_thermal_device int340x_thermal_zone intel_lpss_pci intel_soc_dts_iosf intel_lpss intel_pch_thermal acpi_als
Jan 14 12:51:35 manager kernel:  kfifo_buf thermal industrialio sch_fq_codel ip_tables x_tables ext4 crc16 mbcache jbd2 fscrypto algif_skcipher af_alg hid_multitouch usbhid hid dm_crypt dm_mod rtsx_pci_sdmmc mmc_core serio_raw atkbd libps2 crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd xhci_pci rtsx_pci xhci_hcd nvme nvme_core usbcore usb_common i8042 serio
Jan 14 12:51:35 manager kernel: CPU: 1 PID: 151 Comm: kworker/u8:2 Tainted: G         C      4.14.13-1-ARCH #1
Jan 14 12:51:35 manager kernel: Hardware name: Dell Inc. Latitude 5285/0P8F3H, BIOS 1.1.11 08/18/2017
Jan 14 12:51:35 manager kernel: Workqueue: i915-userptr-release cancel_userptr [i915]
Jan 14 12:51:35 manager kernel: task: ffff997bfbbc0000 task.stack: ffffb05a41e14000
Jan 14 12:51:35 manager kernel: RIP: 0010:cancel_userptr+0xdc/0xe0 [i915]
Jan 14 12:51:35 manager kernel: RSP: 0018:ffffb05a41e17e80 EFLAGS: 00010282
Jan 14 12:51:35 manager kernel: RAX: 0000000000000047 RBX: ffff997aea3a5600 RCX: 0000000000000000
Jan 14 12:51:35 manager kernel: RDX: 0000000000000000 RSI: ffff997c0e496598 RDI: ffff997c0e496598
Jan 14 12:51:35 manager kernel: RBP: ffff997aea3a57b0 R08: 0000000000000001 R09: 00000000000003d6
Jan 14 12:51:35 manager kernel: R10: ffffe19f8f952500 R11: 0000000000000000 R12: 0000000000000000
Jan 14 12:51:35 manager kernel: R13: 0000000000000000 R14: 0ffff997bf9f8031 R15: ffff997bf8d68600
Jan 14 12:51:35 manager kernel: FS:  0000000000000000(0000) GS:ffff997c0e480000(0000) knlGS:0000000000000000
Jan 14 12:51:35 manager kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 14 12:51:35 manager kernel: CR2: 00001e30c1e14000 CR3: 000000035400a001 CR4: 00000000003606e0
Jan 14 12:51:35 manager kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jan 14 12:51:35 manager kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jan 14 12:51:35 manager kernel: Call Trace:
Jan 14 12:51:35 manager kernel:  process_one_work+0x1db/0x410
Jan 14 12:51:35 manager kernel:  worker_thread+0x2b/0x3d0
Jan 14 12:51:35 manager kernel:  ? process_one_work+0x410/0x410
Jan 14 12:51:35 manager kernel:  kthread+0x118/0x130
Jan 14 12:51:35 manager kernel:  ? kthread_create_on_node+0x70/0x70
Jan 14 12:51:35 manager kernel:  ret_from_fork+0x1f/0x30
Jan 14 12:51:35 manager kernel: Code: bf 14 00 00 75 d7 8b 93 d0 01 00 00 8b 8b ac 01 00 00 48 c7 c7 a0 96 9d c0 8b b3 a4 01 00 00 c6 05 12 bf 14 00 01 e8 cf 7d 7c fa <0f> ff eb ae 0f 1f 44 00 00 41 57 41 56 ba 08 00 00 00 41 55 41 
Jan 14 12:51:35 manager kernel: ---[ end trace a3c6f6c4e849c776 ]---


And later:

Jan 14 14:43:44 manager kernel: [drm] Reducing the compressed framebuffer size. This may lead to less power savings than a non-reduced-size. Try to increase stolen memory size if available in BIOS.
Jan 14 14:43:51 manager kernel: usb 1-3: new high-speed USB device number 31 using xhci_hcd
Jan 14 14:43:52 manager kernel: acpi INT3400:00: Unsupported event [0x86]
Comment 1 Chris Wilson 2018-01-15 08:51:48 UTC
(In reply to nico-freedesktop.org from comment #0)
> I am connecting a Dell Latitude 5285 to a Samsung S27H850 by a USB-C cable
> (Hama USB 3.1, "full featured").
> 
> It turns out to be stable working until the first time I suspend.
> 

> Digging through the logs around the time the problem occured, I see the
> following (not sure if related or even the reason):
> 
> Jan 14 12:51:35 manager kernel: RSP: 002b:00007fff80429728 EFLAGS: 00000246
> Jan 14 12:51:35 manager kernel: Failed to release pages: bind_count=1,
> pages_pin_count=1, pin_display=0

That's an unrelated, mild warning.
 
> And later:
> 
> Jan 14 14:43:44 manager kernel: [drm] Reducing the compressed framebuffer
> size. This may lead to less power savings than a non-reduced-size. Try to
> increase stolen memory size if available in BIOS.
> Jan 14 14:43:51 manager kernel: usb 1-3: new high-speed USB device number 31
> using xhci_hcd
> Jan 14 14:43:52 manager kernel: acpi INT3400:00: Unsupported event [0x86]

Indicates that the USB connection dropped and triggered an interrupt that the acpi driver didn't handle. That maybe relevant.

Please attach the drm.debug=0xe dmesg from across the suspend to see if their is a problem closer to home.
Comment 2 nico-freedesktop.org 2018-01-17 14:51:56 UTC
Created attachment 136807 [details]
dmesg.txt
Comment 3 nico-freedesktop.org 2018-01-17 14:52:31 UTC
Just rebooted with dmesg.debug=0xe and had the screen connected on boot.

Funnily this time it does not require a suspend at all for the flacky behaviour.

Attached above is the dmesg output.
Comment 4 nico-freedesktop.org 2018-01-18 14:57:40 UTC
Created attachment 136830 [details]
dmesg post crash

After disconnecting the monitor from the display port adapter and then connecting it directly via usb-c, the connection is also stable.

Until I suspend, then xorg crashes. Attached is the dmesg after the crash.
Comment 5 Jani Saarinen 2018-03-29 07:10:06 UTC
First of all. Sorry about spam.
This is mass update for our bugs. 

Sorry if you feel this annoying but with this trying to understand if bug still valid or not.
If bug investigation still in progress, please ignore this and I apologize!

If you think this is not anymore valid, please comment to the bug that can be closed.
If you haven't tested with our latest pre-upstream tree(drm-tip), can you do that also to see if issue is valid there still and if you cannot see issue there, please comment to the bug.
Comment 6 nico-freedesktop.org 2018-03-29 11:51:13 UTC
Yes, the problem still exists in 4.15.11-1-ARCH with the following xorg components:

[root@manager ~]# pacman -Q | grep xorg
xorg-appres 1.0.5-1
xorg-bdftopcf 1.1-1
xorg-font-util 1.3.1-1
xorg-font-utils 7.6-4
xorg-fonts-alias 1.0.3-1
xorg-fonts-encodings 1.0.4-4
xorg-fonts-misc 1.0.3-5
xorg-luit 1.1.1-2
xorg-mkfontdir 1.0.7-8
xorg-mkfontscale 1.1.3-1
xorg-server 1.19.6+13+gd0d1a694f-1
xorg-server-common 1.19.6+13+gd0d1a694f-1
xorg-server-xvfb 1.19.6+13+gd0d1a694f-1
xorg-setxkbmap 1.3.1-1
xorg-xauth 1.0.10-1
xorg-xbacklight 1.2.2-1
xorg-xdpyinfo 1.3.2-1
xorg-xev 1.2.2-1
xorg-xinit 1.4.0-3
xorg-xinput 1.6.2-1
xorg-xkbcomp 1.4.1-1
xorg-xkill 1.0.5-1
xorg-xmodmap 1.0.9-1
xorg-xrandr 1.5.0-1
xorg-xrdb 1.1.1-1
xorg-xset 1.2.4-1
xorg-xwd 1.0.7-1
xorg-xwininfo 1.1.4-1
xorgproto 2018.4-1
[root@manager ~]# 

Jani, do you have a link to the latst upstream git repo? And it only requires to replace the .so from the intel driver & xorg restart or full system reboot?
Comment 7 Jani Saarinen 2018-03-29 14:17:52 UTC
Latest drm-tip from here: https://cgit.freedesktop.org/drm-tip
Comment 8 Jani Saarinen 2018-04-25 11:04:51 UTC
Closing, please re-open is issue still exists.
Comment 9 nico-freedesktop.org 2018-04-25 11:30:22 UTC
It still exists on the same device. Cannot reproduce it on Lenovo X1 carbon, but certainly exists for the Dell Latidute 5285.
Comment 10 Jani Saarinen 2018-04-25 15:01:00 UTC
Did you test latest drm-tip mentioned on comment #7?
Comment 11 Jani Saarinen 2018-04-25 15:04:11 UTC
Jani, any advice here?
Comment 12 Lakshmi 2018-09-10 16:11:51 UTC
No feedback from many months, closing as resolved works for me.

Please try to reproduce the error using drm-tip (https://cgit.freedesktop.org/drm-tip) and kernel parameters drm.debug=0x1e log_buf_len=4M, and if the problem persists attach the full dmesg from boot.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.