|Summary:||Got call trace message "eDP powered off while attempting aux channel communication" while boot up|
|Component:||DRM/Intel||Assignee:||Imre Deak <imre.deak>|
|Status:||RESOLVED FIXED||QA Contact:||Intel GFX Bugs mailing list <intel-gfx-bugs>|
|Priority:||not set||CC:||imre.deak, intel-gfx-bugs, james.ausmus, jani.nikula, tjaalton|
|i915 platform:||CML||i915 features:||display/eDP|
Description Cyrus 2019-10-18 09:12:48 UTC
Created attachment 145771 [details] dmesg Bug description: Got the call trace message during system boot up. [ 2.999233] ------------[ cut here ]------------ [ 2.999238] eDP powered off while attempting aux channel communication. [ 2.999338] WARNING: CPU: 2 PID: 296 at drivers/gpu/drm/i915/display/intel_dp.c:1148 intel_dp_aux_xfer+0x767/0x900 [i915] [ 2.999341] Modules linked in: i915(+) i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm r8169 ahci realtek libahci wmi video [ 2.999350] CPU: 2 PID: 296 Comm: systemd-udevd Not tainted 5.3.0-997-generic #201910072204 [ 2.999352] Hardware name: Dell Inc. OptiPlex 3280 AIO/, BIOS 99.00.08 10/01/2019 [ 2.999391] RIP: 0010:intel_dp_aux_xfer+0x767/0x900 [i915] [ 2.999394] Code: e8 ee b5 ff ff 84 c0 0f 85 ec f9 ff ff 48 8b 7d 80 e8 5d b6 ff ff 84 c0 0f 85 db f9 ff ff 48 c7 c7 50 22 5d c0 e8 84 ea b5 d1 <0f> 0b 48 8b 7d 80 48 8d 75 90 4c 8d ab a8 07 00 00 e8 63 b4 ff ff [ 2.999398] RSP: 0018:ffffb7000040b5b8 EFLAGS: 00010286 [ 2.999400] RAX: 0000000000000000 RBX: ffff9bfa9dc20000 RCX: 0000000000000396 [ 2.999402] RDX: 0000000000000001 RSI: 0000000000000082 RDI: 0000000000000246 [ 2.999404] RBP: ffffb7000040b698 R08: 0000000000000396 R09: 0000000000000019 [ 2.999406] R10: 000000000000072e R11: ffffb7000040b420 R12: 0000000000000004 [ 2.999408] R13: ffff9bfa9fbca120 R14: ffff9bfa9dc20000 R15: ffff9bfa9dc207a8 [ 2.999410] FS: 00007f0d8613a680(0000) GS:ffff9bfaaec80000(0000) knlGS:0000000000000000 [ 2.999412] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 2.999414] CR2: 00007ffca955ae58 CR3: 000000041e67e002 CR4: 00000000003606e0 [ 2.999416] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 2.999418] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [ 2.999420] Call Trace: [ 2.999425] ? vprintk_default+0x29/0x50 [ 2.999464] intel_dp_aux_transfer+0x15e/0x220 [i915] [ 2.999472] drm_dp_dpcd_access+0x72/0x110 [drm_kms_helper] [ 2.999478] drm_dp_dpcd_read+0x33/0xc0 [drm_kms_helper] [ 2.999515] intel_dp_read_dpcd+0x41/0x130 [i915] [ 2.999551] intel_edp_init_connector+0x136/0x540 [i915] [ 2.999587] intel_dp_init_connector+0x47f/0x870 [i915] [ 2.999623] intel_ddi_init+0x425/0x590 [i915] [ 2.999660] intel_setup_outputs+0x8e4/0xc60 [i915] System environment: -- chipset: CML-S -- system architecture: 64-bit -- xf86-video-intel: xserver-xorg-video-intel-hwe-18.04 2:2.99.917+git20171229-1ubuntu1~18.04.1 -- xserver: xserver-xorg-core-hwe-18.04 2:1.20.4-1ubuntu3~18.04.2 -- mesa: 19.0.8-0ubuntu0~18.04.3 -- libdrm: 2.4.99-1 -- kernel: 5.2.9-arch1-1-ARCH -- Linux distribution: Ubuntu -- Machine or mobo model: Dell Inc. OptiPlex 3280 AIO -- Display connector: eDP-1
Comment 2 Lakshmi 2019-10-19 07:19:43 UTC
(In reply to Cyrus from comment #1) > Created attachment 145772 [details] > lspci -vv Please try to reproduce this issue using drm-tip (https://cgit.freedesktop.org/drm-tip) with kernel parameters drm.debug=0x1e log_buf_len=4M. If the problem persists attach the full dmesg from boot.
Comment 3 Cyrus 2019-10-21 07:11:50 UTC
Created attachment 145783 [details] dmesg from drm-tip 54d0d8863278 Still observed lot of call trace messages with drm-tip (54d0d8863278).
Comment 4 Jani Saarinen 2019-10-21 07:25:23 UTC
On CI there is also CML-S. Is that seen there? https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7137/fi-cml-s/boot0.txt
Comment 5 Imre Deak 2019-10-21 11:44:24 UTC
Created attachment 145786 [details] [review] Fix PCH type detection Could you try the attached patch?
Comment 6 Cyrus 2019-10-22 03:28:53 UTC
Created attachment 145790 [details] dmesg with 0001-drm-i915-Add-new-CNL-PCH-ID-seen-on-a-CML-platform.patch No more drm call trace messages with 0001-drm-i915-Add-new-CNL-PCH-ID-seen-on-a-CML-platform.patch. Thanks!
Comment 7 Imre Deak 2019-10-23 19:00:29 UTC
Thanks for the report, the fix is now merged.
Comment 8 Cyrus 2019-10-24 10:03:02 UTC
Created attachment 145804 [details] dmesg with patch on Vostro I have got bad news, the patch causes black screen issue on another one specific machine.
Comment 9 Cyrus 2019-10-24 10:05:15 UTC
Created attachment 145805 [details] dmesg without patch on Vostro
Comment 10 Imre Deak 2019-10-24 10:33:31 UTC
(In reply to Cyrus from comment #8) > Created attachment 145804 [details] > dmesg with patch on Vostro > > I have got bad news, the patch causes black screen issue on another one > specific machine. Nothing obvious from the log, the only issue I see is [ 2.020663] irq 45: nobody cared (try booting with the "irqpoll" option) [ 2.020667] CPU: 3 PID: 0 Comm: swapper/3 Not tainted 5.4.0-rc3+ #1 [ 2.020671] Hardware name: Dell Inc. Vostro 3681/06CV2N, BIOS 0.6.13 10/07/2019 [ 2.020673] Call Trace: [ 2.020679] <IRQ> [ 2.020683] dump_stack+0x63/0x85 [ 2.020685] __report_bad_irq+0x35/0xc0 [ 2.020687] note_interrupt+0x24b/0x2a0 [ 2.020689] handle_irq_event_percpu+0x54/0x80 [ 2.020690] handle_irq_event+0x3b/0x60 [ 2.020691] handle_fasteoi_irq+0x7c/0x130 [ 2.020693] do_IRQ+0x54/0xe0 [ 2.020695] common_interrupt+0xf/0xf [ 2.020696] </IRQ> but that's in both logs. Did you try this with the latest drm-tip and then with the patch reverted on top top of that? Is it 100% reproducible? Do you get an FB console? (booting to bootlevel 3 adding "3" to the kernel parameters).
Comment 11 Cyrus 2019-10-25 05:03:01 UTC
(In reply to Imre Deak from comment #10) > (In reply to Cyrus from comment #8) > > Created attachment 145804 [details] > > dmesg with patch on Vostro > > > > I have got bad news, the patch causes black screen issue on another one > > specific machine. > > Nothing obvious from the log, the only issue I see is > > [ 2.020663] irq 45: nobody cared (try booting with the "irqpoll" option) > [ 2.020667] CPU: 3 PID: 0 Comm: swapper/3 Not tainted 5.4.0-rc3+ #1 > [ 2.020671] Hardware name: Dell Inc. Vostro 3681/06CV2N, BIOS 0.6.13 > 10/07/2019 > [ 2.020673] Call Trace: > [ 2.020679] <IRQ> > [ 2.020683] dump_stack+0x63/0x85 > [ 2.020685] __report_bad_irq+0x35/0xc0 > [ 2.020687] note_interrupt+0x24b/0x2a0 > [ 2.020689] handle_irq_event_percpu+0x54/0x80 > [ 2.020690] handle_irq_event+0x3b/0x60 > [ 2.020691] handle_fasteoi_irq+0x7c/0x130 > [ 2.020693] do_IRQ+0x54/0xe0 > [ 2.020695] common_interrupt+0xf/0xf > [ 2.020696] </IRQ> > > but that's in both logs. > > Did you try this with the latest drm-tip and then with the patch reverted on > top top of that? Tried the patch on top of drm-tip (commit 54d0d8863278). With the patch that Vostro machine got black screen after boot and it become normal after revert the patch. > > Is it 100% reproducible? Yes > > Do you get an FB console? (booting to bootlevel 3 adding "3" to the kernel > parameters). Yes, can get to console in runlevel 3.
Comment 12 Imre Deak 2019-10-28 18:38:41 UTC
Created attachment 145836 [details] [review] Attempt to fix the VBT DDC pin mapping The problem seems to be that the VBT DDC pin mapping info on your CFL doesn't match the ones we use for CNP PCHs based on the VBT specification. Either the VBT on your machines or the VBT spec we use is incorrect or my assumption that the PCH with ID 0xA3C1 was wrong; I'll try to clarify this. For now could you try if the attached patch gets rid of the problem on the CFL machine? Please also try your CML machine which should have an HDMI output and I suspect it has the same problem.
Comment 13 Imre Deak 2019-10-30 16:34:36 UTC
Created attachment 145843 [details] [review] Fix PCH type detection again The PCH is actually a CMP-V on both of these machines. CMP-V is in turn based on KBP not on CNP like other CMP PCHs. My first attempt to fix the detection is thus incorrect, since the DDC pin routing is different on KBP as it's on CNP. I attached a new attempt to fix it correctly. Cyrus, could you try the new patch on top of drm-tip if it works with both machines, trying also any external outputs? Thanks.
Comment 14 Cyrus 2019-10-31 07:10:38 UTC
Both machines works well with patch 0001-drm-i915-Fix-detection-for-a-CMP-V-PCH.patch on top of drm-tip (5efc505498d2). External outputs also works well on both machines. Thanks for the help!
Comment 15 Cyrus 2019-10-31 07:11:47 UTC
Created attachment 145844 [details] dmesg-optiplex-AIO-drm-i915-fix-detection-for-a-cmp-v-pch.log
Comment 16 Cyrus 2019-10-31 07:12:13 UTC
Created attachment 145845 [details] dmesg-vostro-desktop-drm-i915-fix-detection-for-a-cmp-v-pch.log
Comment 19 Timo Aaltonen 2019-11-12 10:00:17 UTC
the new patch isn't merged yet
Comment 20 Imre Deak 2019-11-13 12:22:04 UTC
Patch now is merged: commit 015330c6f76fccf47b2227048adc44dc453a17be Author: Imre Deak <email@example.com> Date: Wed Oct 30 18:05:48 2019 +0200 drm/i915: Fix detection for a CMP-V PCH Thanks for the reports, tests, re-reports and re-tests, hopefully now it's fixed properly.