Bug 108512 - [CI][BAT] igt@gem_exec_suspend@basic-s3 - dmesg-warn - *ERROR* Timed out waiting for PSR Idle State
Summary: [CI][BAT] igt@gem_exec_suspend@basic-s3 - dmesg-warn - *ERROR* Timed out wait...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: Other All
: high normal
Assignee: James Ausmus
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-10-22 12:23 UTC by Martin Peres
Modified: 2018-11-02 15:50 UTC (History)
1 user (show)

See Also:
i915 platform: ICL
i915 features: display/PSR


Attachments

Description Martin Peres 2018-10-22 12:23:16 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5016/fi-icl-u/igt@gem_exec_suspend@basic-s3.html

<7> [420.263551] [drm:intel_psr_disable_locked [i915]] Disabling PSR1
<3> [422.263758] [drm:intel_psr_disable_locked [i915]] *ERROR* Timed out waiting for PSR Idle State
<7> [422.264364] [drm:intel_edp_backlight_off [i915]] 
<7> [422.467077] [drm:intel_panel_actually_set_backlight [i915]] set backlight PWM = 0
<7> [422.467436] [drm:intel_disable_pipe [i915]] disabling pipe A
<4> [422.568819] ------------[ cut here ]------------
<4> [422.568828] pipe_off wait timed out
<4> [422.569002] WARNING: CPU: 6 PID: 2416 at drivers/gpu/drm/i915/intel_display.c:1049 intel_disable_pipe+0x131/0x170 [i915]
<4> [422.569008] Modules linked in: vgem snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic i915 x86_pkg_temp_thermal ax88179_178a usbnet coretemp mii crct10dif_pclmul crc32_pclmul snd_hda_intel ghash_clmulni_intel snd_hda_codec snd_hwdep e1000e snd_hda_core snd_pcm prime_numbers
<4> [422.569116] CPU: 6 PID: 2416 Comm: kworker/u16:18 Tainted: G     U  W         4.19.0-rc8-CI-CI_DRM_5016+ #1
<4> [422.569122] Hardware name: Intel Corporation Ice Lake Client Platform/IceLake U DDR4 SODIMM PD RVP, BIOS ICLSFWR1.R00.2313.A01.1808012121 08/01/2018
<4> [422.569135] Workqueue: events_unbound async_run_entry_fn
<4> [422.569253] RIP: 0010:intel_disable_pipe+0x131/0x170 [i915]
<4> [422.569261] Code: 6a 00 8d b4 0a 08 00 07 00 31 c9 ba 00 00 00 40 e8 74 29 fe ff 85 c0 5a 0f 84 55 ff ff ff 48 c7 c7 96 c8 2e a0 e8 9f 49 e5 e0 <0f> 0b e9 42 ff ff ff 85 d2 0f 84 6c ff ff ff 48 83 c4 08 89 ee 48
<4> [422.569267] RSP: 0018:ffffc9000063bae0 EFLAGS: 00010286
<4> [422.569278] RAX: 0000000000000000 RBX: ffff88049ea40000 RCX: 0000000000000001
<4> [422.569284] RDX: 0000000080000001 RSI: ffffffff8212508a RDI: 00000000ffffffff
<4> [422.569289] RBP: 000000000007f008 R08: 000000003eaf5b6a R09: 0000000000000000
<4> [422.569295] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000003
<4> [422.569301] R13: ffff88049088efc8 R14: ffff88049ea40000 R15: ffff88049508d3d8
<4> [422.569308] FS:  0000000000000000(0000) GS:ffff8804b0780000(0000) knlGS:0000000000000000
<4> [422.569314] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [422.569320] CR2: 0000561b17445d80 CR3: 0000000005210003 CR4: 0000000000760ee0
<4> [422.569326] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
<4> [422.569331] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
<4> [422.569336] PKRU: 55555554
<4> [422.569341] Call Trace:
<4> [422.569451]  haswell_crtc_disable+0xd9/0x140 [i915]
<4> [422.569566]  intel_atomic_commit_tail+0x7c7/0xd30 [i915]
<4> [422.569684]  intel_atomic_commit+0x244/0x330 [i915]
<4> [422.569705]  __drm_atomic_helper_disable_all.constprop.15+0x124/0x150
<4> [422.569720]  drm_atomic_helper_suspend+0x76/0xd0
<4> [422.569837]  intel_display_suspend+0xd/0x50 [i915]
<4> [422.569913]  i915_drm_suspend+0x39/0x100 [i915]
<4> [422.569930]  pci_pm_suspend+0x6d/0x120
<4> [422.569940]  ? pci_pm_freeze+0xc0/0xc0
<4> [422.569952]  dpm_run_callback+0x64/0x280
<4> [422.569968]  __device_suspend+0x12a/0x5b0
<4> [422.569984]  ? dpm_watchdog_set+0x60/0x60
<4> [422.570007]  async_suspend+0x15/0x90
<4> [422.570018]  async_run_entry_fn+0x34/0x160
<4> [422.570037]  process_one_work+0x245/0x610
<4> [422.570063]  worker_thread+0x37/0x380
<4> [422.570079]  ? process_one_work+0x610/0x610
<4> [422.570087]  kthread+0x119/0x130
<4> [422.570097]  ? kthread_park+0x80/0x80
<4> [422.570113]  ret_from_fork+0x3a/0x50
<4> [422.570146] irq event stamp: 14906
<4> [422.570156] hardirqs last  enabled at (14905): [<ffffffff810f9b0e>] vprintk_emit+0x2ee/0x310
<4> [422.570165] hardirqs last disabled at (14906): [<ffffffff81001930>] trace_hardirqs_off_thunk+0x1a/0x1c
<4> [422.570175] softirqs last  enabled at (14192): [<ffffffff81c0031d>] __do_softirq+0x31d/0x483
<4> [422.570185] softirqs last disabled at (14185): [<ffffffff8108c539>] irq_exit+0xa9/0xc0
<4> [422.570278] WARNING: CPU: 6 PID: 2416 at drivers/gpu/drm/i915/intel_display.c:1049 intel_disable_pipe+0x131/0x170 [i915]
<4> [422.570284] ---[ end trace cc1baaf67f8166e6 ]---
Comment 1 Francesco Balestrieri 2018-10-23 07:16:25 UTC
<7> [420.263551] [drm:intel_psr_disable_locked [i915]] Disabling PSR1
<3> [422.263758] [drm:intel_psr_disable_locked [i915]] *ERROR* Timed out waiting for PSR Idle State

Based on this log this seems related to PSR, updating component.

Also, we've had other problems with Suspend (see Bug 107713), I wonder if it could be related.
Comment 2 Martin Peres 2018-10-23 07:17:54 UTC
(In reply to Francesco Balestrieri from comment #1)
> <7> [420.263551] [drm:intel_psr_disable_locked [i915]] Disabling PSR1
> <3> [422.263758] [drm:intel_psr_disable_locked [i915]] *ERROR* Timed out
> waiting for PSR Idle State
> 
> Based on this log this seems related to PSR, updating component.

Sorry about that!

> 
> Also, we've had other problems with Suspend (see Bug 107713), I wonder if it
> could be related.

I doubt it :)
Comment 3 Jani Saarinen 2018-10-23 07:20:06 UTC
CI systems had wrong BIOS, to be checked / updated today.
Comment 4 James Ausmus 2018-10-24 17:51:25 UTC
Moving this to "high", as the BIOS update seems to have fixed it - will keep monitoring results to see if it pops back up
Comment 5 James Ausmus 2018-11-02 15:44:44 UTC
Closing this, as this issue is not seen anymore on CI.
Comment 6 Martin Peres 2018-11-02 15:50:27 UTC
(In reply to James Ausmus from comment #5)
> Closing this, as this issue is not seen anymore on CI.

This is still a little early to close, but this is true that is was seen 3 times in 4 runs, so we should wait only 14 runs to have a reasonable assumption that this is closed. It has been 60 runs without issues, so I guess we are good here :)

Thanks!


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.