Bug 107732 - [CI][SHARDS] igt@* - dmesg-fail / dmesg-warn - flip_done timed out
Summary: [CI][SHARDS] igt@* - dmesg-fail / dmesg-warn - flip_done timed out
Status: RESOLVED MOVED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: Other All
: high normal
Assignee: Ville Syrjala
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-08-29 08:04 UTC by Martin Peres
Modified: 2019-11-29 17:50 UTC (History)
2 users (show)

See Also:
i915 platform: BSW/CHT
i915 features: display/Other


Attachments

Description Martin Peres 2018-08-29 08:04:36 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_95/fi-bdw-samus/igt@gem_mmap_gtt@basic-write-gtt-no-prefault.html

https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_93/fi-bdw-samus/igt@kms_vblank@pipe-b-wait-idle-hang.html

https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_93/fi-bdw-samus/igt@gem_pwrite@small-cpu-backwards.html

https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_93/fi-bdw-samus/igt@kms_pipe_crc_basic@nonblocking-crc-pipe-b.html

... and other 130 instances ...

[  431.278697] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:39:pipe A] flip_done timed out
[  441.523797] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CONNECTOR:65:eDP-1] flip_done timed out
[  451.768866] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:28:primary A] flip_done timed out
[  462.014048] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:39:pipe A] flip_done timed out
[  472.259145] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:39:pipe A] flip_done timed out
[  482.504266] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:39:pipe A] flip_done timed out
[  492.749385] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CONNECTOR:65:eDP-1] flip_done timed out
[  502.994523] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:28:primary A] flip_done timed out
[  513.239657] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:39:pipe A] flip_done timed out
Comment 1 Martin Peres 2018-09-07 07:43:47 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_105/fi-bsw-kefka/igt@kms_busy@extended-modeset-hang-oldfb-render-b.html

[  301.660971] i915 0000:00:02.0: Resetting rcs0 for no progress on rcs0
[  314.690039] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:61:pipe B] flip_done timed out
[  315.615851] i915 0000:00:02.0: Resetting rcs0 for no progress on rcs0
[  315.721625] ------------[ cut here ]------------
[  315.721658] vblank wait timed out on crtc 1
[  315.721761] WARNING: CPU: 1 PID: 1379 at drivers/gpu/drm/drm_vblank.c:1084 drm_wait_one_vblank+0x172/0x180
[  315.721783] Modules linked in: vgem snd_hda_codec_hdmi asix usbnet mii coretemp crct10dif_pclmul i915 crc32_pclmul ghash_clmulni_intel snd_hda_intel btusb btrtl btbcm snd_hda_codec btintel snd_hwdep bluetooth snd_hda_core ecdh_generic snd_pcm lpc_ich pinctrl_cherryview prime_numbers
[  315.722253] CPU: 1 PID: 1379 Comm: kms_busy Tainted: G     U            4.19.0-rc2-g1a2bb6c06121-drmtip_105+ #1
[  315.722273] Hardware name: GOOGLE Kefka/Kefka, BIOS MrChromebox 02/04/2018
[  315.722298] RIP: 0010:drm_wait_one_vblank+0x172/0x180
[  315.722323] Code: ff ff ff e8 f0 0b a7 ff 48 89 e6 4c 89 f7 e8 e5 13 ac ff 45 85 e4 0f 85 0f ff ff ff 89 ee 48 c7 c7 a8 cc 0c 9a e8 ee 08 a7 ff <0f> 0b e9 fa fe ff ff 0f 1f 80 00 00 00 00 8b b7 f8 00 00 00 48 8b
[  315.722344] RSP: 0018:ffffb0ef80947a58 EFLAGS: 00010286
[  315.722381] RAX: 0000000000000000 RBX: ffff8ecb2d110000 RCX: 0000000000000001
[  315.722400] RDX: 0000000080000001 RSI: ffffffff9a080d30 RDI: 00000000ffffffff
[  315.722420] RBP: 0000000000000001 R08: 00000000bfabfde7 R09: 0000000000000000
[  315.722438] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[  315.722457] R13: 000000000000096e R14: ffff8ecb2d9894c8 R15: ffff8ecb2e83dbc8
[  315.722479] FS:  00007f59b8b2f980(0000) GS:ffff8ecb3bb00000(0000) knlGS:0000000000000000
[  315.722499] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  315.722518] CR2: 00007f51dc025c08 CR3: 0000000178360000 CR4: 00000000001006e0
[  315.722535] Call Trace:
[  315.722582]  ? wait_woken+0xa0/0xa0
[  315.722957]  intel_pre_plane_update+0x161/0x200 [i915]
[  315.723309]  intel_update_crtc+0x79/0x80 [i915]
[  315.723642]  intel_update_crtcs+0x42/0x60 [i915]
[  315.723968]  intel_atomic_commit_tail+0x1c9/0xd00 [i915]
[  315.724300]  ? intel_atomic_commit_ready+0x3f/0x4c [i915]
[  315.724578]  ? __i915_sw_fence_complete+0x1a0/0x250 [i915]
[  315.724932]  intel_atomic_commit+0x240/0x320 [i915]
[  315.724982]  drm_mode_atomic_ioctl+0x837/0xa00
[  315.725122]  ? drm_atomic_set_property+0x650/0x650
[  315.725152]  drm_ioctl_kernel+0x7c/0xf0
[  315.725204]  drm_ioctl+0x2e6/0x3a0
[  315.725246]  ? drm_atomic_set_property+0x650/0x650
[  315.725359]  do_vfs_ioctl+0xa0/0x6d0
[  315.725437]  ksys_ioctl+0x35/0x60
[  315.725481]  __x64_sys_ioctl+0x11/0x20
[  315.725509]  do_syscall_64+0x55/0x190
[  315.725546]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
[  315.725572] RIP: 0033:0x7f59b7fc45d7
[  315.725596] Code: b3 66 90 48 8b 05 b1 48 2d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 81 48 2d 00 f7 d8 64 89 01 48
[  315.725615] RSP: 002b:00007ffe9139cd08 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[  315.725653] RAX: ffffffffffffffda RBX: 000056465d36baf0 RCX: 00007f59b7fc45d7
[  315.725672] RDX: 00007ffe9139cd60 RSI: 00000000c03864bc RDI: 0000000000000003
[  315.725690] RBP: 00007ffe9139cd60 R08: 000056465d3791c0 R09: 000000000000002d
[  315.725709] R10: 0000000000000000 R11: 0000000000000246 R12: 00000000c03864bc
[  315.725727] R13: 0000000000000003 R14: 0000000000000000 R15: 0000000000000000
[  315.725820] irq event stamp: 886
[  315.725849] hardirqs last  enabled at (885): [<ffffffff990fbba6>] console_unlock+0x1b6/0x5e0
[  315.725872] hardirqs last disabled at (886): [<ffffffff99001910>] trace_hardirqs_off_thunk+0x1a/0x1c
[  315.725897] softirqs last  enabled at (492): [<ffffffff99c0031d>] __do_softirq+0x31d/0x483
[  315.725920] softirqs last disabled at (473): [<ffffffff990900f9>] irq_exit+0xa9/0xc0
[  315.725946] WARNING: CPU: 1 PID: 1379 at drivers/gpu/drm/drm_vblank.c:1084 drm_wait_one_vblank+0x172/0x180
[  315.725966] ---[ end trace 04fc5582f947561d ]---
Comment 2 Martin Peres 2018-10-08 13:13:17 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4950/fi-icl-u2/igt@gem_mmap_gtt@basic.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4950/fi-icl-u2/igt@gem_mmap@basic-small-bo.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4950/fi-icl-u2/igt@gem_mmap@basic.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4950/fi-icl-u2/igt@gem_linear_blits@basic.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4950/fi-icl-u2/igt@gem_flink_basic@flink-lifetime.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4950/fi-icl-u2/igt@gem_flink_basic@double-flink.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4950/fi-icl-u2/igt@gem_flink_basic@basic.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4950/fi-icl-u2/igt@gem_flink_basic@bad-open.html

<3> [518.010429] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:45:pipe A] flip_done timed out
<3> [528.250366] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CONNECTOR:101:HDMI-A-2] flip_done timed out
<3> [538.490394] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:28:plane 1A] flip_done timed out
<3> [548.730392] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:45:pipe A] flip_done timed out
<3> [558.970392] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:45:pipe A] flip_done timed out
<3> [569.210388] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CONNECTOR:101:HDMI-A-2] flip_done timed out
<3> [579.450369] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:28:plane 1A] flip_done timed out
<3> [589.690390] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:45:pipe A] flip_done timed out
Comment 3 Martin Peres 2018-10-10 12:36:24 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_124/fi-glk-dsi/igt@kms_universal_plane@cursor-fb-leak-pipe-c.html

https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_124/fi-glk-dsi/igt@kms_vblank@pipe-b-wait-busy-hang.html

<3> [96.643905] [drm:dpi_send_cmd.constprop.5 [i915]] *ERROR* Video mode command 0x00000042 send failed.
<3> [106.718114] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:59:pipe A] flip_done timed out
<3> [116.963095] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:59:pipe A] flip_done timed out
<3> [127.208129] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CONNECTOR:138:DSI-1] flip_done timed out
<3> [137.453326] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:28:plane 1A] flip_done timed out
<3> [137.456210] [drm:intel_pipe_update_start [i915]] *ERROR* Potential atomic update failure on pipe A
<3> [147.698186] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:59:pipe A] flip_done timed out
Comment 4 Ville Syrjala 2018-10-16 17:18:46 UTC
The glk fail could be some kind of DSI specific issue:
<3> [96.643905] [drm:dpi_send_cmd.constprop.5 [i915]] *ERROR* Video mode command 0x00000042 send failed.

The ICL fail could be related to the mismanagement of type C ports:
<7>[  120.123471] [drm:intel_power_well_enable [i915]] enabling DDI C IO
<4>[  120.125029] ------------[ cut here ]------------
<4>[  120.125031] WARN_ON(intel_wait_for_register(dev_priv, regs->driver, (0x1 << ((pw_idx) * 2)), (0x1 << ((pw_idx) * 2)), 1))

The chv and bdw fails are a mystery.

I suggest splitting this up to different bugs for each platform.
Comment 5 Jani Saarinen 2018-10-16 19:20:30 UTC
On that ICL-U2 there is to Type C connected or is there? 
It should have only: eDP-PSR2,HDMI (mDP). But could Type-C cause issues still?
Comment 6 Jani Saarinen 2018-10-16 19:22:26 UTC
(In reply to Jani Saarinen from comment #5)
> On that ICL-U2 there is to Type C connected or is there? 
There is no Type-C

> It should have only: eDP-PSR2,HDMI (mDP). But could Type-C cause issues
> still?
Comment 7 Ville Syrjala 2018-10-16 19:40:04 UTC
(In reply to Jani Saarinen from comment #6)
> (In reply to Jani Saarinen from comment #5)
> > On that ICL-U2 there is to Type C connected or is there? 
> There is no Type-C
> 
> > It should have only: eDP-PSR2,HDMI (mDP). But could Type-C cause issues
> > still?

DDI C is a type C port no matter what the physical connector looks like. The log says it's got HDMI hooked up to that port.
Comment 8 Imre Deak 2018-10-17 10:19:22 UTC
(In reply to Ville Syrjala from comment #4)
> The glk fail could be some kind of DSI specific issue:
> <3> [96.643905] [drm:dpi_send_cmd.constprop.5 [i915]] *ERROR* Video mode
> command 0x00000042 send failed.
> 
> The ICL fail could be related to the mismanagement of type C ports:
> <7>[  120.123471] [drm:intel_power_well_enable [i915]] enabling DDI C IO
> <4>[  120.125029] ------------[ cut here ]------------
> <4>[  120.125031] WARN_ON(intel_wait_for_register(dev_priv, regs->driver,
> (0x1 << ((pw_idx) * 2)), (0x1 << ((pw_idx) * 2)), 1))

The above looks like
https://bugs.freedesktop.org/show_bug.cgi?id=108070

> 
> The chv and bdw fails are a mystery.
> 
> I suggest splitting this up to different bugs for each platform.
Comment 9 Lakshmi 2018-10-24 12:01:17 UTC
Ville, any updates here?
Comment 10 Lakshmi 2018-10-24 12:02:35 UTC
Latest log (dmesg-warn) from drmtip_132
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_132/fi-icl-u2/igt@gem_tiled_pread_basic.html
Comment 11 Jani Saarinen 2018-10-26 10:29:00 UTC
Need to wait latest BIOS results.
Comment 12 Jani Saarinen 2018-11-01 07:54:52 UTC
Do we see these still with BIOS 2402?
Comment 13 Lakshmi 2018-11-01 11:30:06 UTC
(In reply to Jani Saarinen from comment #12)
> Do we see these still with BIOS 2402?

Last seen drmtip_134 (5 days, 2 hours / 52 runs ago) which had BIOS 2392. Not seen from 2402 BIOS yet from BAT tests. Drmtip results with BIOS 2402 are not yet available.
Comment 14 Lakshmi 2018-12-03 12:36:52 UTC
Ville, any updates here?

Last seen CI_DRM_5239 (1 hour, 20 minutes / 0 runs ago)

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5239/fi-icl-u2/igt@kms_chamelium@common-hpd-after-suspend.html
Comment 15 Imre Deak 2018-12-12 18:30:10 UTC
(In reply to Lakshmi from comment #14)
> Ville, any updates here?
> 
> Last seen CI_DRM_5239 (1 hour, 20 minutes / 0 runs ago)
> 
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5239/fi-icl-u2/
> igt@kms_chamelium@common-hpd-after-suspend.html

This one failure here (which ends up in a flip done timeout) is a duplicate of 108070.
Comment 16 Jani Saarinen 2018-12-12 20:13:33 UTC
Martin this has been seen only 4 times on bsw:
eg: https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_153/fi-bsw-kefka/igt@kms_busy@extended-modeset-hang-newfb-render-a.html

https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_147/fi-bsw-kefka/igt@kms_busy@extended-modeset-hang-newfb-with-reset-render-a.html

but lately only on ICL, no bdw lately (drmtip_138 (1 month / 1010 runs ago))
dropping BDW, maybe even BSW warrant own bug?
Comment 17 Jani Saarinen 2018-12-12 20:14:31 UTC
Based on comment from Imre, dup for 108070 for ICL.

*** This bug has been marked as a duplicate of bug 108070 ***
Comment 18 Martin Peres 2019-03-08 15:16:43 UTC
(In reply to Jani Saarinen from comment #17)
> Based on comment from Imre, dup for 108070 for ICL.
> 
> *** This bug has been marked as a duplicate of bug 108070 ***

This is still happening:

https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_231/fi-bsw-kefka/igt@kms_busy@extended-modeset-hang-newfb-render-a.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5599/shard-iclb4/igt@pm_rpm@legacy-planes.html
Comment 19 Martin Peres 2019-03-08 15:17:32 UTC
Oops, it was seen on ICL shards too.
Comment 20 Jani Saarinen 2019-03-08 15:24:04 UTC
Old BIOS still:
<6>[    0.000000] DMI: Intel Corporation Ice Lake Client Platform/IceLake U DDR4 SODIMM PD RVP TLC, BIOS ICLSFWR1.R00.2402.AD3.1810170014 10/17/2018

Need to wait until results from BIOS ICLSFWR1.R00.3087 got.
Comment 21 CI Bug Log 2019-04-05 12:10:41 UTC
A CI Bug Log filter associated to this bug has been updated:

{- BSW GLK BDW ICL: all tests - dmesg-fail / dmesg-warn - flip_done timed out -}
{+ BSW GLK BDW: all tests - dmesg-fail / dmesg-warn - flip_done timed out +}

 No new failures caught with the new filter
Comment 22 Lakshmi 2019-04-05 12:12:20 UTC
Removed ICL tag as this issue occurred CI_DRM_5599_full (1 month, 2 weeks old) ago. I assume ICL is fixed, WORKSFORME.
Comment 23 CI Bug Log 2019-05-29 07:25:56 UTC
A CI Bug Log filter associated to this bug has been updated:

{- BSW GLK BDW: all tests - dmesg-fail / dmesg-warn - flip_done timed out -}
{+ HSW BSW BDW GLK: all tests - dmesg-fail / dmesg-warn - flip_done timed out +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6158/fi-hsw-4770r/igt@i915_module_load@reload-with-fault-injection.html
Comment 24 CI Bug Log 2019-05-29 07:26:16 UTC
The CI Bug Log issue associated to this bug has been updated.

### Removed filters

* HSW BSW BDW GLK: all tests - dmesg-fail / dmesg-warn - flip_done timed out (added on a minute ago)

### New filters associated

* BSW GLK BDW: all tests - dmesg-fail / dmesg-warn - flip_done timed out
  (No new failures associated)

* HSW: igt@runner@aborted - fail - Previous test: i915_module_load (reload-with-fault-injection)
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6158/fi-hsw-4770r/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4320/fi-hsw-4770/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4320/fi-hsw-4770r/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4320/fi-hsw-peppy/igt@runner@aborted.html
Comment 25 CI Bug Log 2019-07-17 10:21:49 UTC
A CI Bug Log filter associated to this bug has been updated:

{- BSW GLK BDW: all tests - dmesg-fail / dmesg-warn - flip_done timed out -}
{+ HSW BSW GLK BDW: all tests - dmesg-fail / dmesg-warn - flip_done timed out +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6425_189/fi-hsw-4770r/igt@i915_module_load@reload-with-fault-injection.html
Comment 26 Martin Peres 2019-11-29 17:50:25 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/intel/issues/142.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.