Summary: | [CI][BAT]igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 | ||
---|---|---|---|
Product: | DRI | Reporter: | Lakshmi <lakshminarayana.vudum> |
Component: | DRM/Intel | Assignee: | Intel GFX Bugs mailing list <intel-gfx-bugs> |
Status: | RESOLVED MOVED | QA Contact: | Intel GFX Bugs mailing list <intel-gfx-bugs> |
Severity: | major | ||
Priority: | medium | CC: | intel-gfx-bugs |
Version: | DRI git | ||
Hardware: | Other | ||
OS: | All | ||
Whiteboard: | |||
i915 platform: | BXT, CFL, CML, GLK, ICL, KBL, SKL | i915 features: | GEM/Other |
Description
Lakshmi
2019-10-22 11:16:00 UTC
The CI Bug Log issue associated to this bug has been updated. ### New filters associated * KBL CFL CML: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-cfl-8109u/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-cfl-8700k/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-cfl-guc/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-cml-s/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-cml-u/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-cml-u2/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-kbl-7500u/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-kbl-8809g/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-kbl-guc/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-kbl-r/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-kbl-soraka/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14904/fi-kbl-x1275/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7143/fi-cfl-8700k/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7144/fi-cml-u/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3592/fi-cml-u2/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14908/fi-kbl-8809g/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_5203/fi-kbl-r/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14910/fi-cml-u2/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14910/fi-kbl-8809g/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14911/fi-cfl-8109u/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14912/fi-cml-s/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14912/fi-kbl-guc/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7146/fi-kbl-r/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14913/fi-cml-u2/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14913/fi-kbl-guc/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14913/fi-kbl-r/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14914/fi-kbl-r/igt@i915_selftest@live_gt_heartbeat.html The CI Bug Log issue associated to this bug has been updated. ### New filters associated * CFL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7148/fi-cfl-8109u/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- KBL CFL CML: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 -} {+ SKL KBL CFL CML: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7151/fi-skl-iommu/igt@i915_selftest@live_gt_heartbeat.html * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7153/fi-skl-iommu/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- SKL KBL CFL CML: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 -} {+ BXT SKL KBL CFL CML: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7153/fi-bxt-dsi/igt@i915_selftest@live_gt_heartbeat.html Tried commit f79520bb333792fb23a32352f83d8d59a525cec9 Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Tue Oct 22 12:21:11 2019 +0100 drm/i915/selftests: Synchronize checking active status with retirement to remove one possible cause of not noticing the callback being run. Hmm, easy to write off as a test bug, but we do have lots of mysterious timeouts all possible due to the callbacks going astray. Hmm, that sounds like a coincidence! A CI Bug Log filter associated to this bug has been updated: {- BXT SKL KBL CFL CML: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 -} {+ BXT SKL KBL CFL WHL CML: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/fi-whl-u/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- BXT SKL KBL CFL WHL CML: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 -} {+ BXT SKL KBL CFL WHL CML ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7162/fi-icl-u2/igt@i915_selftest@live_gt_heartbeat.html * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7167/fi-icl-dsi/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- BXT SKL KBL CFL WHL CML ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 -} {+ BXT SKL KBL CFL WHL CML ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_flush failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7166/fi-glk-dsi/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- CFL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 -} {+ CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7171/fi-icl-dsi/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 -} {+ APL SKL KBL CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7167/shard-skl3/igt@i915_selftest@live_gt_heartbeat.html The CI Bug Log issue associated to this bug has been updated. ### New filters associated * BYT: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_heartbeat_fast failed with error -22 - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_5181/fi-byt-n2820/igt@i915_selftest@live_gt_heartbeat.html - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7186/fi-byt-n2820/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- APL SKL KBL CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 -} {+ APL BXT SKL KBL CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5249/fi-bxt-dsi/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- BYT: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_heartbeat_fast failed with error -22 -} {+ BYT SKL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_heartbeat_fast failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7222/shard-skl5/igt@i915_selftest@live_gt_heartbeat.html Overall reproduction rate so far is 76 / 179 runs (42.5%), setting severity to high. vcs0: heartbeat pulse did not flush idle tasks pulse active pulse_active+0x0/0x10 [i915]:pulse_retire+0x0/0x10 [i915] pulse count: 0 pulse preallocated barriers? no So just a synchronisation problem. :| commit 38813767c7c5d9f8e0bd6b14136add861cc79b33 (HEAD -> drm-intel-next-queued, drm-intel/drm-intel-next-queued) Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Fri Nov 1 18:10:22 2019 +0000 drm/i915/selftests: Flush all active callbacks Flushing the outer i915_active is not enough, as we need the barrier to be applied across all the active dma_fence callbacks. So we must serialise with each outstanding fence. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=112096 References: f79520bb3337 ("drm/i915/selftests: Synchronize checking active status with retirement") Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Acked-by: Andi Shyti <andi.shyti@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20191101181022.25633-1-chris@chris-wilson.co.uk A CI Bug Log filter associated to this bug has been updated: {- APL BXT SKL KBL CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 -} {+ APL BXT SKL KBL CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7391/fi-cml-u2/igt@i915_selftest@live_gt_heartbeat.html A CI Bug Log filter associated to this bug has been updated: {- APL BXT SKL KBL CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 -} {+ APL BXT SKL KBL CFL CML ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 +} No new failures caught with the new filter (In reply to CI Bug Log from comment #18) > A CI Bug Log filter associated to this bug has been updated: > > {- APL BXT SKL KBL CFL ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail > - live_idle_pulse failed with error -22 -} > {+ APL BXT SKL KBL CFL CML ICL: igt@i915_selftest@live_gt_heartbeat - > dmesg-fail - live_idle_pulse failed with error -22 +} Still happening, here are some of the latest failures. https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7374/fi-kbl-7560u/igt@i915_selftest@live_gt_heartbeat.html https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7337/fi-skl-6770hq/igt@i915_selftest@live_gt_heartbeat.html And the annoying thing is... <3> [383.431019] vcs0: heartbeat pulse did not flush idle tasks <3> [383.431123] *ERROR* pulse active pulse_active+0x0/0x10 [i915]:pulse_retire+0x0/0x10 [i915] <3> [383.431128] *ERROR* pulse count: 0 <3> [383.431132] *ERROR* pulse preallocated barriers? no it is nothing more than bad timing; a missing CPU barrier. But where? I'm running out of places to put them! The CI Bug Log issue associated to this bug has been updated. ### New filters associated * KBL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -62 - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7417/fi-kbl-guc/igt@i915_selftest@live_gt_heartbeat.html (In reply to Chris Wilson from comment #20) > And the annoying thing is... > > <3> [383.431019] vcs0: heartbeat pulse did not flush idle tasks > <3> [383.431123] *ERROR* pulse active pulse_active+0x0/0x10 > [i915]:pulse_retire+0x0/0x10 [i915] > <3> [383.431128] *ERROR* pulse count: 0 > <3> [383.431132] *ERROR* pulse preallocated barriers? no > > it is nothing more than bad timing; a missing CPU barrier. But where? I'm > running out of places to put them! > https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7417/fi-kbl-guc/ > igt@i915_selftest@live_gt_heartbeat.html Another failure <3> [686.205010] i915/intel_heartbeat_live_selftests: live_idle_pulse failed with error -62 (In reply to CI Bug Log from comment #21) > The CI Bug Log issue associated to this bug has been updated. > > ### New filters associated > > * KBL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse > failed with error -62 > - > https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7417/fi-kbl-guc/ > igt@i915_selftest@live_gt_heartbeat.html Note that is a different class of failure entirely. The CI Bug Log issue associated to this bug has been updated. ### Removed filters * APL BXT SKL KBL CFL CML ICL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -22 (added on 6 days, 1 hour ago) * KBL: igt@i915_selftest@live_gt_heartbeat - dmesg-fail - live_idle_pulse failed with error -62 (added on 1 day ago) *** Bug 112406 has been marked as a duplicate of this bug. *** -- GitLab Migration Automatic Message -- This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/intel/issues/541. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.