Bug 111678

Summary: [CI][BAT]igt@i915_selftest@live_hangcheck - dmesg-fail- Failed assertion: err == 0
Product: DRI
Component: DRM/Intel
Version: DRI git
Hardware: Other
OS: All
Status: RESOLVED MOVED
Severity: major
Priority: high
Reporter: Lakshmi <lakshminarayana.vudum>
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
CC: intel-gfx-bugs
Whiteboard:
i915 platform: ICL
i915 features: GEM/Other

Description Lakshmi 2019-09-12 17:32:01 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6879/fi-icl-u3/igt@i915_selftest@live_hangcheck.html

Starting subtest: live_hangcheck
(i915_selftest:5287) igt_kmod-CRITICAL: Test assertion failure function igt_kselftest_execute, file ../lib/igt_kmod.c:532:
(i915_selftest:5287) igt_kmod-CRITICAL: Failed assertion: err == 0
(i915_selftest:5287) igt_kmod-CRITICAL: kselftest "i915 igt__33__live_hangcheck=1 live_selftests=-1 disable_display=1 st_filter=" failed: Input/output error [5]
Subtest live_hangcheck failed.
**** DEBUG ****
(i915_selftest:5287) igt_kmod-CRITICAL: Test assertion failure function igt_kselftest_execute, file ../lib/igt_kmod.c:532:
(i915_selftest:5287) igt_kmod-CRITICAL: Failed assertion: err == 0
(i915_selftest:5287) igt_kmod-CRITICAL: kselftest "i915 igt__33__live_hangcheck=1 live_selftests=-1 disable_display=1 st_filter=" failed: Input/output error [5]
(i915_selftest:5287) igt_core-INFO: Stack trace:
(i915_selftest:5287) igt_core-INFO:   #0 ../lib/igt_core.c:1674 __igt_fail_assert()
(i915_selftest:5287) igt_core-INFO:   #1 ../lib/igt_kmod.c:535 igt_kselftest_execute()
(i915_selftest:5287) igt_core-INFO:   #2 [main+0x30]
(i915_selftest:5287) igt_core-INFO:   #3 [<unknown>+0xba891000]
****  END  ****
Subtest live_hangcheck: FAIL (21.533s)
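
For triage, the trailing `[5]` on the kselftest line appears to be the errno value that matches the "Input/output error" text. The following is a minimal Python sketch, not part of IGT, that pulls the code out of such a line and maps it to its symbolic name. The helper name `parse_kselftest_failure` and the regular expression are assumptions made for illustration, and the pattern is only matched against the line format quoted above.

```python
import errno
import re

# Hypothetical helper, not part of IGT: match the kselftest failure line
# format quoted in this report and capture the trailing "[N]" code.
KSELFTEST_FAIL = re.compile(
    r'kselftest "(?P<args>[^"]*)" failed: (?P<msg>.+) \[(?P<code>\d+)\]'
)


def parse_kselftest_failure(line):
    """Return a dict describing the failure, or None if the line does not match."""
    m = KSELFTEST_FAIL.search(line)
    if m is None:
        return None
    code = int(m.group("code"))
    return {
        "args": m.group("args"),
        "message": m.group("msg"),
        "errno": code,
        # Map the numeric code to its symbolic name on the running platform.
        "name": errno.errorcode.get(code, "UNKNOWN"),
    }


line = (
    '(i915_selftest:5287) igt_kmod-CRITICAL: kselftest "i915 igt__33__live_hangcheck=1 '
    'live_selftests=-1 disable_display=1 st_filter=" failed: Input/output error [5]'
)
result = parse_kselftest_failure(line)
print(result)
```

On Linux the code 5 maps to `EIO`, which is consistent with the message text in the log.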
Comment 1 CI Bug Log 2019-09-12 17:32:44 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* ICL: igt@i915_selftest@live_hangcheck - dmesg-fail- Failed assertion: err == 0
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6879/fi-icl-u3/igt@i915_selftest@live_hangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4974/fi-icl-guc/igt@i915_selftest@live_hangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4974/fi-icl-u4/igt@i915_selftest@live_hangcheck.html
Comment 2 Chris Wilson 2019-09-12 19:50:02 UTC
Just in case it turns out to be relevant, this is with iommu enabled. No DMAR errors in this run, but I have seen

16:42            ickle : <3> [69.430006] DMAR: DRHD: handling fault status reg 2
16:42            ickle : <3> [69.430050] DMAR: [DMA Write] Request device [00:02.0] fault addr 44000 [fault reason 07] Next page table ptr is invalid

recently on icl, so worth remembering.
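
To check other runs for the same kind of fault, a dmesg log can be scanned for DMAR fault lines. The following is a minimal Python sketch under the assumption that the lines follow the exact format quoted above. The helper name `find_dmar_faults` and the pattern are hypothetical, and other kernel versions may format these messages differently.

```python
import re

# Assumed pattern, derived only from the DMAR fault line quoted in this comment.
DMAR_FAULT = re.compile(
    r"DMAR: \[DMA (?P<op>Read|Write)\] Request device \[(?P<dev>[0-9a-fA-F:.]+)\] "
    r"fault addr (?P<addr>[0-9a-fA-F]+) \[fault reason (?P<reason>\d+)\] (?P<text>.*)"
)


def find_dmar_faults(lines):
    """Return one dict per DMAR fault line found in an iterable of dmesg lines."""
    faults = []
    for line in lines:
        m = DMAR_FAULT.search(line)
        if m:
            faults.append({
                "op": m.group("op"),
                "device": m.group("dev"),
                "addr": int(m.group("addr"), 16),  # fault address is printed in hex
                "reason": int(m.group("reason")),
                "text": m.group("text").strip(),
            })
    return faults


sample = [
    "<3> [69.430006] DMAR: DRHD: handling fault status reg 2",
    "<3> [69.430050] DMAR: [DMA Write] Request device [00:02.0] fault addr 44000 "
    "[fault reason 07] Next page table ptr is invalid",
]
faults = find_dmar_faults(sample)
for fault in faults:
    print(fault)
```

Only the second sample line carries the fault details, so the "handling fault status" line is skipped by design.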
Comment 3 Francesco Balestrieri 2019-09-30 05:47:15 UTC
Occurs on all ICL machines every other day or so, setting priority to high.
Comment 4 Martin Peres 2019-11-29 19:28:28 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/intel/issues/419.
