Bug 112117

Summary: [CI][SHARDS] igt@gem_ctx_exec@basic-nohangcheck - fail - Failed assertion: gem_wait(i915, spin->handle, &timeout) == 0
Product: DRI Reporter: Lakshmi <lakshminarayana.vudum>
Component: DRM/IntelAssignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: RESOLVED FIXED QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: not set    
Priority: not set CC: intel-gfx-bugs
Version: DRI git   
Hardware: Other   
OS: All   
Whiteboard:
i915 platform: ALL i915 features: GEM/Other

Description Lakshmi 2019-10-24 05:46:37 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-apl6/igt@gem_ctx_exec@basic-nohangcheck.html
Starting subtest: basic-nohangcheck
(gem_ctx_exec:7568) CRITICAL: Test assertion failure function nohangcheck_hostile, file ../tests/i915/gem_ctx_exec.c:228:
(gem_ctx_exec:7568) CRITICAL: Failed assertion: gem_wait(i915, spin->handle, &timeout) == 0
(gem_ctx_exec:7568) CRITICAL: error: -62 != 0
Subtest basic-nohangcheck failed.
Comment 1 CI Bug Log 2019-10-24 05:47:18 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* All machines: igt@gem_ctx_exec@basic-nohangcheck - fail - Failed assertion: gem_wait(i915, spin-&gt;handle, &amp;timeout) == 0
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-apl4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-glk7/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-hsw4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-iclb5/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-kbl1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-snb2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-tglb6/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-apl6/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-glk5/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-hsw8/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-iclb8/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-kbl1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-skl3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-snb6/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-tglb6/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-apl4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-glk1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-hsw4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-iclb3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-kbl1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-skl3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-snb1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-tglb4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-apl2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-glk3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-hsw4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-iclb7/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-kbl3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-skl7/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-snb5/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-tglb2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-apl4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-glk3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-hsw4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-iclb2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-kbl1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-skl2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-snb2/igt@gem_ctx_exec@basic-nohangcheck.html
Comment 2 Chris Wilson 2019-10-24 06:35:58 UTC
commit 2e0986a58cc4f2e7f9e7ede19ec32b9c116d0068
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Wed Oct 23 14:31:07 2019 +0100

    drm/i915/gem: Cancel contexts when hangchecking is disabled
    
    Normally, we rely on our hangcheck to prevent persistent batches from
    hogging the GPU. However, if the user disables hangcheck, this mechanism
    breaks down. Despite our insistence that this is unsafe, the users are
    equally insistent that they want to use endless batches and will disable
    the hangcheck mechanism. We are looking at replacing hangcheck, in the
    next patch, with a softer mechanism, that sends a pulse down the engine
    to check if it is well. We can use the same preemptive pulse to flush an
    active context off the GPU upon context close, preventing resources
    being lost and unkillable requests remaining on the GPU after process
    termination.
    
    Testcase: igt/gem_ctx_exec/basic-nohangcheck
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
    Cc: MichaƂ Winiarski <michal.winiarski@intel.com>
    Cc: Jon Bloomfield <jon.bloomfield@intel.com>
    Reviewed-by: Jon Bloomfield <jon.bloomfield@intel.com>
    Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20191023133108.21401-4-chris@chris-wilson.co.uk

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.