Bug 112117 - [CI][SHARDS] igt@gem_ctx_exec@basic-nohangcheck - fail - Failed assertion: gem_wait(i915, spin->handle, &timeout) == 0
Summary: [CI][SHARDS] igt@gem_ctx_exec@basic-nohangcheck - fail - Failed assertion: ge...
Status: RESOLVED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: Other All
: not set not set
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2019-10-24 05:46 UTC by Lakshmi
Modified: 2019-10-24 06:35 UTC (History)
1 user (show)

See Also:
i915 platform: ALL
i915 features: GEM/Other


Attachments

Description Lakshmi 2019-10-24 05:46:37 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-apl6/igt@gem_ctx_exec@basic-nohangcheck.html
Starting subtest: basic-nohangcheck
(gem_ctx_exec:7568) CRITICAL: Test assertion failure function nohangcheck_hostile, file ../tests/i915/gem_ctx_exec.c:228:
(gem_ctx_exec:7568) CRITICAL: Failed assertion: gem_wait(i915, spin->handle, &timeout) == 0
(gem_ctx_exec:7568) CRITICAL: error: -62 != 0
Subtest basic-nohangcheck failed.
Comment 1 CI Bug Log 2019-10-24 05:47:18 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* All machines: igt@gem_ctx_exec@basic-nohangcheck - fail - Failed assertion: gem_wait(i915, spin->handle, &timeout) == 0
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-apl4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-glk7/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-hsw4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-iclb5/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-kbl1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-snb2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_3594/shard-tglb6/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-apl6/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-glk5/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-hsw8/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-iclb8/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-kbl1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-skl3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-snb6/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5236/shard-tglb6/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-apl4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-glk1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-hsw4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-iclb3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-kbl1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-skl3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-snb1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7159/shard-tglb4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-apl2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-glk3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-hsw4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-iclb7/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-kbl3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-skl7/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-snb5/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14943/shard-tglb2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-apl4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-glk3/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-hsw4/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-iclb2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-kbl1/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-skl2/igt@gem_ctx_exec@basic-nohangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_14944/shard-snb2/igt@gem_ctx_exec@basic-nohangcheck.html
Comment 2 Chris Wilson 2019-10-24 06:35:58 UTC
commit 2e0986a58cc4f2e7f9e7ede19ec32b9c116d0068
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Wed Oct 23 14:31:07 2019 +0100

    drm/i915/gem: Cancel contexts when hangchecking is disabled
    
    Normally, we rely on our hangcheck to prevent persistent batches from
    hogging the GPU. However, if the user disables hangcheck, this mechanism
    breaks down. Despite our insistence that this is unsafe, the users are
    equally insistent that they want to use endless batches and will disable
    the hangcheck mechanism. We are looking at replacing hangcheck, in the
    next patch, with a softer mechanism, that sends a pulse down the engine
    to check if it is well. We can use the same preemptive pulse to flush an
    active context off the GPU upon context close, preventing resources
    being lost and unkillable requests remaining on the GPU after process
    termination.
    
    Testcase: igt/gem_ctx_exec/basic-nohangcheck
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
    Cc: Michał Winiarski <michal.winiarski@intel.com>
    Cc: Jon Bloomfield <jon.bloomfield@intel.com>
    Reviewed-by: Jon Bloomfield <jon.bloomfield@intel.com>
    Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20191023133108.21401-4-chris@chris-wilson.co.uk


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.