Bug 111379

Summary: [CI][DRMTIP] igt@gem_eio@unwedge-stress - fail - Failed assertion: __gem_execbuf(fd, execbuf) == 0
Product: DRI Reporter: Lakshmi <lakshminarayana.vudum>
Component: DRM/Intel Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: RESOLVED FIXED QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal    
Priority: medium CC: intel-gfx-bugs
Version: DRI git   
Hardware: Other   
OS: All   
Whiteboard: ReadyForDev
i915 platform: BYT i915 features: GEM/Other

Description Lakshmi 2019-08-12 11:48:51 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_340/fi-byt-j1900/igt@gem_eio@unwedge-stress.html

Starting subtest: unwedge-stress
(gem_eio:1149) ioctl_wrappers-CRITICAL: Test assertion failure function gem_execbuf, file ../lib/ioctl_wrappers.c:609:
(gem_eio:1149) ioctl_wrappers-CRITICAL: Failed assertion: __gem_execbuf(fd, execbuf) == 0
(gem_eio:1149) ioctl_wrappers-CRITICAL: error: -5 != 0
Subtest unwedge-stress failed.
Comment 2 Chris Wilson 2019-08-12 15:09:04 UTC
Hmm,

<7> [297.642048] hangcheck bcs0
<7> [297.642063] hangcheck 	Awake? 4
<7> [297.642074] hangcheck 	Hangcheck: 8000 ms ago
<7> [297.642084] hangcheck 	Reset count: 0 (global 294)
<7> [297.642094] hangcheck 	Requests:
<7> [297.642119] hangcheck 		active  6:12ce2*  @ 8512ms: gem_eio[1149]
<7> [297.642130] hangcheck 		ring->start:  0x007f1000
<7> [297.642140] hangcheck 		ring->head:   0x00003890
<7> [297.642149] hangcheck 		ring->tail:   0x00002a50
<7> [297.642158] hangcheck 		ring->emit:   0x00002a50
<7> [297.642168] hangcheck 		ring->space:  0x00000e00
<7> [297.642177] hangcheck 		ring->hwsp:   0x007eb100

That ring->space value suggests we blocked waiting for ring space (and so hung before we performed the reset ourselves).

reset_stress() expects to submit 21 batches without a problem; that might have been optimistic.
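
As a rough cross-check of the ring state in the dump above, the free space can be recomputed from ring->head and ring->tail. This is a minimal sketch, assuming the driver keeps one 64-byte cacheline in reserve and that the ring size is a power of two. The 0x4000 size and the names `ring_space`, `CACHELINE_BYTES` and `RING_SIZE` are assumptions made for illustration and are not taken from the log.

```python
# Assumption: one 64-byte cacheline is kept free between tail and head.
CACHELINE_BYTES = 64

# Assumption: ring size is not printed in the dump; any power of two
# larger than ring->head gives the same result for these values.
RING_SIZE = 0x4000


def ring_space(head: int, tail: int, size: int) -> int:
    """Bytes available for new commands in a circular ring buffer."""
    return (head - tail - CACHELINE_BYTES) & (size - 1)


# Values copied from the hangcheck dump.
head = 0x3890
tail = 0x2A50

space = ring_space(head, tail, RING_SIZE)
print(hex(space))  # 0xe00, the same value as ring->space in the dump
```

Under these assumptions the result agrees with the reported ring->space of 0x00000e00, i.e. only 3584 bytes were left when the request stalled.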
Comment 3 Chris Wilson 2019-08-13 15:14:37 UTC
Should be

commit d62008b45c735c32c5e52dd14c5ad2a4ba57feb5 (upstream/master, origin/master, origin/HEAD)
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Mon Aug 12 16:10:49 2019 +0100

    i915/gem_eio: Restrict number of batches of submitted
    
    Make sure we don't block while setting up the stress case before the
    reset by only submitting less batches than would fill the ring.
    
    References: https://bugs.freedesktop.org/show_bug.cgi?id=111379
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Reviewed-by: Andi Shyti <andi.shyti@intel.com>
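
The idea in the commit message, submitting fewer batches than would fill the ring, can be sketched as a simple clamp. This is only an illustration, not the code from the commit: `clamp_batches` and `ring_capacity` are hypothetical names, and how the test actually measures the ring capacity is not shown in this report.

```python
def clamp_batches(requested: int, ring_capacity: int) -> int:
    """Number of batches to submit so that the ring is never filled.

    requested:     how many batches the test would like to queue
                   (21 in the reset_stress() case discussed above)
    ring_capacity: hypothetical count of batches that fit in the ring
                   before a further submission would block
    """
    # Always stay at least one batch short of a full ring.
    return max(0, min(requested, ring_capacity - 1))


print(clamp_batches(21, 16))  # 15: limited by the ring
print(clamp_batches(21, 64))  # 21: the ring is large enough
```

With a clamp of this kind the setup phase cannot block on ring space, so the reset is always reached before the submission stalls.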
Comment 4 Martin Peres 2019-09-02 07:29:57 UTC
(In reply to Chris Wilson from comment #3)

Thanks! It was seen twice per run on 3 consecutive runs, and has not been seen since this patch (~10 runs).
Comment 5 CI Bug Log 2019-09-02 07:30:04 UTC
The CI Bug Log issue associated to this bug has been archived.

New failures matching the above filters will not be associated to this bug anymore.
