Summary: | [CI][SHARDS] igt@gem_exec_fence@basic-await-default - fail - Failed assertion: out[n] == 0 | ||
---|---|---|---|
Product: | DRI | Reporter: | Martin Peres <martin.peres> |
Component: | DRM/Intel | Assignee: | Chris Wilson <chris> |
Status: | CLOSED FIXED | QA Contact: | Intel GFX Bugs mailing list <intel-gfx-bugs> |
Severity: | normal | ||
Priority: | high | CC: | intel-gfx-bugs |
Version: | XOrg git | ||
Hardware: | Other | ||
OS: | All | ||
Whiteboard: | ReadyForDev | ||
i915 platform: | BYT, HSW, IVB | i915 features: | GEM/Other |
Description
Martin Peres
2018-11-28 14:23:52 UTC
commit f36c071f6344e0a335ed4b4e0b3a38c0dd54648b Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Mon Dec 3 11:36:56 2018 +0000 drm/i915/ringbuffer: Clear semaphore sync registers on ring init Ensure that the sync registers are cleared every time we restart the ring to avoid stale values from creeping in from random neutrinos. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108888 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Mika Kuoppala <mika.kuoppala@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20181203113701.12106-3-chris@chris-wilson.co.uk According to CI Buglog, this still occurs: https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_178/fi-hsw-4770r/igt@gem_exec_fence@basic-await-default.html Reopening. Haswell is no longer the odd one out, commit 6faf5916e6beb0dedb0fcbbafbaa152adeaea758 Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Fri Dec 28 14:07:35 2018 +0000 drm/i915: Remove HW semaphores for gen7 inter-engine synchronisation The writing is on the wall for the existence of a single execution queue along each engine, and as a consequence we will not be able to track dependencies along the HW queue itself, i.e. we will not be able to use HW semaphores on gen7 as they use a global set of registers (and unlike gen8+ we can not effectively target memory to keep per-context seqno and dependencies). On the positive side, when we implement request reordering for gen7 we also can not presume a simple execution queue and would also require removing the current semaphore generation code. So this bring us another step closer to request reordering for ringbuffer submission! The negative side is that using interrupts to drive inter-engine synchronisation is much slower (4us -> 15us to do a nop on each of the 3 engines on ivb). This is much better than it was at the time of introducing the HW semaphores and equally important userspace weaned itself off intermixing dependent BLT/RENDER operations (the prime culprit was glyph rendering in UXA). So while we regress the microbenchmarks, it should not impact the user. References: https://bugs.freedesktop.org/show_bug.cgi?id=108888 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Mika Kuoppala <mika.kuoppala@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20181228140736.32606-2-chris@chris-wilson.co.uk A CI Bug Log filter associated to this bug has been updated: {- HSW: igt@gem_exec_fence@basic-await-default - fail - Failed assertion: out[n] == 0 -} {+ BYT IVB IVBm HSW: igt@gem_exec_fence@basic-await-default - fail - Failed assertion: out[n] == 0 +} New failures caught by the filter: * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_169/fi-byt-clapper/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_170/fi-byt-clapper/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_170/fi-byt-j1900/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_170/fi-byt-n2820/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_170/fi-ivb-3520m/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_170/fi-ivb-3770/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_171/fi-byt-clapper/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_171/fi-byt-j1900/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_171/fi-byt-n2820/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_171/fi-ivb-3520m/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_171/fi-ivb-3770/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_172/fi-byt-clapper/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_178/fi-byt-j1900/igt@gem_exec_fence@basic-await-default.html * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_178/fi-ivb-3520m/igt@gem_exec_fence@basic-await-default.html (In reply to Chris Wilson from comment #3) > Haswell is no longer the odd one out, > > commit 6faf5916e6beb0dedb0fcbbafbaa152adeaea758 > Author: Chris Wilson <chris@chris-wilson.co.uk> > Date: Fri Dec 28 14:07:35 2018 +0000 > > drm/i915: Remove HW semaphores for gen7 inter-engine synchronisation > > The writing is on the wall for the existence of a single execution queue > along each engine, and as a consequence we will not be able to track > dependencies along the HW queue itself, i.e. we will not be able to use > HW semaphores on gen7 as they use a global set of registers (and unlike > gen8+ we can not effectively target memory to keep per-context seqno and > dependencies). > > On the positive side, when we implement request reordering for gen7 we > also can not presume a simple execution queue and would also require > removing the current semaphore generation code. So this bring us another > step closer to request reordering for ringbuffer submission! > > The negative side is that using interrupts to drive inter-engine > synchronisation is much slower (4us -> 15us to do a nop on each of the 3 > engines on ivb). This is much better than it was at the time of > introducing > the HW semaphores and equally important userspace weaned itself off > intermixing dependent BLT/RENDER operations (the prime culprit was glyph > rendering in UXA). So while we regress the microbenchmarks, it should not > impact the user. > > References: https://bugs.freedesktop.org/show_bug.cgi?id=108888 > Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> > Reviewed-by: Mika Kuoppala <mika.kuoppala@linux.intel.com> > Link: > https://patchwork.freedesktop.org/patch/msgid/20181228140736.32606-2- > chris@chris-wilson.co.uk Thanks! Let's hope it fixes the IVB and BYT issues too (which are also gen7). So far, it's been only 2 drmtip runs since this patch landed but the reproduction rate was quite sporadic, so I guess we'll have to be a little patient. (In reply to Martin Peres from comment #5) > Thanks! Let's hope it fixes the IVB and BYT issues too (which are also gen7). > > So far, it's been only 2 drmtip runs since this patch landed but the > reproduction rate was quite sporadic, so I guess we'll have to be a little > patient. Seems like we were too optimistic: https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_178/fi-byt-j1900/igt@gem_exec_fence@basic-await-default.html https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_178/fi-ivb-3520m/igt@gem_exec_fence@basic-await-default.html (In reply to Martin Peres from comment #6) > (In reply to Martin Peres from comment #5) > > Thanks! Let's hope it fixes the IVB and BYT issues too (which are also gen7). > > > > So far, it's been only 2 drmtip runs since this patch landed but the > > reproduction rate was quite sporadic, so I guess we'll have to be a little > > patient. > > Seems like we were too optimistic: > > https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_178/fi-byt-j1900/ > igt@gem_exec_fence@basic-await-default.html > > https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_178/fi-ivb-3520m/ > igt@gem_exec_fence@basic-await-default.html Nah, don't be confused by a real GPU hang! Which definitely didn't happen and has nothing at all to do with full-ppgtt. Nope, definitely not that at all. Last seen drmtip_178 (1 month, 3 weeks / 1032 runs ago). I assume this issue has been fixed, changing the status to Closed. The CI Bug Log issue associated to this bug has been archived. New failures matching the above filters will not be associated to this bug anymore. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.