Bug 102490

Summary: [CI] several gem tests causing incomplete results for sharded runs
Product: DRI
Reporter: Jani Saarinen <jani.saarinen>
Component: DRM/Intel
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: CLOSED FIXED
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: major
Priority: medium
CC: intel-gfx-bugs
Version: DRI git
Hardware: x86-64 (AMD64)
OS: Linux (All)
Whiteboard: ReadyForDev
i915 platform: BXT
i915 features: power/runtime PM

Description Jani Saarinen 2017-08-31 08:23:59 UTC
On CI, several tests are causing incomplete results for sharded runs. We agreed to file a bug for these.

Lately, especially these tests:
igt@gem_fence_thrash@bo-write-verify-threaded-y
igt@gem_mmap_gtt@forked-medium-copy
igt@gem_mmap_gtt@forked-medium-copy-XY
igt@gem_mmap_gtt@forked-medium-copy-odd
igt@gem_mmap_gtt@medium-copy

Getting these sorted out might give us a more stable APL.

See: https://intel-gfx-ci.01.org/tree/drm-tip/shards.html

Note: the APLs are currently without displays.
Comment 1 Chris Wilson 2017-08-31 08:58:48 UTC
The main culprit there is the continual runtime suspend/resume making that platform much slower than the others; rpm is especially harsh towards GTT mmappings.
Comment 2 Chris Wilson 2017-09-03 14:36:23 UTC
https://patchwork.freedesktop.org/series/29764/ to speed up mmap-gtt and https://patchwork.freedesktop.org/series/29208/ to speed up gem_fence_thrash.
Comment 3 Jani Saarinen 2017-10-04 13:25:01 UTC
Do the patches sent here seem to fix the issues seen on APL shards?
Comment 5 Chris Wilson 2017-10-09 20:54:44 UTC
The situation has improved slightly because the runtime pm issues are now masked by the connected display, and

kernel commit f34a93bbb33d988df36bb99f791f0bf8a7017041
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Mon Oct 9 09:43:58 2017 +0100

    drm/i915: Check PIN_NONFAULT overlaps in evict_for_node

...

kernel commit 43ae70d97c5195f48d903df31ecac4c5397b2f1e
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Mon Oct 9 09:44:01 2017 +0100

    drm/i915: Early rejection of mappable GGTT pin attempts for large bo

will improve gem_mmap_gtt / gem_pread / gem_pwrite.

gem_fence_thrash is improved by

igt commit eb57c0289dbd2bfb573bb1c104d80eaf5edc7228
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Wed Aug 23 13:44:57 2017 +0100

    igt/gem_fence_thrash: Use streaming reads for verify

That should hopefully account for the remaining incompletes. However, as stated earlier, part of the problem is being masked by a connected display, and we don't have good test coverage of GTT vs rpm (or at least none that demonstrates the severe performance issues).
Comment 7 Elizabeth 2018-02-13 16:40:00 UTC
Closing old verified.
