Bug 103940 - 2% perf drop in SynMark PSPom test from "intel/fs: Be more explicit about our placement of [un]zip"
Summary: 2% perf drop in SynMark PSPom test from "intel/fs: Be more explicit about our...
Status: NEW
Alias: None
Product: Mesa
Classification: Unclassified
Component: Drivers/DRI/i965 (show other bugs)
Version: git
Hardware: Other All
: medium normal
Assignee: Intel 3D Bugs Mailing List
QA Contact: Intel 3D Bugs Mailing List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-11-27 15:47 UTC by Eero Tamminen
Modified: 2018-05-28 14:24 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Eero Tamminen 2017-11-27 15:47:09 UTC
Following commit was bisected (on SKL GT2):
-------------------------------------------------------------
commit 0d905597fe2997c89022c76cdf84dc4fba5eb055
Author:     Jason Ekstrand <jason.ekstrand@intel.com>
AuthorDate: Wed Sep 6 18:31:11 2017 -0700
Commit:     Jason Ekstrand <jason.ekstrand@intel.com>
CommitDate: Tue Nov 7 10:37:52 2017 -0800

    intel/fs: Be more explicit about our placement of [un]zip
    
    Before, we were careful to place the zip after the last of the split
    instructions but did unzip on-demand.  This changes things so that the
    unzips go before all of the split instructions and the unzip comes
    explicitly after all the split instructions.  As a side-effect of this
    change, we now emit the split instruction from highest SIMD group to
    lowest instead of low to high.  We could have kept the old behavior, but
    it shouldn't matter and this made the code easier.
    
    Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
    Cc: mesa-stable@lists.freedesktop.org
-------------------------------------------------------------

To be the cause for the SynMark v7 PSPom drop visible on following platforms:
* 2% in BXT J4205, KBL GT3e
* 1-2% on BSW N3050
* 1% on BDW GT2, SKL GT2 & GT4e

This test didn't improve on any other platform and no other test was affected.

(I'm OK if this is closed as WONTFIX, I filed this more to document the perf change.)
Comment 1 Eero Tamminen 2018-05-28 14:24:30 UTC
Bisected commit changed the instruction emit order, which caused GPU hangs with SIMD32.

Following patch fixes the hang / restores the order:
https://patchwork.freedesktop.org/patch/225334/

Regardless of whether it fixes the small perf regression in this bug, I think this can be closed after that commit is in.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.