Bug 84444 - [BDW Bisected]Smokin-Guns v1.1 and urbanterror v4.1 performance reduced ~15%
Summary: [BDW Bisected]Smokin-Guns v1.1 and urbanterror v4.1 performance reduced ~15%
Status: CLOSED WONTFIX
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: All Linux (All)
: high major
Assignee: Mika Kuoppala
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-09-29 03:09 UTC by lili
Modified: 2015-01-06 02:40 UTC (History)
3 users (show)

See Also:
i915 platform:
i915 features:


Attachments
dmesg.log (46.81 KB, text/plain)
2014-09-29 03:09 UTC, lili
no flags Details
drm/i915/bdw: Enable single subspan dispatch (2.06 KB, patch)
2014-10-01 13:21 UTC, Mika Kuoppala
no flags Details | Splinter Review

Description lili 2014-09-29 03:09:54 UTC
Created attachment 107034 [details]
dmesg.log

Environment:
-----------------------------------
Platform:BDW
Libdrm:(master)libdrm-2.4.56-29-g666788a6062de62aa0b3560760fbb0903167a319
Mesa:(master)5ccdc23a86978628449ace0058cd3697020b0c94
Xserver:(master)xorg-server-1.16.0-335-gcc59be38b7eff52a1d003b
Xf86_video_intel:(master)2.99.916-68-gf785035d5bd42a778d4be0bf3ff8678bd7a7e503
Cairo:(master)fbb0a260b707cb5f02a14cc368c6f2f0d63564c3
Libva:(master)5faa5f50382af6d2f58ba07bbc64d2e9e63abad9
Libva_intel_driver:(master)925c98afcd381e52b37eb3870c3c80ff9c59a069
Kernel:(drm-intel-nightly)7101d84020f63f1da7e0c5d021cdd6be4d515de5

Bug detailed description:
---------------------------------------------
Smokin-Guns v1.1 and urbanterror v4.1 performance reduced ~15% on BDW.This problem exist both on gnome-session and Raw X,it doesn't exist on IVB/HSW/BYT-M.
It's kernel (drm-intel-next-queued)regression,bisect result first bad commit is:
commit d37cf5f7e1b315585940a735a8508d955ffc0f16
Author:     Mika Kuoppala <mika.kuoppala@linux.intel.com>
AuthorDate: Fri Sep 19 20:05:26 2014 +0300
Commit:     Daniel Vetter <daniel.vetter@ffwll.ch>
CommitDate: Wed Sep 24 10:38:41 2014 +0200

    drm/i915/bdw: Cleanup pre prod workarounds

BTW,this issue also exist on drm-intel-nightly branch.

Reproduce steps:
---------------------------------------------
1.	xinit&
2.	vblank_mode=0 ./smokinguns.x86_64 +set r_fullscreen 1 +timedemo 1 +set demodone "quit" +set demoloop1 "demo pts; set nextdemo vstr demodone" +vstr demoloop1 +setr_customwidth 1920 +set r_customheight 1080
Comment 1 Chris Wilson 2014-09-29 07:05:46 UTC
Assigning to Mika for fun.
Comment 2 Rodrigo Vivi 2014-09-29 20:54:27 UTC
Hi lili, what BDW stepping are you using?
Comment 3 wendy.wang 2014-09-30 01:24:49 UTC
(In reply to comment #2)
> Hi lili, what BDW stepping are you using?

Both BDW-GT2-E2 and BDW-GT2-F0 two SKUs have this issue, they are using the same CPU stepping: stepping 4
Detail BDW-GT2-E2 CPU info: E2 2.1GHz 2Cores/4Thread 4 QGZ9 (Processor number: i3-5010U) 
Detail BDW-GT2-F0 CPU info: F0 i5-5500U 2.4GHz 2Cores/4Thread 4 QQC5
Comment 4 lili 2014-09-30 01:40:16 UTC
(In reply to comment #2)
> Hi lili, what BDW stepping are you using?

Hi Vivi,
please refer to reproduce steps:
---------------------------------------------
1.	xinit&
2.	vblank_mode=0 ./smokinguns.x86_64 +set r_fullscreen 1 +timedemo 1 +set demodone "quit" +set demoloop1 "demo pts; set nextdemo vstr demodone" +vstr demoloop1 +setr_customwidth 1920 +set r_customheight 1080
Comment 5 Gordon Jin 2014-09-30 07:37:27 UTC
(In reply to comment #0)
> Bug detailed description:
> ---------------------------------------------
> Smokin-Guns v1.1 and urbanterror v4.1 performance reduced ~15% on BDW.This
> problem exist both on gnome-session and Raw X,it doesn't exist on
> IVB/HSW/BYT-M.
> It's kernel (drm-intel-next-queued)regression,bisect result first bad commit
> is:
> commit d37cf5f7e1b315585940a735a8508d955ffc0f16
> Author:     Mika Kuoppala <mika.kuoppala@linux.intel.com>
> AuthorDate: Fri Sep 19 20:05:26 2014 +0300
> Commit:     Daniel Vetter <daniel.vetter@ffwll.ch>
> CommitDate: Wed Sep 24 10:38:41 2014 +0200
> 
>     drm/i915/bdw: Cleanup pre prod workarounds
> 

This patches resolves bug#83482. Do we see that improvement on this machine?
Comment 6 lili 2014-09-30 09:14:25 UTC
(In reply to comment #5)
> (In reply to comment #0)
> > Bug detailed description:
> > ---------------------------------------------
> > Smokin-Guns v1.1 and urbanterror v4.1 performance reduced ~15% on BDW.This
> > problem exist both on gnome-session and Raw X,it doesn't exist on
> > IVB/HSW/BYT-M.
> > It's kernel (drm-intel-next-queued)regression,bisect result first bad commit
> > is:
> > commit d37cf5f7e1b315585940a735a8508d955ffc0f16
> > Author:     Mika Kuoppala <mika.kuoppala@linux.intel.com>
> > AuthorDate: Fri Sep 19 20:05:26 2014 +0300
> > Commit:     Daniel Vetter <daniel.vetter@ffwll.ch>
> > CommitDate: Wed Sep 24 10:38:41 2014 +0200
> > 
> >     drm/i915/bdw: Cleanup pre prod workarounds
> > 
> 
> This patches resolves bug#83482. Do we see that improvement on this machine?

This patches did resolved bug#83482 issue, and it's pushed into commit d37cf5, this commit is culprit of bug#84444 by bisect. Otherwise bug#83482 && bug#84444 are not related. 

Developers show make sure whether their's patch cause bug#84444 exist.
Comment 7 Mika Kuoppala 2014-10-01 13:21:01 UTC
Created attachment 107188 [details] [review]
drm/i915/bdw: Enable single subspan dispatch
Comment 8 Mika Kuoppala 2014-10-01 13:23:08 UTC
Patch for testing. However we need more broad testing to be sure that this patch doesn't make things worse on other workloads.
Comment 9 lili 2014-10-08 08:02:50 UTC
(In reply to Mika Kuoppala from comment #8)
> Patch for testing. However we need more broad testing to be sure that this
> patch doesn't make things worse on other workloads.

This issue has been fixed with this patch,about if this patch will make things worse on other workloads,I will verify it and update the result later.
Comment 10 lili 2014-10-10 07:50:24 UTC
(In reply to lili from comment #9)
> (In reply to Mika Kuoppala from comment #8)
> Patch for testing. However we
> need more broad testing to be sure that this
> patch doesn't make things
> worse on other workloads.

This issue has been fixed with this patch,about
> if this patch will make things worse on other workloads,I will verify it and
> update the result later.

Installing the patch, the issue is fixed. I have run other cases and there is no big change compared to before test results.
Comment 11 wendy.wang 2014-10-15 05:13:25 UTC
pls help merge up the patch, thanks.
Comment 12 Eero Tamminen 2014-10-15 12:28:38 UTC
(In reply to lili from comment #10)
> This issue has been fixed with this patch

This was a test patch, not necessarily a fix.  Performance should be *in general* better with the multiple subspans (current state), but there can be corner-cases where single subspan (the attached patch) happens to provide a bit better performance.


>> about
>> if this patch will make things worse on other workloads,I will verify it and
>> update the result later.
> 
> Installing the patch, the issue is fixed. I have run other cases and there
> is no big change compared to before test results.

This is unexpected result.  Did you run all tests, or just a subset of them?
Comment 13 lili 2014-10-17 08:43:52 UTC
(In reply to Eero Tamminen from comment #12)
> (In reply to lili from comment #10)
> > This issue has been fixed with this patch
> 
> This was a test patch, not necessarily a fix.  Performance should be *in
> general* better with the multiple subspans (current state), but there can be
> corner-cases where single subspan (the attached patch) happens to provide a
> bit better performance.
> 
> 
> >> about
> >> if this patch will make things worse on other workloads,I will verify it and
> >> update the result later.
> > 
> > Installing the patch, the issue is fixed. I have run other cases and there
> > is no big change compared to before test results.
> 
> This is unexpected result.  Did you run all tests, or just a subset of them?


1.We tested some cases and found there is no big change before.

2.This time we installed the patch and tested all of the cases. The performance of the most cases are worse than that without patch,(for exzample:case unigine-sanctuary,reduced ~47%,Half_Life 2 reduced ~137%,....),and minor cases are better.
Comment 14 Eero Tamminen 2014-10-20 09:25:08 UTC
(In reply to lili from comment #13)
> 2.This time we installed the patch and tested all of the cases. The
> performance of the most cases are worse than that without patch,(for
> exzample:case unigine-sanctuary,reduced ~47%,Half_Life 2 reduced
> ~137%,....),and minor cases are better.

Sanctuary shaders won't get anymore compiled with Mesa, so any results with that can be ignored and how one can get >100% performance drop in a test?
Comment 15 wendy.wang 2014-10-22 08:54:32 UTC
(In reply to Eero Tamminen from comment #14)
> (In reply to lili from comment #13)
> > 2.This time we installed the patch and tested all of the cases. The
> > performance of the most cases are worse than that without patch,(for
> > exzample:case unigine-sanctuary,reduced ~47%,Half_Life 2 reduced
> > ~137%,....),and minor cases are better.
> 
> Sanctuary shaders won't get anymore compiled with Mesa, so any results with
> that can be ignored and how one can get >100% performance drop in a test?

Hi Eero, I sent you the full compare test result for the Mika's patch, and base on the each benchmark single cycle test result, we did see after applied Mika's patch, some case did show less than 1/2 FPS vs. without apply patch.
such as: 
Half_Life 2
GLBenchmark v2.7.0 FillTestC24Z16
GLBenchmark v2.7.0 FillTestC24Z16_Offscreen
SynMark2_v6_0_OglFillTexMulti
SynMark2_v6_0_OglPSPhong
Comment 16 Eero Tamminen 2014-10-23 15:09:42 UTC
According to your data, there were couple of cases where performance improved up to 10-30% (smoking guns, urban terror, TF2, zbuffer), with the attached test patch.

However, (as expected), it reduced performance in many more tests and the difference was *much* larger, up to >70% percent (in GLB fill test and sampler bound pixel shader test).

Based on your data, this bug should be closed as WONTFIX.

(ZBuffer being better with single span is a bit worrying, as its performance isn't currently quite what it should be.)
Comment 17 Mika Kuoppala 2014-10-24 11:35:32 UTC
On basis of current performance measurements, the net gain is not positive.
Marking as wontfix. Please reopen if you disagree/have contradicting evidence.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.