Bug 86452 - [BDW Bisected] Suspend entry and suspend resume responsive time increased by 78% and 58%
Summary: [BDW Bisected] Suspend entry and suspend resume responsive time increased by ...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: Other All
: high major
Assignee: Mika Kuoppala
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-11-19 02:02 UTC by wendy.wang
Modified: 2017-10-06 14:33 UTC (History)
3 users (show)

See Also:
i915 platform:
i915 features:


Attachments

Description wendy.wang 2014-11-19 02:02:37 UTC
System Environment
BDW RVP FAB4 board
CPU: F stepping i5-5300U 2.3Ghz, 2Cores/4ThreadS, QH3F
BIOS: V93

Whole gfx sw stack:
=======
Libdrm: (master)libdrm-2.4.58-4-g00847fa48b83a85b0cb882594a12ed1511f780db
Mesa: (master)2b6e703863f7e5469ba8d82003282cc42f277d5e
Xserver: (master)xorg-server-1.16.99.901-3-g63bb5c5ef16edf652179770294dcca4fc07dc992
Xf86_video_intel: (master)2.99.916-144-gf9f85b88032b2458d4d24ceca450fdaa65e6b94d
Cairo: (master)121f384c0e231c9c5d9c937b216d342bfc7810a6
Libva: (master)ccd93de5a707e92a629cccd595757c8d436fa3cc
Libva_intel_driver: (master)4bdbc1ea4a875a21caa628ce27345528a2cb0774
Kernel: (drm-intel-nightly)22ba72f728eaf30e63ded0c0bb0484b4bc6f92d4


Regression?
=======
YES

Bug detailed description
=======
Execute responsiveness testing, suspend entry time increased by 78%, suspend resume time increased by 58%

From bisect ‘ drm-intel-next-queued ’ branch result, it's kernel regression, the first bad commit is:

commit 8dcfd18343a56fd2277968f6b98f24b1669d2238
Author: Mika Kuoppala <mika.kuoppala@linux.intel.com>
Date: Mon Nov 10 04:52:50 2014 -0800
drm/i915: Wait thread status on gen8+ fw sequence
As per latest pm guide, we need to do this also on
past hsw.
Cc: Ville Syrjälä <ville.syrjala@linux.intel.com>
Cc: Chris Wilson <chris@chris-wilson.co.uk>
Cc: Damien Lespiau <damien.lespiau@intel.com>
Signed-off-by: Mika Kuoppala <mika.kuoppala@intel.com>
Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
Reviewed-by: Ville Syrjälä <ville.syrjala@linux.intel.com>
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>


Reproduce Steps
==============
1. tsocks git clone mmm.sh.intel.com:/work/responsiveness or visit the ip address: 10.239.36.58:/work/responsiveness
2. execute resp.sh script
Comment 1 Daniel Vetter 2014-11-19 09:23:36 UTC
Oh dang, sounds like we need to resurrect forcewake hysteresis.

Chris, do you still have the patches handy for the delayed put from mmio writes? Mika, could you perhaps refresh those?
Comment 2 Daniel Vetter 2014-11-19 09:24:15 UTC
Note: Since the is a w/a we can't just revert the commit for obvious reasons ...
Comment 4 wendy.wang 2014-11-21 08:52:10 UTC
Hello Chris,
Would you pls gudie me which branch kernel commit should I based on to apply you pasted three patches.

I tried to apply your patches based on drm-intel-next-queued branch(this bug's first bad commit), or drm-intel-next-queued branch latest commit 
(commit 89a35ecdc6aa5a88165313ca5cfd52b8e8e7fbbd
Author: Jesse Barnes <jbarnes@virtuousgeek.org>
Date:   Thu Nov 20 13:24:13 2014 -0800

    drm/i915/g4x: fix g4x infoframe readout

    Need to check the port too.

    Reported-by: Daniel Vetter <daniel@ffwll.ch>
    Signed-off-by: Jesse Barnes <jbarnes@virtuousgeek.org>
    Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
) 

or drm-intel-nighly branch latest commit(commit 18748be7c96accc27327423c384f86a8fae99c35
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Thu Nov 20 22:59:05 2014 +0100

    drm-intel-nightly: 2014y-11m-20d-21h-58m-44s UTC integration manifest

)

apply patches all fail because of conflict, especially for 2nd and 3rd patches.

Thanks.
Comment 5 Mika Kuoppala 2014-11-27 18:11:38 UTC
Wendy,

http://cgit.freedesktop.org/~miku/drm-intel/log/?h=uncore_cleanup

Work in progress but in case you want to try out.
Comment 6 wendy.wang 2014-11-28 02:23:55 UTC
(In reply to Mika Kuoppala from comment #5)
> Wendy,
> 
> http://cgit.freedesktop.org/~miku/drm-intel/log/?h=uncore_cleanup
> 
> Work in progress but in case you want to try out.

Hello Mika, 
Shall we checkout "bdw_forcewake" branch?
Comment 7 Mika Kuoppala 2014-11-28 13:52:12 UTC
(In reply to wendy.wang from comment #6)
> (In reply to Mika Kuoppala from comment #5)
> > Wendy,
> > 
> > http://cgit.freedesktop.org/~miku/drm-intel/log/?h=uncore_cleanup
> > 
> > Work in progress but in case you want to try out.
> 
> Hello Mika, 
> Shall we checkout "bdw_forcewake" branch?

uncore_cleanup
Comment 8 wendy.wang 2014-12-04 01:35:53 UTC
(In reply to Mika Kuoppala from comment #7)
> (In reply to wendy.wang from comment #6)
> > (In reply to Mika Kuoppala from comment #5)
> > > Wendy,
> > > 
> > > http://cgit.freedesktop.org/~miku/drm-intel/log/?h=uncore_cleanup
> > > 
> > > Work in progress but in case you want to try out.
> > 
> > Hello Mika, 
> > Shall we checkout "bdw_forcewake" branch?
> 
> uncore_cleanup

Hello Mika, your uncore_cleanup branch commit(59d3f39e57b3e40f7fd249598589910c39270d94)  can fix this regression bug: both suspend and suspend resume responsiveness can back to the values before regression.
Comment 9 zhipeng.Zheng 2014-12-05 05:04:10 UTC
verfied on drm-intel-nightly
	
commit 63df8858af984ee11e32579a5d3cf2db17baf942
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Tue Dec 2 16:43:51 2014 +0100

    drm-intel-nightly: 2014y-12m-02d-15h-43m-26s UTC integration manifest


bug has fixed ,close it
Comment 10 Mika Kuoppala 2014-12-08 18:38:55 UTC
No patches from the uncore_cleanup branch was posted on the mailing list when this got fixed.

So we have absolutely no idea what was the commit that fixed this. Would be very important to figure out why the symptoms disappeared.

So could we get a reverse bisect pointing to a commit that fixed the symptomps. Thanks.
Comment 11 wendy.wang 2014-12-09 06:14:26 UTC
(In reply to Mika Kuoppala from comment #10)
> No patches from the uncore_cleanup branch was posted on the mailing list
> when this got fixed.
> 
> So we have absolutely no idea what was the commit that fixed this. Would be
> very important to figure out why the symptoms disappeared.
> 
> So could we get a reverse bisect pointing to a commit that fixed the
> symptomps. Thanks.

Hello Mika,

We've try to bisect which commit fixed this bug on 2014-12-05, but when we try to restore 2014-12-05 test environment onto the same machine, suspend entry and suspend resume responsive time regression bug still exist, cannot find the good commit any more, though our Tinderbox responsiveness history trend data did show that day has normal values, but we cannot reproduce today after checked all the testing conditions, so re-open this bug.
Comment 12 wendy.wang 2014-12-26 01:40:33 UTC
Verified BDW with latest kernel on 2014-12-25 drm-intel-nightly branch,
this responsiveness bug has been fixed, all the data recovered to the values before regression.

2014-12-25 GFX SW stack
 Libdrm:		(master)libdrm-2.4.58-19-gf99522e678dbbaffeca9462a8edcbe900574dc12
 Mesa:		(master)f332cf92b69e52de3cb7c3088ad1efd2e291bb88
 Xserver:		(master)xorg-server-1.16.99.901-102-g826e7c2b36f192fbbe7ddff37eb559f4d6301146
 Xf86_video_intel:		(master)2.99.917
 Cairo:		(master)8e11a42e3e9b679dce97ac45cd8b47322536a253
 Libva:		(master)8986ec692b19d8dd6bd2aa118b5dffbd05a8f909
 Libva_intel_driver:		(master)b5d6d9d425a6d539b27d22992bda05f79d1a0622
 Kernel:   (drm-intel-nightly)4fa23142a15526f4a4b5df61f26eacdd558a849a
Comment 13 Elizabeth 2017-10-06 14:33:39 UTC
Closing old verified.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.