Bug 77302 - [ILK SNB IVB HSW BYT BDW]System random hung while run some subcases of gem_reset_stats
Summary: [ILK SNB IVB HSW BYT BDW]System random hung while run some subcases of gem_re...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: Other All
: medium normal
Assignee: Mika Kuoppala
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-04-11 06:17 UTC by Guo Jinxian
Modified: 2017-10-06 14:38 UTC (History)
2 users (show)

See Also:
i915 platform:
i915 features:


Attachments
System random hung (60.85 KB, text/plain)
2014-04-11 06:17 UTC, Guo Jinxian
no flags Details
dmesg on latest -fixes (6.09 KB, text/plain)
2014-04-14 05:58 UTC, Guo Jinxian
no flags Details

Description Guo Jinxian 2014-04-11 06:17:51 UTC
Created attachment 97210 [details]
System random hung

System Environment:
--------------------------
Platform: ILK SNB IVB HSW BYT BDW
kernel:   (drm-intel-nightly)35dc7c8af75a3e5ae52a5469786602ce083a6368

Bug detailed description:
----------------------------
System random hung while run some subcases of gem_reset_stats, like close-pending-fork-render, ban-vebox etc.

The hung rate about 1 out of 6.

Reproduce steps:
---------------------------- 
1.  ./kms_flip --run-subtest close-pending-fork-render
Comment 1 Daniel Vetter 2014-04-11 12:48:34 UTC
Please retest with latest drm-intel-fixes:

commit 691e6415c891b8b2b082a120b896b443531c4d45
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Wed Apr 9 09:07:36 2014 +0100

    drm/i915: Always use kref tracking for all contexts.

Might be related to bug #74100 but unlikely.
Comment 2 Daniel Vetter 2014-04-11 12:49:41 UTC
Adding Mika for more clue.
Comment 3 Guo Jinxian 2014-04-14 05:57:46 UTC
Checked on latest -fixes(b6842feb63a23a6a988f4e1ffb93408d8ff6931e), this bug still able to reproduce.

Make correction

Reproduce steps:
---------------------------- 
./gem_reset_stats --run-subtest close-pending-fork-render
Comment 4 Guo Jinxian 2014-04-14 05:58:35 UTC
Created attachment 97327 [details]
dmesg on latest -fixes
Comment 5 Mika Kuoppala 2014-04-17 10:48:56 UTC
(In reply to comment #4)
> Created attachment 97327 [details]
> dmesg on latest -fixes

What gen was used to get this?

Can you repro this on all ILK SNB IVB HSW BYT BDW?

-Mika
Comment 6 Guo Jinxian 2014-04-18 02:29:13 UTC
(In reply to comment #5)
> (In reply to comment #4)
> > Created attachment 97327 [details]
> > dmesg on latest -fixes
> 
> What gen was used to get this?
> 
> Can you repro this on all ILK SNB IVB HSW BYT BDW?
> 
> -Mika

I tried the test on each platform 10 times on latest -nightly(1e771b84e47085ef9b6efea1321e7cb5a8b2c065), didn't reproduce this issue. But I found a error on ILK platform. the error information shows blow. Thanks.

IGT-Version: 1.6-g78e4c2b (x86_64) (Linux: 3.14.0_drm-intel-nightly_1e771b_20140417+ x86_64)
Subtest close-pending-fork-render: SUCCESS
Test requirement not met in function gem_require_ring, file ioctl_wrappers.c:802:
Last errno: 11, Resource temporarily unavailable
Test requirement: (!((((intel_get_drm_devid(fd)) == 0x0102 || (intel_get_drm_devid(fd)) == 0x0112 || (intel_get_drm_devid(fd)) == 0x0122 || (intel_get_drm_devid(fd)) == 0x0106 || (intel_get_drm_devid(fd)) == 0x0116 || (intel_get_drm_devid(fd)) == 0x0126 || (intel_get_drm_devid(fd)) == 0x010A) || (((intel_get_drm_devid(fd)) == 0x0152 || (intel_get_drm_devid(fd)) == 0x0162 || (intel_get_drm_devid(fd)) == 0x0156 || (intel_get_drm_devid(fd)) == 0x0166 || (intel_get_drm_devid(fd)) == 0x015a || (intel_get_drm_devid(fd)) == 0x016a) || (((intel_get_drm_devid(fd)) == 0x0402 || (intel_get_drm_devid(fd)) == 0x0406 || (intel_get_drm_devid(fd)) == 0x040A || (intel_get_drm_devid(fd)) == 0x040B || (intel_get_drm_devid(fd)) == 0x040E || (intel_get_drm_devid(fd)) == 0x0C02 || (intel_get_drm_devid(fd)) == 0x0C06 || (intel_get_drm_devid(fd)) == 0x0C0A || (intel_get_drm_devid(fd)) == 0x0C0B || (intel_get_drm_devid(fd)) == 0x0C0E || (intel_get_drm_devid(fd)) == 0x0A02 || (intel_get_drm_devid(fd)) == 0x0A06 || (intel_get_drm_devid(fd)) == 0x0A0A || (intel_get_drm_devid(fd)) == 0x0A0B || (intel_get_drm_devid(fd)) == 0x0A0E || (intel_get_drm_devid(fd)) == 0x0D02 || (intel_get_drm_devid(fd)) == 0x0D06 || (intel_get_drm_devid(fd)) == 0x0D0A || (intel_get_drm_devid(fd)) == 0x0D0B || (intel_get_drm_devid(fd)) == 0x0D0E) || ((intel_get_drm_devid(fd)) == 0x0412 || (intel_get_drm_devid(fd)) == 0x0416 || (intel_get_drm_devid(fd)) == 0x041A || (intel_get_drm_devid(fd)) == 0x041B || (intel_get_drm_devid(fd)) == 0x041E || (intel_get_drm_devid(fd)) == 0x0C12 || (intel_get_drm_devid(fd)) == 0x0C16 || (intel_get_drm_devid(fd)) == 0x0C1A || (intel_get_drm_devid(fd)) == 0x0C1B || (intel_get_drm_devid(fd)) == 0x0C1E || (intel_get_drm_devid(fd)) == 0x0A12 || (intel_get_drm_devid(fd)) == 0x0A16 || (intel_get_drm_devid(fd)) == 0x0A1A || (intel_get_drm_devid(fd)) == 0x0A1B || (intel_get_drm_devid(fd)) == 0x0A1E || (intel_get_drm_devid(fd)) == 0x0D12 || (intel_get_drm_devid(fd)) == 0x0D16 || (intel_get_drm_devid(fd)) == 0x0D1A || (intel_get_drm_devid(fd)) == 0x0D1B || (intel_get_drm_devid(fd)) == 0x0D1E) || ((intel_get_drm_devid(fd)) == 0x0422 || (intel_get_drm_devid(fd)) == 0x0426 || (intel_get_drm_devid(fd)) == 0x042A || (intel_get_drm_devid(fd)) == 0x042B || (intel_get_drm_devid(fd)) == 0x042E || (intel_get_drm_devid(fd)) == 0x0C22 || (intel_get_drm_devid(fd)) == 0x0C26 || (intel_get_drm_devid(fd)) == 0x0C2A || (intel_get_drm_devid(fd)) == 0x0C2B || (intel_get_drm_devid(fd)) == 0x0C2E || (intel_get_drm_devid(fd)) == 0x0A22 || (intel_get_drm_devid(fd)) == 0x0A26 || (intel_get_drm_devid(fd)) == 0x0A2A || (intel_get_drm_devid(fd)) == 0x0A2B || (intel_get_drm_devid(fd)) == 0x0A2E || (intel_get_drm_devid(fd)) == 0x0D22 || (intel_get_drm_devid(fd)) == 0x0D26 || (intel_get_drm_devid(fd)) == 0x0D2A || (intel_get_drm_devid(fd)) == 0x0D2B || (intel_get_drm_devid(fd)) == 0x0D2E)) || ((intel_get_drm_devid(fd)) == 0x0f30 || (intel_get_drm_devid(fd)) == 0x0f31 || (intel_get_drm_devid(fd)) == 0x0f32 || (intel_get_drm_devid(fd)) == 0x0f33)) || ((((intel_get_drm_devid(fd)) & 0xff00) != 0x1600) ? 0 : ((((intel_get_drm_devid(fd)) & 0x00f0) >> 4) > 3) ? 0 : (((intel_get_drm_devid(fd)) & 0x000f) == 0x2) ? 1 : (((intel_get_drm_devid(fd)) & 0x000f) == 0x6) ? 1 : (((intel_get_drm_devid(fd)) & 0x000f) == 0xb) ? 1 : (((intel_get_drm_devid(fd)) & 0x000f) == 0xa) ? 1 : (((intel_get_drm_devid(fd)) & 0x000f) == 0xd) ? 1 : (((intel_get_drm_devid(fd)) & 0x000f) == 0xe) ? 1 : 0))))
Comment 7 Guo Jinxian 2014-04-25 03:19:37 UTC
./gem_reset_stats --run-subtest ban-vebox cause HSW hang on latest -nightly(6c398a53417fd96d9e58bdb618e395163c81e1c4).
Comment 8 Mika Kuoppala 2014-05-08 07:18:06 UTC
(In reply to comment #7)
> ./gem_reset_stats --run-subtest ban-vebox cause HSW hang on latest
> -nightly(6c398a53417fd96d9e58bdb618e395163c81e1c4).

System hang or gpu hang?

Please retest with latest nightly.
Comment 9 Guo Jinxian 2014-05-09 08:55:03 UTC
(In reply to comment #8)
> (In reply to comment #7)
> > ./gem_reset_stats --run-subtest ban-vebox cause HSW hang on latest
> > -nightly(6c398a53417fd96d9e58bdb618e395163c81e1c4).
> 
> System hang or gpu hang?
> 
> Please retest with latest nightly.

It's system hung, only found on HSW ULT.

./gem_reset_stats --run-subtest close-pending-fork-render failure didn't reproduce on latest -nightly(dd28119c31cf06fc4c3bb548699018a91e45a676).
Comment 10 Daniel Vetter 2014-05-15 16:11:43 UTC
(In reply to comment #9)
> (In reply to comment #8)
> > (In reply to comment #7)
> > > ./gem_reset_stats --run-subtest ban-vebox cause HSW hang on latest
> > > -nightly(6c398a53417fd96d9e58bdb618e395163c81e1c4).
> > 
> > System hang or gpu hang?
> > 
> > Please retest with latest nightly.
> 
> It's system hung, only found on HSW ULT.

Only on HSW ULT? Are all the other platforms mentioned in the bug headline now solid?
Comment 11 Mika Kuoppala 2014-05-20 15:53:52 UTC
Based on https://bugs.freedesktop.org/show_bug.cgi?id=77302#c9

Please reopen if still an issue.
Comment 12 Guo Jinxian 2014-06-18 06:11:26 UTC
Verified on latest -next-queued
Output:
root@x-byt06:/GFX/Test/Intel_gpu_tools/intel-gpu-tools/tests# ./gem_reset_stats --run-subtest close-pending-fork-render
IGT-Version: 1.7-g1b1f4b1 (x86_64) (Linux: 3.15.0-rc8_drm-intel-next-queued_27b6c1_20140618+ x86_64)
Subtest close-pending-fork-render: SUCCESS
Test requirement not met in function gem_require_ring, file ioctl_wrappers.c:820:
Last errno: 0, Success
Test requirement: (!(gem_has_vebox(fd)))
root@x-byt06:/GFX/Test/Intel_gpu_tools/intel-gpu-tools/tests# echo $?
0
Comment 13 Elizabeth 2017-10-06 14:38:43 UTC
Closing old verified.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.