Bug 81374 - [BSW Regression]igt/drv_hangman/error-state-capture-blt causes "[drm:i915_context_is_banned] *ERROR* gpu hanging too fast, banning!" sporadically
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: unspecified
Hardware: Other All
Importance: highest normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-07-15 07:51 UTC by Guo Jinxian
Modified: 2017-02-10 08:49 UTC
CC List: 2 users

See Also:
i915 platform:
i915 features:


Attachments
dmesg (6.67 KB, text/plain)
2014-07-15 07:51 UTC, Guo Jinxian
dmesg on fixes branch (126.00 KB, text/plain)
2014-07-15 07:52 UTC, Guo Jinxian

Description Guo Jinxian 2014-07-15 07:51:16 UTC
Created attachment 102832
dmesg

==System Environment==
--------------------------
Regression: Yes
On commit of -next-queued(91565c85b66db820f01894a971d39aaef60c4325)
The failure rate is about 1 in 2.

Non-working platforms: BSW

==kernel==
--------------------------
origin/drm-intel-nightly: 2a38e1bcd4dc9523cd723291340226d139bece1b(fails)
    drm-intel-nightly: 2014y-07m-14d-23h-18m-30s integration manifest
origin/drm-intel-next-queued: ebcdf5276e6e2ddef6a4fd7716684030301a6fed(fails)
    drm/i915: Handle failure to kick out a conflicting fb driver
origin/drm-intel-fixes: c6930992948adf0f8fc1f6ff1da51c5002a2cf95(fails)
    Revert "drm/i915: reverse dp link param selection, prefer fast over wide again"

==Bug detailed description==
igt/drv_hangman/error-state-capture-blt causes "[drm:i915_context_is_banned] *ERROR* gpu hanging too fast, banning!" sporadically

Output:
[root@x-bsw01 tests]# ./drv_hangman --run-subtest error-state-capture-blt
IGT-Version: 1.7-ge74dd22 (x86_64) (Linux: 3.16.0-rc5_drm-intel-nightly_2a38e1_20140715+ x86_64)
Test assertion failure function gem_execbuf, file ioctl_wrappers.c:428:
Failed assertion: ret == 0
Last errno: 5, Input/output error
Subtest error-state-capture-blt: FAIL
[root@x-bsw01 tests]# dmesg -r|egrep "<[1-3>]"|grep drm
<3>[  180.759598] [drm:i915_context_is_banned] *ERROR* gpu hanging too fast, banning!

On the fixes branch, the failure and the error messages are different.

Output:
[root@x-bsw01 tests]# ./drv_hangman --run-subtest error-state-capture-blt
IGT-Version: 1.7-ge74dd22 (x86_64) (Linux: 3.16.0-rc5_drm-intel-fixes_c69309_20140715+ x86_64)
Test assertion failure function gem_quiescent_gpu, file drmtest.c:153:
Failed assertion: drmIoctl((fd), ((((1U) << (((0+8)+8)+14)) | ((('d')) << (0+8)) | (((0x40 + 0x29)) << 0) | ((((sizeof(struct drm_i915_gem_execbuffer2)))) << ((0+8)+8)))), (&execbuf)) == 0
Last errno: 5, Input/output error
Subtest error-state-capture-blt: FAIL
[root@x-bsw01 tests]# dmesg -r|egrep "<[1-3>]"|grep drm
<3>[  134.207607] [drm:__vlv_force_wake_get] *ERROR* Timed out: Render forcewake old ack to clear.
<3>[  134.217607] [drm:__vlv_force_wake_get] *ERROR* Timed out: Media forcewake old ack to clear.
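
The long expression in the failed assertion above is the preprocessor expansion of an ioctl request number. As a rough sketch, it can be evaluated with the shifts copied from that expansion. The 64-byte size of struct drm_i915_gem_execbuffer2 is an assumption taken from the i915 uapi header, not from this report, and the helper name ioc is purely illustrative.

```python
# Evaluate the macro-expanded ioctl request number from the assertion text.
# Shifts are copied from the expansion: (0+8), ((0+8)+8), (((0+8)+8)+14).
IOC_NRSHIFT = 0
IOC_TYPESHIFT = IOC_NRSHIFT + 8
IOC_SIZESHIFT = IOC_TYPESHIFT + 8
IOC_DIRSHIFT = IOC_SIZESHIFT + 14

# ASSUMPTION: sizeof(struct drm_i915_gem_execbuffer2) is 64 bytes.
EXECBUFFER2_SIZE = 64


def ioc(direction, type_char, nr, size):
    """Pack direction, type, number and size into one request value."""
    return (
        (direction << IOC_DIRSHIFT)
        | (ord(type_char) << IOC_TYPESHIFT)
        | (nr << IOC_NRSHIFT)
        | (size << IOC_SIZESHIFT)
    )


# (1U) is the direction field, 'd' the type, 0x40 + 0x29 the command number.
request = ioc(1, "d", 0x40 + 0x29, EXECBUFFER2_SIZE)
print(hex(request))  # 0x40406469 under the size assumption above
```

Read this way, the gem_quiescent_gpu failure on the fixes branch is an execbuffer2 ioctl returning errno 5, the same errno that gem_execbuf reports on nightly.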


==Reproduce steps==
---------------------------- 
1. ./drv_hangman --run-subtest error-state-capture-blt
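
Because the failure is sporadic (about 1 in 2), a single run of the step above may pass. A minimal sketch for estimating the failure rate over several runs follows. The helper names run_once and failure_rate are hypothetical, and treating any nonzero exit status as a failure is an assumption.

```python
import subprocess

# Command taken from the reproduce step; run from the IGT tests directory.
CMD = ["./drv_hangman", "--run-subtest", "error-state-capture-blt"]


def run_once(cmd=CMD):
    """Run the subtest once; True means exit status 0 (assumed to mean pass)."""
    return subprocess.run(cmd).returncode == 0


def failure_rate(attempts, runner=run_once):
    """Run the test `attempts` times and return the fraction of failing runs."""
    failures = sum(1 for _ in range(attempts) if not runner())
    return failures / attempts


# Example use on the affected machine:
#   print(f"failure rate: {failure_rate(10):.0%}")
```

The runner argument exists only so the counting logic can be exercised without the hardware.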
Comment 1 Guo Jinxian 2014-07-15 07:52:03 UTC
Created attachment 102833
dmesg on fixes branch
Comment 2 Guo Jinxian 2014-07-15 07:57:56 UTC
On commit of -next-queued(91565c85b66db820f01894a971d39aaef60c4325), the test still fails, but the dmesg error could not be reproduced.

Output:
[root@x-bsw01 tests]# ./drv_hangman --run-subtest error-state-capture-blt
IGT-Version: 1.7-ge74dd22 (x86_64) (Linux: 3.15.0-rc8_drm-intel-next-queued_91565c_20140701+ x86_64)
Test assertion failure function check_error_state, file drv_hangman.c:302:
Failed assertion: strstr(ring_name, expected_ring_name)
Subtest error-state-capture-blt: FAIL
[root@x-bsw01 tests]# dmesg -r|egrep "<[1-3>]"|grep drm
Comment 3 Guo Jinxian 2014-07-17 07:26:40 UTC
This bug is still reproducible on the latest -nightly(f326cd0fae7e74011d3c0a8bd2be2cd2e2a45407) on BDW when running the test igt/kms_pipe_crc_basic/hang-read-crc-pipe-C.

[root@x-bdw02 tests]# ./kms_pipe_crc_basic --run-subtest hang-read-crc-pipe-C
IGT-Version: 1.7-gb6ed9fb (x86_64) (Linux: 3.16.0-rc5_drm-intel-nightly_f326cd_20140717_debug+ x86_64)
Test assertion failure function gem_execbuf, file ioctl_wrappers.c:403:
Failed assertion: ret == 0
Last errno: 5, Input/output error
Subtest hang-read-crc-pipe-C: FAIL
[root@x-bdw02 tests]# dmesg -r | egrep "<[1-4]>" |grep drm
<3>[  264.308329] [drm:i915_context_is_banned] *ERROR* gpu hanging too fast, banning!
<3>[  270.319426] [drm:i915_context_is_banned] *ERROR* gpu hanging too fast, banning!
Comment 4 Guo Jinxian 2014-08-22 05:45:55 UTC
The test still fails on the latest -nightly(257d90d13794c2eb545ab0d6c708f21e2a0378b6) on BSW.

[root@x-bsw01 tests]# ./gem_reloc_vs_gpu --run-subtest forked-interruptible-faulting-reloc
IGT-Version: 1.7-g4d2f511 (x86_64) (Linux: 3.17.0-rc1_drm-intel-nightly_257d90_20140822+ x86_64)
Test assertion failure function gem_execbuf, file ioctl_wrappers.c:437:
Failed assertion: ret == 0
Last errno: 2, No such file or directory
child 6 failed with exit status 99
Subtest forked-interruptible-faulting-reloc: FAIL
Comment 5 Jani Nikula 2014-09-08 15:52:02 UTC
Please bisect.
Comment 6 Guo Jinxian 2014-09-22 08:38:43 UTC
The test passes on the latest nightly(c5660b4ad395f1e34eacc22cf81c687edfc9c83c).


[root@x-bsw01 tests]# ./drv_hangman --run-subtest error-state-capture-blt
IGT-Version: 1.8-g25c55d3 (x86_64) (Linux: 3.17.0-rc5_drm-intel-nightly_c5660b_20140922+ x86_64)
Subtest error-state-capture-blt: SUCCESS (30.008s)
[root@x-bsw01 tests]# dmesg -r|egrep "<[1-4]>"|grep drm
[root@x-bsw01 tests]#
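
The dmesg check used throughout this report (raw output filtered by log level, then by "drm") can be sketched in Python as below. The function name drm_errors is hypothetical. Unlike the egrep pattern, this version only accepts the level prefix at the start of the line, and it assumes the raw prefix is the plain log level.

```python
import re

# Raw dmesg lines (dmesg -r) start with a "<N>" prefix.
# ASSUMPTION: N is the plain log level, as in the lines quoted in this report.
LEVEL_RE = re.compile(r"^<(\d+)>")


def drm_errors(raw_lines, max_level=4):
    """Return the lines with level 1..max_level that mention 'drm'."""
    selected = []
    for line in raw_lines:
        match = LEVEL_RE.match(line)
        if match and 1 <= int(match.group(1)) <= max_level and "drm" in line:
            selected.append(line)
    return selected


# The first line is quoted from the Description.
# The second is a made-up info-level line used only for contrast.
sample = [
    "<3>[  180.759598] [drm:i915_context_is_banned] *ERROR* gpu hanging too fast, banning!",
    "<6>[    1.000000] [drm] hypothetical informational message",
]
print(drm_errors(sample))
```

An empty result corresponds to the clean run in this comment, where the grep prints nothing.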
Comment 7 Jari Tahvanainen 2017-02-10 08:49:40 UTC
Closing (>2 years) old Verified+Fixed.

