| Summary: | [BDW BYT SKL HSW IVB]Regression] [GPU Hang] with gem_reset_stats ban- and reset-stats- subtests | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Product: | DRI | Reporter: | cprigent <christophe.prigent> | ||||||||||
| Component: | IGT | Assignee: | Default DRI bug account <dri-devel> | ||||||||||
| Status: | CLOSED FIXED | QA Contact: | |||||||||||
| Severity: | critical | ||||||||||||
| Priority: | high | CC: | intel-gfx-bugs | ||||||||||
| Version: | unspecified | ||||||||||||
| Hardware: | x86-64 (AMD64) | ||||||||||||
| OS: | Linux (All) | ||||||||||||
| Whiteboard: | |||||||||||||
| i915 platform: | BDW, BYT, HSW, IVB, SKL | i915 features: | GPU hang | ||||||||||
| Attachments: |
|
||||||||||||
|
Description
cprigent
2016-09-15 12:55:01 UTC
Created attachment 126549 [details]
BDW--gem_reset_stats--ban---kern.log
Tests were Pass with: Kernel: 4.8.0-rc4 9baa666 from http://cgit.freedesktop.org/drm-intel/ commit 9baa666b3e48f71b46c5f63541f57d2a95a1b1c0 Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Sat Sep 3 13:12:38 2016 +0100 drm-intel-nightly: 2016y-09m-03d-12h-12m-15s UTC integration manifest libdrm-2.4.70-6 4462303 from git://anongit.freedesktop.org/mesa/drm mesa: mesa-11.2.2 3a9f628from git://anongit.freedesktop.org/mesa/mesa cairo 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo xorg-server-1.18.0-546 deae9c7 from git://git.freedesktop.org/git/xorg/xserver xf86-video-intel 2.99.917-701 205146b from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel libva-1.7.0-47 2ebf897 from git://git.freedesktop.org/git/vaapi/libva vaapi-intel-driver: 1.7.0-95 1817bee from git://git.freedesktop.org/git/vaapi/intel-driver Intel-Gpu-Tools 1.16 6bd4208 from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git /sys/class/drm/card0/error is empty Created attachment 126550 [details]
BDW--gem_reset_stats-reset-stats----kern.log
Reproduced with reset-stats- subcases:
igt@gem_reset_stats@reset-stats-blt
igt@gem_reset_stats@reset-stats-bsd2
igt@gem_reset_stats@reset-stats-ctx-render
igt@gem_reset_stats@reset-stats-default
igt@gem_reset_stats@reset-stats-render
igt@gem_reset_stats@reset-stats-vebox
Created attachment 126551 [details]
BDW--gem_reset_stats-reset-stats----output
The tests require updating. Reproduced with fresh setup IGT 1.16 200237a and kernel 4.8-rc7 commit 0e34cb5b35f0f837219495c402073141481b1b90 Author: Jani Nikula <jani.nikula@intel.com> Date: Mon Sep 19 18:39:27 2016 +0300 drm-intel-nightly: 2016y-09m-19d-15h-38m-53s UTC integration manifest # ./gem_reset_stats --r reset-stats-blt IGT-Version: 1.16-g200237a (x86_64) (Linux: 4.8.0-rc7-nightly+ x86_64) 2:0 expected 'Pending' [2], found 'No error' [0] (gem_reset_stats:16915) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:242: (gem_reset_stats:16915) CRITICAL: Failed assertion: _assert_reset_status(i, fd[i], 0, (1 << 1)) == 0 Stack trace: #0 [__igt_fail_assert+0xf1] #1 [test_rs.constprop.9+0x158] #2 [__real_main781+0x593] #3 [main+0x29] #4 [__libc_start_main+0xf0] #5 [_start+0x29] #6 [<unknown>+0x29] Subtest reset-stats-blt failed. **** DEBUG **** (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) DEBUG: num fds=4, hang index=1 (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) ioctl-wrappers-DEBUG: Test requirement passed: gem_has_ring(fd, ring) (gem_reset_stats:16915) ioctl-wrappers-DEBUG: Test requirement passed: has_ban_period (gem_reset_stats:16915) igt-gt-DEBUG: Test requirement passed: has_gpu_reset(fd) (gem_reset_stats:16915) igt-gt-DEBUG: Test requirement passed: ctx == 0 || ring == I915_EXEC_RENDER (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) INFO: 2:0 expected 'Pending' [2], found 'No error' [0] (gem_reset_stats:16915) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:242: (gem_reset_stats:16915) CRITICAL: Failed assertion: _assert_reset_status(i, fd[i], 0, (1 << 1)) == 0 **** END **** Subtest reset-stats-blt: FAIL (9.416s) [ 277.418448] gem_reset_stats: executing [ 277.418613] [drm:i915_gem_open] [ 277.418748] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.418753] [drm:i915_pages_create_for_stolen] offset=0x17d1000, size=16384 [ 277.418923] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.418946] [drm:i915_pages_create_for_stolen] offset=0x17d5000, size=16384 [ 277.418976] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.418979] [drm:i915_pages_create_for_stolen] offset=0x17d9000, size=16384 [ 277.419003] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419006] [drm:i915_pages_create_for_stolen] offset=0x17dd000, size=16384 [ 277.419249] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419252] [drm:i915_pages_create_for_stolen] offset=0x17e1000, size=16384 [ 277.419478] [drm:i915_gem_open] [ 277.419614] gem_reset_stats: starting subtest reset-stats-blt [ 277.419638] [drm:i915_gem_open] [ 277.419714] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419720] [drm:i915_pages_create_for_stolen] offset=0x17e5000, size=16384 [ 277.419830] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419834] [drm:i915_pages_create_for_stolen] offset=0x17e9000, size=16384 [ 277.419869] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419872] [drm:i915_pages_create_for_stolen] offset=0x17ed000, size=16384 [ 277.419903] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419905] [drm:i915_pages_create_for_stolen] offset=0x17f1000, size=16384 [ 277.420423] [drm:i915_gem_open] [ 277.420451] [drm:i915_gem_open] [ 277.420479] [drm:i915_gem_open] [ 277.420503] [drm:i915_gem_open] [ 277.420529] [drm:i915_gem_open] [ 277.420582] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.420586] [drm:i915_pages_create_for_stolen] offset=0x17f5000, size=16384 [ 277.420673] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.420684] [drm:i915_pages_create_for_stolen] offset=0x17f9000, size=16384 [ 277.420716] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.420719] [drm:i915_pages_create_for_stolen] offset=0x17fd000, size=16384 [ 277.420749] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.420752] [drm:i915_pages_create_for_stolen] offset=0x1801000, size=16384 [ 277.421294] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421297] [drm:i915_pages_create_for_stolen] offset=0x17d1000, size=16384 [ 277.421457] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421460] [drm:i915_pages_create_for_stolen] offset=0x17d5000, size=16384 [ 277.421554] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421557] [drm:i915_pages_create_for_stolen] offset=0x17d9000, size=16384 [ 277.421606] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421609] [drm:i915_pages_create_for_stolen] offset=0x17dd000, size=16384 [ 277.421652] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421655] [drm:i915_pages_create_for_stolen] offset=0x17e1000, size=16384 [ 277.421682] [drm:i915_gem_open] [ 277.421738] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421741] [drm:i915_pages_create_for_stolen] offset=0x17e5000, size=16384 [ 277.421817] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421820] [drm:i915_pages_create_for_stolen] offset=0x17e9000, size=16384 [ 277.421903] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421913] [drm:i915_pages_create_for_stolen] offset=0x17ed000, size=16384 [ 277.421943] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421953] [drm:i915_pages_create_for_stolen] offset=0x17f1000, size=16384 [ 286.833807] [drm:i915_capture_error_state] GPU HANG: ecode 8:1:0xe77ffff2, in gem_reset_stats [16915], reason: Hang on blitter ring, action: reset [ 286.833842] [drm:i915_reset_and_wakeup] resetting chip [ 286.833855] drm/i915: Resetting chip after gpu hang [ 286.833868] [drm:i915_gem_reset_engine] resetting blitter ring to restart from tail of request 0x7e [ 286.833898] [drm:gen8_init_common_ring] Execlists enabled for render ring [ 286.833902] [drm:gen8_init_common_ring] Execlists enabled for blitter ring [ 286.833907] [drm:gen8_init_common_ring] Execlists enabled for bsd ring [ 286.833910] [drm:gen8_init_common_ring] Execlists enabled for bsd2 ring [ 286.833913] [drm:gen8_init_common_ring] Execlists enabled for video enhancement ring [ 286.833916] [drm:intel_guc_setup] GuC fw status: path (null), fetch NONE, load NONE [ 286.835374] gem_reset_stats: exiting, ret=99 [ 286.835438] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835441] [drm:i915_pages_create_for_stolen] offset=0x1805000, size=16384 [ 286.835513] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835516] [drm:i915_pages_create_for_stolen] offset=0x1809000, size=16384 [ 286.835538] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835543] [drm:i915_pages_create_for_stolen] offset=0x180d000, size=16384 [ 286.835575] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835578] [drm:i915_pages_create_for_stolen] offset=0x1811000, size=16384 [ 286.835926] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835928] [drm:i915_pages_create_for_stolen] offset=0x17f5000, size=16384 If this requires the change on test then should one change the component to IGT instead of DRM/Intel? (In reply to Jari Tahvanainen from comment #8) > If this requires the change on test then should one change the component to > IGT instead of DRM/Intel? Hi, Antonio modified the test. https://patchwork.freedesktop.org/patch/161335/ Can you try with this igt patch? The following tests FAIL on SKL
Tests List:
igt@gem_reset_stats@reset-stats-bsd
igt@gem_reset_stats@reset-stats-default
igt@gem_reset_stats@reset-stats-render
======================================
Output Sample
======================================
...
(gem_reset_stats:3067) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:3067) igt-debugfs-DEBUG: Opening debugfs directory '/sys/kernel/debug/dri/0'
(gem_reset_stats:3067) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:229:
(gem_reset_stats:3067) CRITICAL: Failed assertion: noop(fd[i], 0, e) > 0
(gem_reset_stats:3067) igt-core-INFO: Stack trace:
(gem_reset_stats:3067) igt-core-INFO: #0 [__igt_fail_assert+0x101]
(gem_reset_stats:3067) igt-core-INFO: #1 [test_rs.constprop.8+0x206]
(gem_reset_stats:3067) igt-core-INFO: #2 [__real_main773+0x10b]
(gem_reset_stats:3067) igt-core-INFO: #3 [main+0x23]
(gem_reset_stats:3067) igt-core-INFO: #4 [__libc_start_main+0xf1]
(gem_reset_stats:3067) igt-core-INFO: #5 [_start+0x29]
(gem_reset_stats:3067) igt-core-INFO: #6 [<unknown>+0x29]
**** END ****
Stack trace:
#0 [__igt_fail_assert+0x101]
#1 [test_rs.constprop.8+0x206]
#2 [__real_main773+0x10b]
#3 [main+0x23]
#4 [__libc_start_main+0xf1]
#5 [_start+0x29]
#6 [<unknown>+0x29]
...
This is my configuration:
======================================
Graphic stack
======================================
Component: drm
tag: libdrm-2.4.81-53-gd55d080
commit: d55d0804f9e37637d7510f38f97e07a50c6b7baa
Component: cairo
tag: 1.15.6-30-gc29db4f
commit: c29db4f0de52727406a8b62f4da9a382084b5a79
Component: intel-gpu-tools
tag: intel-gpu-tools-1.19-244-g4a1c8da
commit: 4a1c8daff2005e2cbfe980d63bc0a0fb09cb017d
Component: piglit
tag: piglit-v1
commit: dab15137044c34023b1a843e72e781b03b17548b
======================================
Software
======================================
kernel version : 4.13.0-drm-tip-ww36-commit-7a827ab+
hostname : gfx-desktop
architecture : x86_64
os version : Ubuntu 16.10
os codename : yakkety
kernel driver : i915
bios revision : 5.6
bios release date : 09/29/2016
hardware acceleration : disabled
swap partition : enabled on (/dev/sda3)
======================================
Graphic drivers
======================================
libdrm : 2.4.83
cairo : 1.15.9
intel-gpu-tools (tag) : intel-gpu-tools-1.19-234-g490b386
intel-gpu-tools (commit) : 490b386
======================================
Hardware
======================================
platform : Skylake Canyon
motherboard id : NUC6i7KYB
form factor : Desktop
cpu family : Core i7
cpu family id : 6
cpu information : Intel(R) Core(TM) i7-6770HQ CPU @ 2.60GHz
gpu card : Intel Corporation Iris Pro Graphics 580 (rev 09) (prog-if 00 [VGA controller])
memory ram : 31.31 GB
max memory ram : 32 GB
cpu thread : 8
cpu core : 4
cpu model : 94
cpu stepping : 3
socket : Other
signature : Type 0, Family 6, Model 94, Stepping 3
hard drive : 223GiB (240GB)
current cd clock frequency : 337500 kHz
maximum cd clock frequency : 675000 kHz
displays connected : DP-1 DP-3
======================================
Firmware
======================================
dmc fw loaded : yes
dmc version : 1.26
guc fw loaded : SUCCESS
guc version wanted : 6.1
guc version found : 6.1
huc fw loaded : yes
======================================
kernel parameters
======================================
quiet splash drm.debug=0x1e i915.enable_guc_submission=2 i915.enable_guc_loading=2 i915.alpha_support=1
After February ww6 these tests aren't failing anymore and no hang have been reported in a while with these tests, though issue on bug 97129 still exist for SNB, IVB and BYT. Closing this bug, thank you. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.