Summary: | [BDW BYT SKL HSW IVB]Regression] [GPU Hang] with gem_reset_stats ban- and reset-stats- subtests | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Product: | DRI | Reporter: | cprigent <christophe.prigent> | ||||||||||
Component: | IGT | Assignee: | Default DRI bug account <dri-devel> | ||||||||||
Status: | CLOSED FIXED | QA Contact: | |||||||||||
Severity: | critical | ||||||||||||
Priority: | high | CC: | intel-gfx-bugs | ||||||||||
Version: | unspecified | ||||||||||||
Hardware: | x86-64 (AMD64) | ||||||||||||
OS: | Linux (All) | ||||||||||||
Whiteboard: | |||||||||||||
i915 platform: | BDW, BYT, HSW, IVB, SKL | i915 features: | GPU hang | ||||||||||
Attachments: |
|
Description
cprigent
2016-09-15 12:55:01 UTC
Created attachment 126549 [details]
BDW--gem_reset_stats--ban---kern.log
Tests were Pass with: Kernel: 4.8.0-rc4 9baa666 from http://cgit.freedesktop.org/drm-intel/ commit 9baa666b3e48f71b46c5f63541f57d2a95a1b1c0 Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Sat Sep 3 13:12:38 2016 +0100 drm-intel-nightly: 2016y-09m-03d-12h-12m-15s UTC integration manifest libdrm-2.4.70-6 4462303 from git://anongit.freedesktop.org/mesa/drm mesa: mesa-11.2.2 3a9f628from git://anongit.freedesktop.org/mesa/mesa cairo 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo xorg-server-1.18.0-546 deae9c7 from git://git.freedesktop.org/git/xorg/xserver xf86-video-intel 2.99.917-701 205146b from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel libva-1.7.0-47 2ebf897 from git://git.freedesktop.org/git/vaapi/libva vaapi-intel-driver: 1.7.0-95 1817bee from git://git.freedesktop.org/git/vaapi/intel-driver Intel-Gpu-Tools 1.16 6bd4208 from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git /sys/class/drm/card0/error is empty Created attachment 126550 [details]
BDW--gem_reset_stats-reset-stats----kern.log
Reproduced with reset-stats- subcases:
igt@gem_reset_stats@reset-stats-blt
igt@gem_reset_stats@reset-stats-bsd2
igt@gem_reset_stats@reset-stats-ctx-render
igt@gem_reset_stats@reset-stats-default
igt@gem_reset_stats@reset-stats-render
igt@gem_reset_stats@reset-stats-vebox
Created attachment 126551 [details]
BDW--gem_reset_stats-reset-stats----output
The tests require updating. Reproduced with fresh setup IGT 1.16 200237a and kernel 4.8-rc7 commit 0e34cb5b35f0f837219495c402073141481b1b90 Author: Jani Nikula <jani.nikula@intel.com> Date: Mon Sep 19 18:39:27 2016 +0300 drm-intel-nightly: 2016y-09m-19d-15h-38m-53s UTC integration manifest # ./gem_reset_stats --r reset-stats-blt IGT-Version: 1.16-g200237a (x86_64) (Linux: 4.8.0-rc7-nightly+ x86_64) 2:0 expected 'Pending' [2], found 'No error' [0] (gem_reset_stats:16915) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:242: (gem_reset_stats:16915) CRITICAL: Failed assertion: _assert_reset_status(i, fd[i], 0, (1 << 1)) == 0 Stack trace: #0 [__igt_fail_assert+0xf1] #1 [test_rs.constprop.9+0x158] #2 [__real_main781+0x593] #3 [main+0x29] #4 [__libc_start_main+0xf0] #5 [_start+0x29] #6 [<unknown>+0x29] Subtest reset-stats-blt failed. **** DEBUG **** (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) DEBUG: num fds=4, hang index=1 (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) ioctl-wrappers-DEBUG: Test requirement passed: gem_has_ring(fd, ring) (gem_reset_stats:16915) ioctl-wrappers-DEBUG: Test requirement passed: has_ban_period (gem_reset_stats:16915) igt-gt-DEBUG: Test requirement passed: has_gpu_reset(fd) (gem_reset_stats:16915) igt-gt-DEBUG: Test requirement passed: ctx == 0 || ring == I915_EXEC_RENDER (gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:16915) INFO: 2:0 expected 'Pending' [2], found 'No error' [0] (gem_reset_stats:16915) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:242: (gem_reset_stats:16915) CRITICAL: Failed assertion: _assert_reset_status(i, fd[i], 0, (1 << 1)) == 0 **** END **** Subtest reset-stats-blt: FAIL (9.416s) [ 277.418448] gem_reset_stats: executing [ 277.418613] [drm:i915_gem_open] [ 277.418748] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.418753] [drm:i915_pages_create_for_stolen] offset=0x17d1000, size=16384 [ 277.418923] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.418946] [drm:i915_pages_create_for_stolen] offset=0x17d5000, size=16384 [ 277.418976] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.418979] [drm:i915_pages_create_for_stolen] offset=0x17d9000, size=16384 [ 277.419003] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419006] [drm:i915_pages_create_for_stolen] offset=0x17dd000, size=16384 [ 277.419249] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419252] [drm:i915_pages_create_for_stolen] offset=0x17e1000, size=16384 [ 277.419478] [drm:i915_gem_open] [ 277.419614] gem_reset_stats: starting subtest reset-stats-blt [ 277.419638] [drm:i915_gem_open] [ 277.419714] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419720] [drm:i915_pages_create_for_stolen] offset=0x17e5000, size=16384 [ 277.419830] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419834] [drm:i915_pages_create_for_stolen] offset=0x17e9000, size=16384 [ 277.419869] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419872] [drm:i915_pages_create_for_stolen] offset=0x17ed000, size=16384 [ 277.419903] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.419905] [drm:i915_pages_create_for_stolen] offset=0x17f1000, size=16384 [ 277.420423] [drm:i915_gem_open] [ 277.420451] [drm:i915_gem_open] [ 277.420479] [drm:i915_gem_open] [ 277.420503] [drm:i915_gem_open] [ 277.420529] [drm:i915_gem_open] [ 277.420582] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.420586] [drm:i915_pages_create_for_stolen] offset=0x17f5000, size=16384 [ 277.420673] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.420684] [drm:i915_pages_create_for_stolen] offset=0x17f9000, size=16384 [ 277.420716] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.420719] [drm:i915_pages_create_for_stolen] offset=0x17fd000, size=16384 [ 277.420749] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.420752] [drm:i915_pages_create_for_stolen] offset=0x1801000, size=16384 [ 277.421294] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421297] [drm:i915_pages_create_for_stolen] offset=0x17d1000, size=16384 [ 277.421457] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421460] [drm:i915_pages_create_for_stolen] offset=0x17d5000, size=16384 [ 277.421554] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421557] [drm:i915_pages_create_for_stolen] offset=0x17d9000, size=16384 [ 277.421606] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421609] [drm:i915_pages_create_for_stolen] offset=0x17dd000, size=16384 [ 277.421652] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421655] [drm:i915_pages_create_for_stolen] offset=0x17e1000, size=16384 [ 277.421682] [drm:i915_gem_open] [ 277.421738] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421741] [drm:i915_pages_create_for_stolen] offset=0x17e5000, size=16384 [ 277.421817] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421820] [drm:i915_pages_create_for_stolen] offset=0x17e9000, size=16384 [ 277.421903] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421913] [drm:i915_pages_create_for_stolen] offset=0x17ed000, size=16384 [ 277.421943] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 277.421953] [drm:i915_pages_create_for_stolen] offset=0x17f1000, size=16384 [ 286.833807] [drm:i915_capture_error_state] GPU HANG: ecode 8:1:0xe77ffff2, in gem_reset_stats [16915], reason: Hang on blitter ring, action: reset [ 286.833842] [drm:i915_reset_and_wakeup] resetting chip [ 286.833855] drm/i915: Resetting chip after gpu hang [ 286.833868] [drm:i915_gem_reset_engine] resetting blitter ring to restart from tail of request 0x7e [ 286.833898] [drm:gen8_init_common_ring] Execlists enabled for render ring [ 286.833902] [drm:gen8_init_common_ring] Execlists enabled for blitter ring [ 286.833907] [drm:gen8_init_common_ring] Execlists enabled for bsd ring [ 286.833910] [drm:gen8_init_common_ring] Execlists enabled for bsd2 ring [ 286.833913] [drm:gen8_init_common_ring] Execlists enabled for video enhancement ring [ 286.833916] [drm:intel_guc_setup] GuC fw status: path (null), fetch NONE, load NONE [ 286.835374] gem_reset_stats: exiting, ret=99 [ 286.835438] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835441] [drm:i915_pages_create_for_stolen] offset=0x1805000, size=16384 [ 286.835513] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835516] [drm:i915_pages_create_for_stolen] offset=0x1809000, size=16384 [ 286.835538] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835543] [drm:i915_pages_create_for_stolen] offset=0x180d000, size=16384 [ 286.835575] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835578] [drm:i915_pages_create_for_stolen] offset=0x1811000, size=16384 [ 286.835926] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000 [ 286.835928] [drm:i915_pages_create_for_stolen] offset=0x17f5000, size=16384 If this requires the change on test then should one change the component to IGT instead of DRM/Intel? (In reply to Jari Tahvanainen from comment #8) > If this requires the change on test then should one change the component to > IGT instead of DRM/Intel? Hi, Antonio modified the test. https://patchwork.freedesktop.org/patch/161335/ Can you try with this igt patch? The following tests FAIL on SKL Tests List: igt@gem_reset_stats@reset-stats-bsd igt@gem_reset_stats@reset-stats-default igt@gem_reset_stats@reset-stats-render ====================================== Output Sample ====================================== ... (gem_reset_stats:3067) drmtest-DEBUG: Test requirement passed: !(fd<0) (gem_reset_stats:3067) igt-debugfs-DEBUG: Opening debugfs directory '/sys/kernel/debug/dri/0' (gem_reset_stats:3067) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:229: (gem_reset_stats:3067) CRITICAL: Failed assertion: noop(fd[i], 0, e) > 0 (gem_reset_stats:3067) igt-core-INFO: Stack trace: (gem_reset_stats:3067) igt-core-INFO: #0 [__igt_fail_assert+0x101] (gem_reset_stats:3067) igt-core-INFO: #1 [test_rs.constprop.8+0x206] (gem_reset_stats:3067) igt-core-INFO: #2 [__real_main773+0x10b] (gem_reset_stats:3067) igt-core-INFO: #3 [main+0x23] (gem_reset_stats:3067) igt-core-INFO: #4 [__libc_start_main+0xf1] (gem_reset_stats:3067) igt-core-INFO: #5 [_start+0x29] (gem_reset_stats:3067) igt-core-INFO: #6 [<unknown>+0x29] **** END **** Stack trace: #0 [__igt_fail_assert+0x101] #1 [test_rs.constprop.8+0x206] #2 [__real_main773+0x10b] #3 [main+0x23] #4 [__libc_start_main+0xf1] #5 [_start+0x29] #6 [<unknown>+0x29] ... This is my configuration: ====================================== Graphic stack ====================================== Component: drm tag: libdrm-2.4.81-53-gd55d080 commit: d55d0804f9e37637d7510f38f97e07a50c6b7baa Component: cairo tag: 1.15.6-30-gc29db4f commit: c29db4f0de52727406a8b62f4da9a382084b5a79 Component: intel-gpu-tools tag: intel-gpu-tools-1.19-244-g4a1c8da commit: 4a1c8daff2005e2cbfe980d63bc0a0fb09cb017d Component: piglit tag: piglit-v1 commit: dab15137044c34023b1a843e72e781b03b17548b ====================================== Software ====================================== kernel version : 4.13.0-drm-tip-ww36-commit-7a827ab+ hostname : gfx-desktop architecture : x86_64 os version : Ubuntu 16.10 os codename : yakkety kernel driver : i915 bios revision : 5.6 bios release date : 09/29/2016 hardware acceleration : disabled swap partition : enabled on (/dev/sda3) ====================================== Graphic drivers ====================================== libdrm : 2.4.83 cairo : 1.15.9 intel-gpu-tools (tag) : intel-gpu-tools-1.19-234-g490b386 intel-gpu-tools (commit) : 490b386 ====================================== Hardware ====================================== platform : Skylake Canyon motherboard id : NUC6i7KYB form factor : Desktop cpu family : Core i7 cpu family id : 6 cpu information : Intel(R) Core(TM) i7-6770HQ CPU @ 2.60GHz gpu card : Intel Corporation Iris Pro Graphics 580 (rev 09) (prog-if 00 [VGA controller]) memory ram : 31.31 GB max memory ram : 32 GB cpu thread : 8 cpu core : 4 cpu model : 94 cpu stepping : 3 socket : Other signature : Type 0, Family 6, Model 94, Stepping 3 hard drive : 223GiB (240GB) current cd clock frequency : 337500 kHz maximum cd clock frequency : 675000 kHz displays connected : DP-1 DP-3 ====================================== Firmware ====================================== dmc fw loaded : yes dmc version : 1.26 guc fw loaded : SUCCESS guc version wanted : 6.1 guc version found : 6.1 huc fw loaded : yes ====================================== kernel parameters ====================================== quiet splash drm.debug=0x1e i915.enable_guc_submission=2 i915.enable_guc_loading=2 i915.alpha_support=1 After February ww6 these tests aren't failing anymore and no hang have been reported in a while with these tests, though issue on bug 97129 still exist for SNB, IVB and BYT. Closing this bug, thank you. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.