Bug 97820 - [BDW BYT SKL HSW IVB]Regression] [GPU Hang] with gem_reset_stats ban- and reset-stats- subtests
Summary: [BDW BYT SKL HSW IVB]Regression] [GPU Hang] with gem_reset_stats ban- and res...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: IGT (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: high critical
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-09-15 12:55 UTC by cprigent
Modified: 2018-02-27 15:49 UTC (History)
1 user (show)

See Also:
i915 platform: BDW, BYT, HSW, IVB, SKL
i915 features: GPU hang


Attachments
BDW--gem_reset_stats--ban---output (15.78 KB, text/plain)
2016-09-15 12:55 UTC, cprigent
no flags Details
BDW--gem_reset_stats--ban---kern.log (320.99 KB, text/plain)
2016-09-15 12:55 UTC, cprigent
no flags Details
BDW--gem_reset_stats-reset-stats----kern.log (316.93 KB, text/plain)
2016-09-15 13:20 UTC, cprigent
no flags Details
BDW--gem_reset_stats-reset-stats----output (11.95 KB, text/plain)
2016-09-15 13:21 UTC, cprigent
no flags Details

Description cprigent 2016-09-15 12:55:01 UTC
Created attachment 126548 [details]
BDW--gem_reset_stats--ban---output

Platform: NUC5i7RYB
CPU: Intel(R) Core(TM) i7-5557U CPU @ 3.10GHz (Family 6, Model 61, Stepping 4)
Motherboard version: H73774-102
GPU: Intel® Iris™ Graphics 6100 - Intel Corporation Broadwell-U Integrated Graphics (rev 09)
Memory: two 4GB card Kingston 99U5469-045.A00LF
SSD: INTEL SSDSC2KW24

Software
Bios: RYBDWi35.86A.0358.2016.0606.1423 from https://downloadcenter.intel.com/downloads/eula/26081/BIOS-Update-RYBDWi35-86A-?httpDown=https%3A%2F%2Fdownloadmirror.intel.com%2F26081%2Feng%2FRY0358.bio
Linux distribution: Ubuntu 16.04 64 bits
Kernel: 4.8.0-rc5 bef9c1f from http://cgit.freedesktop.org/drm-intel/
  commit bef9c1f4afe24cfff578d386bde349add65673eb
  Author: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
  Date:   Mon Sep 12 11:35:34 2016 +0300
  drm-intel-nightly: 2016y-09m-12d-08h-35m-02s UTC integration manifest
libdrm-2.4.70-12 2d00869 from git://anongit.freedesktop.org/mesa/drm
mesa: mesa-11.2.2 3a9f628 from git://anongit.freedesktop.org/mesa/mesa
cairo 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo
xorg-server-1.18.0-549 527c6ba from git://git.freedesktop.org/git/xorg/xserver
xf86-video-intel 2.99.917-703 15c5ff1 from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel
libva-1.7.0-47 2ebf897 from git://git.freedesktop.org/git/vaapi/libva 
vaapi-intel-driver: 1.7.0-117 8c11f51 from git://git.freedesktop.org/git/vaapi/intel-driver
Intel-Gpu-Tools 1.16 f565b6c from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git

Steps:
------
1. Execute gem_reset_stats ban subtests
igt@gem_reset_stats@ban-blt
igt@gem_reset_stats@ban-bsd1
igt@gem_reset_stats@ban-bsd2
igt@gem_reset_stats@ban-ctx-render
igt@gem_reset_stats@ban-default
igt@gem_reset_stats@ban-render
igt@gem_reset_stats@ban-vebox 

Actual result:
--------------
1. Subtests are fail

Expected result:
----------------
1. Subtests are Pass
Comment 1 cprigent 2016-09-15 12:55:19 UTC
Created attachment 126549 [details]
BDW--gem_reset_stats--ban---kern.log
Comment 2 cprigent 2016-09-15 12:55:44 UTC
Tests were Pass with:
Kernel: 4.8.0-rc4 9baa666 from http://cgit.freedesktop.org/drm-intel/
  commit 9baa666b3e48f71b46c5f63541f57d2a95a1b1c0
  Author: Chris Wilson <chris@chris-wilson.co.uk>
  Date:   Sat Sep 3 13:12:38 2016 +0100
  drm-intel-nightly: 2016y-09m-03d-12h-12m-15s UTC integration manifest
libdrm-2.4.70-6 4462303 from git://anongit.freedesktop.org/mesa/drm
mesa: mesa-11.2.2 3a9f628from git://anongit.freedesktop.org/mesa/mesa
cairo 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo
xorg-server-1.18.0-546 deae9c7 from git://git.freedesktop.org/git/xorg/xserver
xf86-video-intel 2.99.917-701 205146b from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel
libva-1.7.0-47 2ebf897 from git://git.freedesktop.org/git/vaapi/libva 
vaapi-intel-driver: 1.7.0-95 1817bee from git://git.freedesktop.org/git/vaapi/intel-driver
Intel-Gpu-Tools 1.16 6bd4208 from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git
Comment 3 cprigent 2016-09-15 13:04:52 UTC
/sys/class/drm/card0/error is empty
Comment 4 cprigent 2016-09-15 13:20:52 UTC
Created attachment 126550 [details]
BDW--gem_reset_stats-reset-stats----kern.log

Reproduced with reset-stats- subcases:
igt@gem_reset_stats@reset-stats-blt
igt@gem_reset_stats@reset-stats-bsd2
igt@gem_reset_stats@reset-stats-ctx-render
igt@gem_reset_stats@reset-stats-default
igt@gem_reset_stats@reset-stats-render
igt@gem_reset_stats@reset-stats-vebox
Comment 5 cprigent 2016-09-15 13:21:09 UTC
Created attachment 126551 [details]
BDW--gem_reset_stats-reset-stats----output
Comment 6 Chris Wilson 2016-09-15 13:24:25 UTC
The tests require updating.
Comment 7 cprigent 2016-09-19 16:41:24 UTC
Reproduced with fresh setup
IGT 1.16 200237a and kernel 4.8-rc7 
  commit 0e34cb5b35f0f837219495c402073141481b1b90
  Author: Jani Nikula <jani.nikula@intel.com>
  Date:   Mon Sep 19 18:39:27 2016 +0300
  drm-intel-nightly: 2016y-09m-19d-15h-38m-53s UTC integration manifest

# ./gem_reset_stats --r reset-stats-blt
IGT-Version: 1.16-g200237a (x86_64) (Linux: 4.8.0-rc7-nightly+ x86_64)
2:0 expected 'Pending' [2], found 'No error' [0]
(gem_reset_stats:16915) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:242:
(gem_reset_stats:16915) CRITICAL: Failed assertion: _assert_reset_status(i, fd[i], 0, (1 << 1)) == 0
Stack trace:
  #0 [__igt_fail_assert+0xf1]
  #1 [test_rs.constprop.9+0x158]
  #2 [__real_main781+0x593]
  #3 [main+0x29]
  #4 [__libc_start_main+0xf0]
  #5 [_start+0x29]
  #6 [<unknown>+0x29]
Subtest reset-stats-blt failed.
**** DEBUG ****
(gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:16915) DEBUG: num fds=4, hang index=1
(gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:16915) ioctl-wrappers-DEBUG: Test requirement passed: gem_has_ring(fd, ring)
(gem_reset_stats:16915) ioctl-wrappers-DEBUG: Test requirement passed: has_ban_period
(gem_reset_stats:16915) igt-gt-DEBUG: Test requirement passed: has_gpu_reset(fd)
(gem_reset_stats:16915) igt-gt-DEBUG: Test requirement passed: ctx == 0 || ring == I915_EXEC_RENDER
(gem_reset_stats:16915) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:16915) INFO: 2:0 expected 'Pending' [2], found 'No error' [0]
(gem_reset_stats:16915) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:242:
(gem_reset_stats:16915) CRITICAL: Failed assertion: _assert_reset_status(i, fd[i], 0, (1 << 1)) == 0
****  END  ****
Subtest reset-stats-blt: FAIL (9.416s)



[  277.418448] gem_reset_stats: executing
[  277.418613] [drm:i915_gem_open]
[  277.418748] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.418753] [drm:i915_pages_create_for_stolen] offset=0x17d1000, size=16384
[  277.418923] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.418946] [drm:i915_pages_create_for_stolen] offset=0x17d5000, size=16384
[  277.418976] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.418979] [drm:i915_pages_create_for_stolen] offset=0x17d9000, size=16384
[  277.419003] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.419006] [drm:i915_pages_create_for_stolen] offset=0x17dd000, size=16384
[  277.419249] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.419252] [drm:i915_pages_create_for_stolen] offset=0x17e1000, size=16384
[  277.419478] [drm:i915_gem_open]
[  277.419614] gem_reset_stats: starting subtest reset-stats-blt
[  277.419638] [drm:i915_gem_open]
[  277.419714] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.419720] [drm:i915_pages_create_for_stolen] offset=0x17e5000, size=16384
[  277.419830] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.419834] [drm:i915_pages_create_for_stolen] offset=0x17e9000, size=16384
[  277.419869] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.419872] [drm:i915_pages_create_for_stolen] offset=0x17ed000, size=16384
[  277.419903] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.419905] [drm:i915_pages_create_for_stolen] offset=0x17f1000, size=16384
[  277.420423] [drm:i915_gem_open]
[  277.420451] [drm:i915_gem_open]
[  277.420479] [drm:i915_gem_open]
[  277.420503] [drm:i915_gem_open]
[  277.420529] [drm:i915_gem_open]
[  277.420582] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.420586] [drm:i915_pages_create_for_stolen] offset=0x17f5000, size=16384
[  277.420673] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.420684] [drm:i915_pages_create_for_stolen] offset=0x17f9000, size=16384
[  277.420716] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.420719] [drm:i915_pages_create_for_stolen] offset=0x17fd000, size=16384
[  277.420749] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.420752] [drm:i915_pages_create_for_stolen] offset=0x1801000, size=16384
[  277.421294] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421297] [drm:i915_pages_create_for_stolen] offset=0x17d1000, size=16384
[  277.421457] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421460] [drm:i915_pages_create_for_stolen] offset=0x17d5000, size=16384
[  277.421554] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421557] [drm:i915_pages_create_for_stolen] offset=0x17d9000, size=16384
[  277.421606] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421609] [drm:i915_pages_create_for_stolen] offset=0x17dd000, size=16384
[  277.421652] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421655] [drm:i915_pages_create_for_stolen] offset=0x17e1000, size=16384
[  277.421682] [drm:i915_gem_open]
[  277.421738] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421741] [drm:i915_pages_create_for_stolen] offset=0x17e5000, size=16384
[  277.421817] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421820] [drm:i915_pages_create_for_stolen] offset=0x17e9000, size=16384
[  277.421903] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421913] [drm:i915_pages_create_for_stolen] offset=0x17ed000, size=16384
[  277.421943] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  277.421953] [drm:i915_pages_create_for_stolen] offset=0x17f1000, size=16384
[  286.833807] [drm:i915_capture_error_state] GPU HANG: ecode 8:1:0xe77ffff2, in gem_reset_stats [16915], reason: Hang on blitter ring, action: reset
[  286.833842] [drm:i915_reset_and_wakeup] resetting chip
[  286.833855] drm/i915: Resetting chip after gpu hang
[  286.833868] [drm:i915_gem_reset_engine] resetting blitter ring to restart from tail of request 0x7e
[  286.833898] [drm:gen8_init_common_ring] Execlists enabled for render ring
[  286.833902] [drm:gen8_init_common_ring] Execlists enabled for blitter ring
[  286.833907] [drm:gen8_init_common_ring] Execlists enabled for bsd ring
[  286.833910] [drm:gen8_init_common_ring] Execlists enabled for bsd2 ring
[  286.833913] [drm:gen8_init_common_ring] Execlists enabled for video enhancement ring
[  286.833916] [drm:intel_guc_setup] GuC fw status: path (null), fetch NONE, load NONE
[  286.835374] gem_reset_stats: exiting, ret=99
[  286.835438] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  286.835441] [drm:i915_pages_create_for_stolen] offset=0x1805000, size=16384
[  286.835513] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  286.835516] [drm:i915_pages_create_for_stolen] offset=0x1809000, size=16384
[  286.835538] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  286.835543] [drm:i915_pages_create_for_stolen] offset=0x180d000, size=16384
[  286.835575] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  286.835578] [drm:i915_pages_create_for_stolen] offset=0x1811000, size=16384
[  286.835926] [drm:i915_gem_object_create_stolen] creating stolen object: size=4000
[  286.835928] [drm:i915_pages_create_for_stolen] offset=0x17f5000, size=16384
Comment 8 Jari Tahvanainen 2016-10-11 13:01:26 UTC
If this requires the change on test then should one change the component to IGT instead of DRM/Intel?
Comment 9 Michel Thierry 2017-06-16 17:44:59 UTC
(In reply to Jari Tahvanainen from comment #8)
> If this requires the change on test then should one change the component to
> IGT instead of DRM/Intel?

Hi,

Antonio modified the test. https://patchwork.freedesktop.org/patch/161335/
Can you try with this igt patch?
Comment 10 Hector Velazquez 2017-09-07 16:38:04 UTC
The following tests FAIL on SKL 

Tests List:

igt@gem_reset_stats@reset-stats-bsd
igt@gem_reset_stats@reset-stats-default
igt@gem_reset_stats@reset-stats-render


======================================
        Output Sample
======================================
...
(gem_reset_stats:3067) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_reset_stats:3067) igt-debugfs-DEBUG: Opening debugfs directory '/sys/kernel/debug/dri/0'
(gem_reset_stats:3067) CRITICAL: Test assertion failure function test_rs, file gem_reset_stats.c:229:
(gem_reset_stats:3067) CRITICAL: Failed assertion: noop(fd[i], 0, e) > 0
(gem_reset_stats:3067) igt-core-INFO: Stack trace:
(gem_reset_stats:3067) igt-core-INFO:   #0 [__igt_fail_assert+0x101]
(gem_reset_stats:3067) igt-core-INFO:   #1 [test_rs.constprop.8+0x206]
(gem_reset_stats:3067) igt-core-INFO:   #2 [__real_main773+0x10b]
(gem_reset_stats:3067) igt-core-INFO:   #3 [main+0x23]
(gem_reset_stats:3067) igt-core-INFO:   #4 [__libc_start_main+0xf1]
(gem_reset_stats:3067) igt-core-INFO:   #5 [_start+0x29]
(gem_reset_stats:3067) igt-core-INFO:   #6 [<unknown>+0x29]
****  END  ****
Stack trace:
  #0 [__igt_fail_assert+0x101]
  #1 [test_rs.constprop.8+0x206]
  #2 [__real_main773+0x10b]
  #3 [main+0x23]
  #4 [__libc_start_main+0xf1]
  #5 [_start+0x29]
  #6 [<unknown>+0x29]
...

This is my configuration:

======================================
        Graphic stack
======================================

Component: drm
    tag: libdrm-2.4.81-53-gd55d080
    commit: d55d0804f9e37637d7510f38f97e07a50c6b7baa

Component: cairo
    tag: 1.15.6-30-gc29db4f
    commit: c29db4f0de52727406a8b62f4da9a382084b5a79

Component: intel-gpu-tools
    tag: intel-gpu-tools-1.19-244-g4a1c8da
    commit: 4a1c8daff2005e2cbfe980d63bc0a0fb09cb017d

Component: piglit
    tag: piglit-v1
    commit: dab15137044c34023b1a843e72e781b03b17548b

======================================
	     Software
======================================
kernel version              : 4.13.0-drm-tip-ww36-commit-7a827ab+
hostname                    : gfx-desktop
architecture                : x86_64
os version                  : Ubuntu 16.10
os codename                 : yakkety
kernel driver               : i915
bios revision               : 5.6
bios release date           : 09/29/2016
hardware acceleration       : disabled
swap partition              : enabled on (/dev/sda3)

======================================
	Graphic drivers
======================================
libdrm                      : 2.4.83
cairo                       : 1.15.9
intel-gpu-tools (tag)       : intel-gpu-tools-1.19-234-g490b386
intel-gpu-tools (commit)    : 490b386

======================================
	     Hardware
======================================
platform                   : Skylake Canyon
motherboard id             : NUC6i7KYB
form factor                : Desktop
cpu family                 : Core i7
cpu family id              : 6
cpu information            : Intel(R) Core(TM) i7-6770HQ CPU @ 2.60GHz
gpu card                   : Intel Corporation Iris Pro Graphics 580 (rev 09) (prog-if 00 [VGA controller])
memory ram                 : 31.31 GB
max memory ram             : 32 GB
cpu thread                 : 8
cpu core                   : 4
cpu model                  : 94
cpu stepping               : 3
socket                     : Other
signature                  : Type 0, Family 6, Model 94, Stepping 3
hard drive                 : 223GiB (240GB)
current cd clock frequency : 337500 kHz
maximum cd clock frequency : 675000 kHz
displays connected         : DP-1 DP-3

======================================
	     Firmware
======================================
dmc fw loaded             : yes
dmc version               : 1.26
guc fw loaded             : SUCCESS
guc version wanted        : 6.1
guc version found         : 6.1
huc fw loaded             : yes

======================================
	     kernel parameters
======================================
quiet splash drm.debug=0x1e i915.enable_guc_submission=2 i915.enable_guc_loading=2 i915.alpha_support=1
Comment 11 Elizabeth 2018-02-27 15:49:33 UTC
After February ww6 these tests aren't failing anymore and no hang have been reported in a while with these tests, though issue on bug 97129 still exist for SNB, IVB and BYT. Closing this bug, thank you.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.