Bug 97076 - [BDW SKL IVB BYT HSW SNB] [Basic] GPU Hang with prime_busy basic tests
Summary: [BDW SKL IVB BYT HSW SNB] [Basic] GPU Hang with prime_busy basic tests
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: highest blocker
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
: 96751 (view as bug list)
Depends on:
Blocks:
 
Reported: 2016-07-25 15:12 UTC by cprigent
Modified: 2017-01-04 05:00 UTC (History)
3 users (show)

See Also:
i915 platform: BDW, BYT, HSW, IVB, SKL, SNB
i915 features: GEM/Other


Attachments
error (208.80 KB, text/plain)
2016-07-25 15:12 UTC, cprigent
no flags Details
SKLY-prime_busy-basic_kern.log (285.80 KB, text/plain)
2016-07-25 15:12 UTC, cprigent
no flags Details
SKLY-prime_busy-basic_output (6.83 KB, text/plain)
2016-07-25 15:13 UTC, cprigent
no flags Details
BDW-error (208.79 KB, text/plain)
2016-07-26 14:12 UTC, cprigent
no flags Details
BDW_prime-busy-basic_output (1.62 KB, text/plain)
2016-07-26 14:12 UTC, cprigent
no flags Details
BDW-prime_busy-basic-kern.log (85.23 KB, text/plain)
2016-07-26 14:12 UTC, cprigent
no flags Details

Description cprigent 2016-07-25 15:12:34 UTC
Created attachment 125312 [details]
error

Platform: NUC6i3SYB
CPU: Intel(R) Core(TM) i3-6100U CPU @ 2.30GHZ (family 6, model 78, stepping 3)
Motherboard version: H81132-502
GPU: Intel® HD Graphics 520 - Intel Corporation Sky Lake Integrated Graphics (rev 07)
Memory: one 8GB card Kingston KVR21S15D8/8
SSD: Samsung 850 EVO M.2 120 Go
Software
Bios: SYSKLi35.86A.0045
Linux distribution: Ubuntu 16.04 64 bits
Kernel: 4.7.0-rc7 7eeb04a from http://cgit.freedesktop.org/drm-intel/
   commit 7eeb04a101316645916d4d9df058a9341797f1af
   Author: Chris Wilson <chris@chris-wilson.co.uk>
   Date:   Sun Jul 24 11:00:31 2016 +0100
   drm-intel-nightly: 2016y-07m-24d-09h-59m-54s UTC integration manifest
drm: libdrm-2.4.70 0caa84c from git://anongit.freedesktop.org/mesa/drm
mesa: mesa-11.2.2 3a9f628from git://anongit.freedesktop.org/mesa/mesa
cairo: 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo
xserver: xorg-server-1.18.0-497 0b2f308 from git://git.freedesktop.org/git/xorg/xserver
xf86-video-intel: 2.99.917-687 6988b87 from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel
libva: libva-1.7.0-26 c36971c from git://git.freedesktop.org/git/vaapi/libva
vaapi-intel-driver: 1.7.0-58 e554446 from git://git.freedesktop.org/git/vaapi/intel-driver
DMC 1.26 from https://01.org/sites/default/files/downloads/intelr-graphics-linux/skldmcver126.tar_1.bz2
GUC 6.1 from https://01.org/sites/default/files/downloads/intelr-graphics-linux/sklgucver61.tar.bz2 
Intel-Gpu-Tools 1.15-140 e3abb20 from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git

Steps:
------
1. Execute IGT tests:
./prime_busy --r basic-after-default
./prime_busy --r basic-before-default
./prime_busy --r basic-wait-after-default
./prime_busy --r basic-wait-before-default

Actual results:
---------------
1. Tests are fail. Log shows a GPU Hang.

Expected result:
-----------------
1. Tests are fail or skip. No GPU Hang.
Comment 1 cprigent 2016-07-25 15:12:57 UTC
Created attachment 125313 [details]
SKLY-prime_busy-basic_kern.log
Comment 2 cprigent 2016-07-25 15:13:12 UTC
Created attachment 125314 [details]
SKLY-prime_busy-basic_output
Comment 3 cprigent 2016-07-26 14:12:18 UTC
Created attachment 125338 [details]
BDW-error

Reproduced on BDW

Platform: NUC6i3SYB
CPU: Intel(R) Core(TM) i3-5010U CPU @ 2.10GHz (family 6, model 78, stepping 3)
Motherboard version: H41000-503
GPU: Intel® HD Graphics 520 - Intel Corporation Sky Lake Integrated Graphics (rev 07)
Memory: 2 Crucial 4GB DDR3L-1600 SODIMM - CT51264BF160B
SSD: INTEL SSDSC2BW48 480 Go
Software
Bios: RYBDWi35.86A.0350.2015.0812.1722
Kernel: 4.7.0-rc7 7eeb04a from http://cgit.freedesktop.org/drm-intel/
  commit 7eeb04a101316645916d4d9df058a9341797f1af
  Author: Chris Wilson <chris@chris-wilson.co.uk>
  Date:   Sun Jul 24 11:00:31 2016 +0100
  drm-intel-nightly: 2016y-07m-24d-09h-59m-54s UTC integration manifest
drm: libdrm-2.4.70 0caa84c from git://anongit.freedesktop.org/mesa/drm
mesa: mesa-11.2.2 3a9f628from git://anongit.freedesktop.org/mesa/mesa
cairo: 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo
xserver: xorg-server-1.18.0-497 0b2f308 from git://git.freedesktop.org/git/xorg/xserver
xf86-video-intel: 2.99.917-687 6988b87 from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel
libva: libva-1.7.0-26 c36971c from git://git.freedesktop.org/git/vaapi/libva
vaapi-intel-driver: 1.7.0-58 e554446 from git://git.freedesktop.org/git/vaapi/intel-driver
Intel-Gpu-Tools 1.15-140 e3abb20 from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git
Comment 4 cprigent 2016-07-26 14:12:38 UTC
Created attachment 125339 [details]
BDW_prime-busy-basic_output
Comment 5 cprigent 2016-07-26 14:12:56 UTC
Created attachment 125340 [details]
BDW-prime_busy-basic-kern.log
Comment 6 maria guadalupe 2016-07-27 16:17:23 UTC
these issue also happen on ivb

steps to reproduced 
============================================
./prime_busy --r basic-after-default
./prime_busy --r basic-before-default
./prime_busy --r basic-wait-after-default
./prime_busy --r basic-wait-before-default

Actual results:
---------------
1. Tests are fail


Software information
============================================
Kernel version                  : 4.7.0-rc7-nightly+
Linux distribution              : Ubuntu 16.04.1 LTS
Architecture                    : 64-bit
xf86-video-intel version        : 2.99.917
Xorg-Xserver version            : 1.18.3
Intel GPU Tools version         : 1.14-1 <-- This is a generic version of IGT
Kernel driver in use            : i915
Hardware acceleration           :
Bios revision                   : 4.6

 Hardware information
============================================
Platform                        :
Motherboard model               :
Motherboard type                : DH77EB Desktop
Motherboard manufacturer        :
CPU family                      : Core i7
CPU information                 : Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz
GPU Card                        : Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor Graphics Controller (rev 09) (prog-if 00 [VGA controller])
Memory ram                      : 8 GB
Maximum memory ram allowed      : 32 GB
Display resolution              :
CPU's number                    : 8
Hard drive Capacity             : 250 GB
Comment 7 yann 2016-08-03 16:00:58 UTC
From these error dumps (same symptom), hung is happening in batch buffer submitted by prime_busy with IPEHR 0x10000002 (MI_STORE_DATA_IMM)

for both SKL and BDW IOMMU is not enable but we have DMC 1.26 in SKL and hung appears quicker on this system.

Here is extract from BDW dump:
ACTHD: 0x00000000 ffffc554
IPEHR: 0x10000002
INSTDONE: 0xffd7ffff
    busy: CS
    busy: GAM

and extract from batch buffer around active head 0xffffc554
...
0xffffc540:      0x10000002: MI_STORE_DATA_IMM
0xffffc544:      0xffffed50:    dword 1
0xffffc548:      0x00000000:    dword 2
0xffffc54c:      0x00000354:    dword 3
0xffffc550:      0x10000002: MI_STORE_DATA_IMM
0xffffc554:      0xffffed54:    dword 1
0xffffc558:      0x00000000:    dword 2
0xffffc55c:      0x00000355:    dword 3
Comment 8 Chris Wilson 2016-08-04 19:41:25 UTC
commit ad778f8967ea2f0bfda02701f918bcfcd495b721
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Thu Aug 4 16:32:42 2016 +0100

    drm/i915: Export our request as a dma-buf fence on the reservation object
    
    If the GEM objects being rendered with in this request have been
    exported via dma-buf to a third party, hook ourselves into the dma-buf
    reservation object so that the third party can serialise with our
    rendering via the dma-buf fences.
    
    Testcase: igt/prime_busy
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>
    Link: http://patchwork.freedesktop.org/patch/msgid/1470324762-2545-26-git-send-email-chris@chris-wilson.co.uk
Comment 9 Chris Wilson 2016-08-04 19:41:47 UTC
*** Bug 96751 has been marked as a duplicate of this bug. ***


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.