Bug 89051

Summary: [SNB IVB BSW SKL BXT KBL] igt / gem_reloc_vs_gpu / faulting-reloc-interruptible-hang fail
Product: DRI
Reporter: Ding Heng <hengx.ding>
Component: DRM/Intel
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: CLOSED FIXED
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal
Priority: medium
CC: christophe.prigent, intel-gfx-bugs, ramix.ben.hassine
Version: DRI git
Hardware: Other
OS: All
Whiteboard:
i915 platform: BSW/CHT, IVB, KBL, SKL, SNB
i915 features: GEM/Other
Attachments (Description, Flags):
dmesg, none
bsw-gem_reloc_vs_gpu-faulting-reloc-interruptible-hang-output, none
BSW-gem_reloc_vs_gpu-faulting-reloc-interruptible-hang_kern.log, none

Description Ding Heng 2015-02-10 03:16:42 UTC
Created attachment 113292
dmesg

==System Environment==
--------------------------
Regression: No, this is a new issue.

Non-working platforms: SNB
==kernel==
--------------------------
drm-intel-nightly-2015-02-10/ad95125eaef18eebb9f47261ce3c99957f5953de

==Bug detailed description==
./gem_reloc_vs_gpu  --run-subtest faulting-reloc-interruptible-hang
IGT-Version: 1.9-gbef692d (x86_64) (Linux: 3.19.0_drm-intel-nightly_ad9512_20150210+ x86_64)
(gem_reloc_vs_gpu:4073) CRITICAL: Test assertion failure function do_test, file gem_reloc_vs_gpu.c:240:
(gem_reloc_vs_gpu:4073) CRITICAL: Failed assertion: test == 0xdeadbeef
(gem_reloc_vs_gpu:4073) CRITICAL: mismatch in buffer 0: 0x00000000 instead of 0xdeadbeef
Subtest faulting-reloc-interruptible-hang: FAIL (104.713s)
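
When comparing runs across platforms, it can help to pull the buffer index and the two values out of the mismatch line automatically. The following is a minimal sketch, assuming the log line keeps the format shown above; the names `MISMATCH_RE` and `parse_mismatch` are made up for this example and are not part of IGT.

```python
import re

# Pattern for the mismatch line as it appears in the log above.
MISMATCH_RE = re.compile(
    r"mismatch in buffer (?P<index>\d+): "
    r"(?P<actual>0x[0-9a-fA-F]+) instead of (?P<expected>0x[0-9a-fA-F]+)"
)

def parse_mismatch(log_text):
    """Return (buffer index, actual value, expected value), or None."""
    match = MISMATCH_RE.search(log_text)
    if match is None:
        return None
    return (
        int(match.group("index")),
        int(match.group("actual"), 16),
        int(match.group("expected"), 16),
    )

# Sample line copied from the failure output in this report.
sample = (
    "(gem_reloc_vs_gpu:4073) CRITICAL: mismatch in buffer 0: "
    "0x00000000 instead of 0xdeadbeef"
)
index, actual, expected = parse_mismatch(sample)
print(f"buffer {index}: read {actual:#010x}, expected {expected:#010x}")
```

In the failure above, buffer 0 read back as zero rather than 0xdeadbeef.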



==Reproduce steps==
---------------------------- 
1. ./gem_reloc_vs_gpu --run-subtest faulting-reloc-interruptible-hang
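
To check how consistently the failure reproduces, the step above can be repeated from a small script. This is only a sketch: the helper names (`build_command`, `run_subtest`, `count_failures`) are hypothetical, and it assumes that an exit code of 0 means the subtest passed and anything else means it did not.

```python
import subprocess

def build_command(binary, subtest):
    """Command line for a single subtest, as given in the reproduce step."""
    return [binary, "--run-subtest", subtest]

def run_subtest(binary, subtest, runs=1, runner=subprocess.run):
    """Run the subtest `runs` times and collect the exit codes."""
    codes = []
    for _ in range(runs):
        result = runner(build_command(binary, subtest))
        codes.append(result.returncode)
    return codes

def count_failures(codes):
    """Count runs that did not exit with 0 (assumption: 0 means pass)."""
    return sum(1 for code in codes if code != 0)
```

For example, `count_failures(run_subtest("./gem_reloc_vs_gpu", "faulting-reloc-interruptible-hang", runs=5))`, run from the directory that holds the test binary, gives the number of non-passing runs out of five.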
Comment 1 cprigent 2015-10-14 10:34:34 UTC
Reproduced on BSW:
Hardware:
Platform: Braswell M
CPU : Intel(R) Celeron N3060 1.60GHz @ 1.6 GHz (family: 6, model: 76 stepping: 4)
SoC : BSW D0
QDF : K6XC
CRB : BRASWELL RVP Fab2
Mandatory Reworks : All 
Feature Reworks: F28, F32, F33, F35, F37
Optional reworks : O-01a; O-02, O-03
Software :
Linux distribution: Ubuntu 14.04 LTS 64 bits 
BIOS : BRAS.X64.B084.R00.1508310642
TXE FW : 2.0.0.2073
Ksc : 1.08
kernel 4.3.0-rc3-drm-intel-nightly+ (eb69e51) from git://anongit.freedesktop.org/drm-intel
Mesa - 11.0.2 from http://cgit.freedesktop.org/mesa/mesa/
xf86-video-intel - 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm - 2.4.64 from http://cgit.freedesktop.org/mesa/drm/
Libva - 1.6.1 from http://cgit.freedesktop.org/libva/
vaapi intel-driver - 1.6.1 from http://cgit.freedesktop.org/vaapi/intel-driver
Cairo - 1.14.2 from http://cgit.freedesktop.org/cairo
Xorg Xserver - 1.17.2 from http://cgit.freedesktop.org/xorg/xserver
Comment 2 cprigent 2015-10-20 06:52:10 UTC
Reproduced on BSW:
Platform: Braswell M
CPU : Intel(R) Celeron N3060 1.60GHz @ 1.6 GHz (family: 6, model: 76 stepping: 4)
SoC : BSW D0
QDF : K6XC
CRB : BRASWELL RVP Fab2
Mandatory Reworks : All 
Feature Reworks: F28, F32, F33, F35, F37
Optional reworks : O-01a; O-02, O-03

BIOS : BRAS.X64.B084.R00.1508310642
TXE FW : 2.0.0.2073
Ksc : 1.08
Linux distribution: Ubuntu 14.04 LTS 64 bits
kernel 4.3.0-rc5-drm-intel-nightly+ 819f710081d7ea116b9b44a9264061d2c030f009 from git://anongit.freedesktop.org/drm-intel
Mesa - 11.0.3 from http://cgit.freedesktop.org/mesa/mesa/
xf86-video-intel - 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm - 2.4.65 from http://cgit.freedesktop.org/mesa/drm/
Libva - 1.6.1 from http://cgit.freedesktop.org/libva/
vaapi intel-driver - 1.6.1 from http://cgit.freedesktop.org/vaapi/intel-driver
Cairo - 1.14.2 from http://cgit.freedesktop.org/cairo
Xorg Xserver - 1.17.2 from http://cgit.freedesktop.org/xorg/xserver

Kernel commit 819f710081d7ea116b9b44a9264061d2c030f009
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Wed Oct 14 19:05:17 2015 +0200
drm-intel-nightly: 2015y-10m-14d-17h-04m-36s UTC integration manifest
Comment 3 cprigent 2015-11-07 11:55:39 UTC
Reproduced on SKL-Y:

Platform: SKY LAKE Y A0 
CPU : Intel(R) Core(TM) m5-6Y57 CPU @ 1.10GHz (family: 6, model: 78  stepping: 3)
MCP : SKL-Y  D1 2+2 (or ULX-D1)
QDF : QJK9 
CPU : SKL D0
Chipset PCH: Sunrise Point LP C1       
CRB : SKY LAKE Y LPDDR3 RVP3 CRB FAB2
Reworks : All Mandatories + FBS02,FBS03, F23, O-02 & O-06
Software
BIOS : SKLSE2R1.R00.X097.B02.1509020030
ME FW : 11.0.0.1173
Ksc (EC FW): 1.19
Linux distribution: Ubuntu 14.04 LTS 64 bits
kernel 4.3.0-rc7-drm-intel-nightly (86ba603) from git://anongit.freedesktop.org/drm-intel
  commit 86ba603f327626055fe1436112b3786eaaaf7fb1
  Author: Daniel Vetter <daniel.vetter@ffwll.ch>
  Date:   Sat Oct 31 09:27:21 2015 +0100
  drm-intel-nightly: 2015y-10m-31d-08h-26m-39s UTC integration manifest
Mesa 11.0.4 from http://cgit.freedesktop.org/mesa/mesa/
xf86-video-intel - 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm - 2.4.65 from http://cgit.freedesktop.org/mesa/drm/
Libva - 1.6.1 from http://cgit.freedesktop.org/libva/
vaapi intel-driver - 1.6.1 from http://cgit.freedesktop.org/vaapi/intel-driver
Cairo - 1.14.2 from http://cgit.freedesktop.org/cairo
Xorg Xserver - 1.17.2 from http://cgit.freedesktop.org/xorg/xserver
Comment 4 cprigent 2015-11-29 10:56:27 UTC
Fails on IVB with kernel testing tag 2015-11-20.

Hardware:
Motherboard: DH77EB
cpu model name : Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz
cpu model : 58
cpu family : 6
Graphic card: Xeon E3-1200 v2/3rd Gen Core processor Graphics
Software:
Kernel tag drm-intel-testing-2015-11-20 5074e51 (4.4-rc1) from git://anongit.freedesktop.org/drm-intel 
  commit 5074e51ef3a2b0ad4c2354e95aec5380a93966b3
  Author: Imre Deak <imre.deak@intel.com>
  Date:   Fri Nov 20 11:50:03 2015 +0200
  drm-intel-nightly: 2015y-11m-20d-09h-49m-41s UTC integration manifest
Ubuntu 14.04.1 LTS
Bios: EBH7710H.86A.0096.2012.1012.1645
Libdrm: 2.4.65
Comment 5 cprigent 2016-01-17 17:04:40 UTC
Fails on a SKL production device:

Dell Inspiron 15 7000 2-1
Intel Core (TM) i7 CPU @ 6500 2.5
Intel HD Graphics
Linux distribution: Ubuntu 15.10 64 bits
Kernel: drm-intel-nightly 4.4.0-rc8 8834392 from http://cgit.freedesktop.org/drm-intel/
libdrm-2.4.66 from http://cgit.freedesktop.org/mesa/drm/
xf86-video-intel 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
mesa-11.0.8 from http://cgit.freedesktop.org/mesa/mesa/
libva-1.6.2 from http://cgit.freedesktop.org/libva/
intel-driver 1.6.2 from http://cgit.freedesktop.org/vaapi/intel-driver
cairo 1.15.2 from http://cgit.freedesktop.org/cairo
IGT 1.13 cd658a7c6bca74eb897a8ae87751029793f0c4f4 from http://cgit.freedesktop.org/xorg/app/intel-gpu-tools/
DMC and GUC firmware from https://01.org/linuxgraphics/intel-linux-graphics-firmwares
Comment 6 Rami 2016-02-26 16:26:50 UTC
Reproduced on KBL with the latest setup:
Hardware:
---------
Platform: KABY LAKE-U 
CPU : Intel(R) Core(TM) @ 2.60GHz 
MCP : KBL-U  G0  2+2
QDF : QYQ8
Chipset PCH: SPT-LP C1       
CRB : KABY LAKE U DDR3L RVP7 CRB FAB1
BIOS : KBLSE2R1.R00.X015.B01.1511271314
ME FW : 11.5.0.1008
Ksc (EC FW): 1.20
Software:
----------
Linux distribution: Ubuntu 15.10 64 bits
Kernel drm-intel-nightly 4.5.0-rc5_1a139ea from http://cgit.freedesktop.org/drm-intel/
drm: (HEAD, tag: libdrm-2.4.67, tag: 2.4.67) 49efb55846c6ec96400061213b76ab867461c1a from git://git.freedesktop.org/git/mesa/drm
mesa: tag mesa-11.1.1 330aa44 from http://cgit.freedesktop.org/mesa/mesa/
cairo: tag 1.15.2 db8a7f1 from http://cgit.freedesktop.org/cairo
waffle: master bb29b2a from https://github.com/waffle-gl/waffle
xorg-server-macros: master d7acec2 from git://git.freedesktop.org/git/xorg/util/macros
libva: (HEAD, tag: libva-1.6.2.pre1) be42ac44bc12c752f572788f0f4bbc62c13dc6ae from git://git.freedesktop.org/git/vaapi/libva
vaapi-intel-driver: tag 1.6.1 f6fe842 from http://cgit.freedesktop.org/vaapi/intel-driver
Firmware
Tools:
-------
intel-gpu-tools: (HEAD, origin/master, origin/HEAD) f27d295fe3a1ca005dfa0fbfd81d6808b1f5ca47 from git://git.freedesktop.org/git/xorg/app/intel-gpu-tools
Comment 7 cprigent 2016-03-25 14:34:36 UTC
Created attachment 122547
bsw-gem_reloc_vs_gpu-faulting-reloc-interruptible-hang-output

Reproduced on BSW:

Hardware: 
Motherboard: Aspire XC-704 
cpu model name : Intel(R) Pentium(R) CPU N3700 @ 1.60GHz 
cpu model : 76 
cpu family : 6 
Graphic card: Device 22b1 (rev 21)

Software:
Bios: R01-A2
kernel nightly 4.5 10e913a  from https://cgit.freedesktop.org/cgit/?url=drm-intel/
  commit 10e913a48ca36790da9b58bed8729598ea79ebdb
  Author: Imre Deak <imre.deak@intel.com>
  Date:   Thu Mar 17 15:23:17 2016 +0200
  drm-intel-nightly: 2016y-03m-17d-13h-22m-41s UTC integration manifest
cairo 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo
libdrm-2.4.67-7 49041c3 from git://anongit.freedesktop.org/mesa/drm
IGT 1.14 8f6ed9e from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git
Comment 8 cprigent 2016-03-25 14:34:53 UTC
Created attachment 122548
BSW-gem_reloc_vs_gpu-faulting-reloc-interruptible-hang_kern.log
Comment 9 Chris Wilson 2016-09-09 17:55:21 UTC
commit 821ed7df6e2a1dbae243caebcfe21a0a4329fca0
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Fri Sep 9 14:11:53 2016 +0100

    drm/i915: Update reset path to fix incomplete requests
    
    Update reset path in preparation for engine reset which requires
    identification of incomplete requests and associated context and fixing
    their state so that engine can resume correctly after reset.
    
    The request that caused the hang will be skipped and head is reset to the
    start of breadcrumb. This allows us to resume from where we left-off.
    Since this request didn't complete normally we also need to cleanup elsp
    queue manually. This is vital if we employ nonblocking request
    submission where we may have a web of dependencies upon the hung request
    and so advancing the seqno manually is no longer trivial.
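
As a reading aid, the behaviour the commit message describes (skip the hung request, restart at its breadcrumb, let later requests proceed) can be pictured with a toy model. This is not i915 code: `Request`, `ToyEngine` and their fields are invented for this sketch, and a "breadcrumb" is reduced to recording the request's seqno.

```python
from dataclasses import dataclass

@dataclass
class Request:
    seqno: int
    hangs: bool = False     # payload never completes
    skipped: bool = False   # payload dropped by the reset path

class ToyEngine:
    def __init__(self, requests):
        self.requests = requests
        self.head = 0         # index of the next request to execute
        self.last_seqno = 0   # last seqno published by a breadcrumb

    def run(self):
        """Execute queued requests in order; return False on a hang."""
        while self.head < len(self.requests):
            req = self.requests[self.head]
            if req.hangs and not req.skipped:
                return False              # stuck inside this payload
            self.last_seqno = req.seqno   # breadcrumb publishes the seqno
            self.head += 1
        return True

    def reset(self):
        """Skip the hung payload so execution resumes at its breadcrumb."""
        self.requests[self.head].skipped = True

engine = ToyEngine([Request(1), Request(2, hangs=True), Request(3)])
hung = not engine.run()   # stops at request 2
engine.reset()            # request 2's payload is skipped
done = engine.run()       # breadcrumb for 2 is written, then 3 runs
print(hung, done, engine.last_seqno)
```

In this model the hung request still publishes its seqno after the reset, so anything waiting on it is released and the queue drains in order, with no need to advance the seqno by hand.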
