Bug 91569 - [BDW-U SKL] System crashing with gem_mmap_gtt over huge-bo-tiledx huge-bo-tiledy and huge-copy-xy
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
Importance: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-08-05 20:16 UTC by Elio
Modified: 2017-07-24 22:45 UTC
CC List: 1 user

See Also:
i915 platform: HSW
i915 features:


Attachments
tiledx_log (59.01 KB, text/plain)
2015-08-05 20:16 UTC, Elio
tyley_log (59.29 KB, text/plain)
2015-08-05 20:21 UTC, Elio
huge-copy-xy-log (59.38 KB, text/plain)
2015-08-05 20:22 UTC, Elio

Description Elio 2015-08-05 20:16:29 UTC
Created attachment 117548
tiledx_log

Test Environment:
------------------------------------
Kernel: 4.2 from Christophe (4.2.0-rc4-drm-intel-testing-ww32+)
Mesa: mesa-10.6.3 from http://cgit.freedesktop.org/mesa/mesa/
Xf86_video_intel: 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm: libdrm-2.4.62 from http://cgit.freedesktop.org/mesa/drm/
Cairo: 1.14.2 from http://cgit.freedesktop.org/cairo
libva: libva-1.6.0 from http://cgit.freedesktop.org/libva/
intel-driver: 1.6.0 from http://cgit.freedesktop.org/vaapi/intel-driver
xorg: 1.17.99 installed with script git_xorg.sh
Xserver: xorg-server-1.17.2 from http://cgit.freedesktop.org/xorg/xserver
Intel-gpu-tools: 1.11 from http://cgit.freedesktop.org/xorg/app/intel-gpu-tools/

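Steps to reproduce:
-------------------
1. Launch igt as root with one of the following commands:
${IGT_DIRNAME}/scripts/run-tests.sh -t gem_mmap_gtt@huge-bo-tiledx
${IGT_DIRNAME}/scripts/run-tests.sh -t gem_mmap_gtt@huge-bo-tiledy
${IGT_DIRNAME}/scripts/run-tests.sh -t gem_mmap_gtt@huge-copy-xy
2. The test starts.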
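
For anyone scripting the reproduction, here is a minimal Python sketch that builds the three command lines from step 1. It is not part of the original report. The helper name `build_commands` and the fallback path are hypothetical; only `run-tests.sh`, the `-t` option, `IGT_DIRNAME` and the subtest names come from the steps above. The commands are printed rather than executed, since running them crashed the affected machines.

```python
import os

# Subtest names as given in the report.
SUBTESTS = ["huge-bo-tiledx", "huge-bo-tiledy", "huge-copy-xy"]


def build_commands(igt_dir, subtests=SUBTESTS):
    """Return one run-tests.sh argument list per subtest (hypothetical helper)."""
    script = os.path.join(igt_dir, "scripts", "run-tests.sh")
    return [[script, "-t", f"gem_mmap_gtt@{name}"] for name in subtests]


if __name__ == "__main__":
    # IGT_DIRNAME is the variable used in step 1; the fallback is a placeholder.
    igt_dir = os.environ.get("IGT_DIRNAME", "/path/to/intel-gpu-tools")
    for cmd in build_commands(igt_dir):
        # Print only: these tests are expected to crash an affected system.
        print(" ".join(cmd))
```

Each list could be handed to `subprocess.run` (as root) on a machine that is safe to crash.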

Actual result:
---------------
2. The test crashes after about 4 seconds with the following messages:
gem_mmap_gtt: executing
[  214.755154] gem_mmap_gtt: starting subtest huge-bo-tiledY
[  214.839788] gem_mmap_gtt (3225): drop_caches: 3
[  218.192209] BUG: unable to handle kernel paging request at ffffc90003800000
[  218.192258] 
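
The "about 4 seconds" figure can be checked against the kernel timestamps in the excerpt above. The following is a rough Python sketch, not part of the original report: the function name `seconds_to_crash` is made up for illustration, and it assumes the usual dmesg `[seconds.microseconds]` prefix on each line.

```python
import re

# Matches the "[  214.755154] message" prefix that dmesg puts on each line.
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s?(.*)$")

# Timestamped lines copied from the excerpt in this report.
SAMPLE = """\
[  214.755154] gem_mmap_gtt: starting subtest huge-bo-tiledY
[  214.839788] gem_mmap_gtt (3225): drop_caches: 3
[  218.192209] BUG: unable to handle kernel paging request at ffffc90003800000
"""


def seconds_to_crash(log_text):
    """Return seconds from 'starting subtest' to the first 'BUG:' line, or None."""
    start = None
    for line in log_text.splitlines():
        match = LINE_RE.match(line)
        if not match:
            continue  # skip lines without a timestamp prefix
        stamp, message = float(match.group(1)), match.group(2)
        if "starting subtest" in message:
            start = stamp
        elif message.startswith("BUG:") and start is not None:
            return stamp - start
    return None


if __name__ == "__main__":
    print(f"{seconds_to_crash(SAMPLE):.2f} s")
```

For the lines quoted here the gap is roughly 3.4 seconds, which is consistent with the reported "about 4 seconds". The same function can be run over the attached full logs.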

Expected result:
----------------
2. The test completes successfully (or is eventually reported as fail/timeout).
Comment 1 Elio 2015-08-05 20:21:20 UTC
Created attachment 117549
tyley_log
Comment 2 Elio 2015-08-05 20:22:52 UTC
Created attachment 117550
huge-copy-xy-log
Comment 3 cprigent 2015-08-09 15:11:06 UTC
huge-copy-xy crash is known: https://bugs.freedesktop.org/show_bug.cgi?id=91116
Comment 4 Elio 2015-08-12 14:16:12 UTC
kernel tag drm-intel-testing-2015-07-31 (4.2-rc4) from git://anongit.freedesktop.org/drm-intel 

Platform: BDW-U (Lenovo G50)
Comment 5 cprigent 2015-08-16 13:53:26 UTC
Result: the crash is reproduced on SKL with huge-bo-tiledx and huge-bo-tiledy.

Platform: SKY LAKE Y A0
CPU : Intel(R) Core(TM) m3-6Y30 CPU @ 0.8GHz 4MB (family: 6, model: 78  stepping: 3)
MCP : SKL-Y  D1  2+2
QDF : QVY3 
CPU : SKL D1
Chipset PCH: Sunrise Point LP C1       
CRB : SKY LAKE Y LPDDR3 RVP3 CRB FAB2
Reworks : All Mandatories + FBS02 & FBS03, O-06
Software
BIOS : SKLSE2R1.R00.X093.B02.1507222151
ME FW : 11.0.0.1157
Ksc (EC FW): 1.15
Linux distribution: Ubuntu 14.04 LTS 64 bits
Kernel : drm-intel-nightly 308b72e08b237aa7cde758fc44f88851710e417d 4.2.0-rc5 from git://anongit.freedesktop.org/drm-intel 
Mesa: mesa-10.6.3 ddc976368fef367e464472ebcc2ac4fd89eb9fd8 from http://cgit.freedesktop.org/mesa/mesa/
Xf86_video_intel: 2.99.917 baec802b21387d04aebb10ac29e719a1800c5aa0 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm: libdrm-2.4.62 ba4b5ac010ab85406ec52e3906e13d58cd9aa782 from http://cgit.freedesktop.org/mesa/drm/
Cairo: 1.14.2 93422b3cb5e0ef8104b8194c8873124ce2f5ea2d from http://cgit.freedesktop.org/cairo
libva: libva-1.6.0 a8008998bc0d4a76ae6927607c048e52ba50fd0e from http://cgit.freedesktop.org/libva/
intel-driver: 1.6.0 32268c46d538667d437dc9266aa4c183e51c1286 from http://cgit.freedesktop.org/vaapi/intel-driver
Xserver: xorg-server-1.17.2 2123f7682d522619f101b05fb75efa75dabbe371 from http://cgit.freedesktop.org/xorg/xserver

IGT: 1.11-g5c07135 from http://cgit.freedesktop.org/xorg/app/intel-gpu-tools/ 

Kernel commit log:
commit 308b72e08b237aa7cde758fc44f88851710e417d
Author: Daniel Vetter
Date: Fri Aug 7 19:09:47 2015 +0200
drm-intel-nightly: 2015y-08m-07d-17h-08m-56s UTC integration manifest
Comment 6 cprigent 2015-08-16 13:58:56 UTC
SKL - huge-bo-tiledy crash is known: https://bugs.freedesktop.org/show_bug.cgi?id=91372
Comment 7 Chris Wilson 2015-08-16 14:14:14 UTC
(In reply to cprigent from comment #6)
> SKL - huge-bo-tiledy crash is known:
> https://bugs.freedesktop.org/show_bug.cgi?id=91372

Nope, completely different failure.
Comment 8 Humberto Israel Perez Rodriguez 2015-10-15 15:43:42 UTC
Reproduced on HSW with the latest configuration:

Configuration :
---------------------------------------------
kernel: 4.3.0-rc4-drm-intel-testing-2015-10-10
xorg-server-1.17.2
libdrm-2.4.65
xf86-video-intel-2.99.917
mesa-11.0.2
libva-1.6.1
intel-driver-1.6.1
cairo-1.14.2
IGT Version : 1.12-g1f9e055

Sub-tests
------------------------------------------
huge-bo-tiledY
huge-bo-tiledX
Comment 9 Elio 2015-11-25 19:22:35 UTC
The problem is present on BYT with the latest graphics stack and the following kernel: 4.4.0-rc1-nightly+

[  604.956891] WARNING: CPU: 1 PID: 2354 at /home/shared/kernels/drm-intel/drivers/gpu/drm/i915/i915_gem.c:1880 i915_gem_fault+0x255/0x470 [i915]()
[  604.956895] unhandled error in i915_gem_fault: -7
[  604.956898] Modules linked in: binfmt_misc nls_iso8859_1 uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core hid_multitouch v4l2_common videodev usbhid snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic intel_rapl intel_soc_dts_iosf intel_powerclamp coretemp kvm_intel kvm irqbypass arc4 rtl8188ee crct10dif_pclmul crc32_pclmul rtl_pci rtlwifi mac80211 cryptd snd_soc_rt5640 snd_hda_intel snd_soc_rl6231 snd_intel_sst_acpi cfg80211 snd_hda_codec snd_intel_sst_core snd_soc_sst_mfld_platform snd_soc_core snd_hda_core joydev snd_compress serio_raw snd_hwdep snd_pcm_dmaengine i915 snd_pcm toshiba_acpi sparse_keymap snd_seq_midi snd_seq_midi_event drm_kms_helper toshiba_bluetooth snd_rawmidi drm dw_dmac snd_seq dw_dmac_core video i2c_algo_bit intel_smartconnect wmi fb_sys_fops
[  604.956969]  snd_seq_device syscopyarea snd_timer rfkill_gpio sysfillrect snd i2c_hid sysimgblt xhci_pci mei_txe hid xhci_hcd mei snd_soc_sst_acpi spi_pxa2xx_platform i2c_designware_platform i2c_designware_core 8250_dw shpchp iosf_mbi lpc_ich soundcore mac_hid parport_pc ppdev lp parport autofs4 psmouse r8169 mii ahci libahci sdhci_acpi sdhci
[  604.957010] CPU: 1 PID: 2354 Comm: gem_mmap_gtt Tainted: G     U          4.4.0-rc1-nightly+ #1
[  604.957013] Hardware name: TOSHIBA Satellite C55t-A/Portable PC, BIOS 1.30 03/24/2014
[  604.957017]  ffffffffa03ee670 ffff88005e7dfc60 ffffffff8139129d ffff88005e7dfca8
[  604.957023]  ffff88005e7dfc98 ffffffff81076196 0000000000000002 ffff880036933800
[  604.957028]  00000000fffffff9 ffff8801705fd170 ffff88005e7dfdb8 ffff88005e7dfcf8
[  604.957034] Call Trace:
[  604.957044]  [<ffffffff8139129d>] dump_stack+0x44/0x57
[  604.957051]  [<ffffffff81076196>] warn_slowpath_common+0x86/0xc0
[  604.957056]  [<ffffffff8107621c>] warn_slowpath_fmt+0x4c/0x50
[  604.957087]  [<ffffffffa0355315>] i915_gem_fault+0x255/0x470 [i915]
[  604.957093]  [<ffffffff8119accd>] __do_fault+0x3d/0xa0
[  604.957099]  [<ffffffff810686d3>] ? pte_alloc_one+0x33/0x40
[  604.957104]  [<ffffffff8119f03a>] handle_mm_fault+0xe9a/0x1820
[  604.957110]  [<ffffffff810b0c32>] ? pick_next_task_fair+0x322/0x4b0
[  604.957117]  [<ffffffff8106321a>] __do_page_fault+0x19a/0x430
[  604.957122]  [<ffffffff810634e0>] do_page_fault+0x30/0x80
[  604.957128]  [<ffffffff8116e7ad>] ? context_tracking_exit+0x1d/0x30
[  604.957134]  [<ffffffff81769c78>] page_fault+0x28/0x30
[  604.957138] ---[ end trace 4898d5e110f1f6dd ]---
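
To read the "-7" in the warning above: Linux kernel code conventionally returns negative errno values, so -7 corresponds to errno 7, which is E2BIG on Linux. Below is a small Python sketch for decoding such codes. It is not part of the original report, and the helper name `describe_kernel_error` is made up for illustration. It says nothing about why i915_gem_fault returned this value.

```python
import errno
import os


def describe_kernel_error(code):
    """Map a negative kernel return code to its errno name and message."""
    number = -code
    name = errno.errorcode.get(number, "UNKNOWN")
    return name, os.strerror(number)


if __name__ == "__main__":
    # -7 is the value from "unhandled error in i915_gem_fault: -7".
    name, text = describe_kernel_error(-7)
    print(name, "-", text)
```

The name lookup uses the errno table of the machine running the script, so it should be run on Linux to match kernel numbering.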
Comment 10 Chris Wilson 2015-11-25 19:26:53 UTC
(In reply to Elio from comment #9)
> The problem is present on BYT with latest graphic stack and the following
> kernel: 4.4.0-rc1-nightly+
> 
> [  604.956891] WARNING: CPU: 1 PID: 2354 at
> /home/shared/kernels/drm-intel/drivers/gpu/drm/i915/i915_gem.c:1880
> i915_gem_fault+0x255/0x470 [i915]()
> [  604.956895] unhandled error in i915_gem_fault: -7

That's a different failure case, not the bug reported here. That is actually the expected failure on the current kernel.
Comment 11 Chris Wilson 2016-01-28 10:14:14 UTC
As the most recent entry here is for a dupe of the big-copy-xy fail, and the original bug is fixed (bug 91116 and friends), marking as resolved.

