Bug 90064 - [BSW, BDW-U]igt/gem_exec_big sporadically fails
Summary: [BSW, BDW-U]igt/gem_exec_big sporadically fails
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: All Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-04-17 05:42 UTC by lu hua
Modified: 2017-07-24 22:47 UTC (History)
2 users (show)

See Also:
i915 platform: BDW, SKL
i915 features: GEM/Other


Attachments
dmesg (124.86 KB, text/plain)
2015-04-17 05:42 UTC, lu hua
no flags Details

Description lu hua 2015-04-17 05:42:17 UTC
Created attachment 115146 [details]
dmesg

==System Environment==
--------------------------
Regression: not sure

Non-working platforms: BSW

==kernel==
--------------------------
drm-intel-nightly/d600654ab94b325f253e267422dcf60302120ea0
commit d600654ab94b325f253e267422dcf60302120ea0
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Thu Apr 16 17:54:10 2015 +0200

    drm-intel-nightly: 2015y-04m-16d-15h-53m-28s UTC integration manifest

==Bug detailed description==
-----------------------------
It sporadically fails on BSW. it also has bug 90045.

output:
batch_size 1610612736, reloc_ofs 435945472
batch_size 1610612736, reloc_ofs 435949568
batch_size 1610612736, reloc_ofs 435953664
batch_size 1610612736, reloc_ofs 435957760
batch_size 1610612736, reloc_ofs 435961856
batch_size 1610612736, reloc_ofs 435965952
batch_size 1610612736, reloc_ofs 435970048
batch_size 1610612736, reloc_ofs 435974144
batch_size 1610612736, reloc_ofs 435978240
batch_size 1610612736, reloc_ofs 435982336
batch_size 1610612736, reloc_ofs 435986432
batch_size 1610612736, reloc_ofs 435990528
batch_size 1610612736, reloc_ofs 435994624
batch_size 1610612736, reloc_ofs 435998720
batch_size 1610612736, reloc_ofs 436002816
Test assertion failure function gem_execbuf, file ioctl_wrappers.c:445:
Failed assertion: result == 0
Last errno: 5, Input/output error
****  END  ****
FAIL (82.855s)

Dmesg:
[  100.130065] kworker/u8:2: page allocation failure: order:0, mode:0x204020
[  100.130077] CPU: 1 PID: 1024 Comm: kworker/u8:2 Tainted: G        W       4.0.0_drm-intel-nightly_d60065_20150417+ #234
[  100.130123] Workqueue: i915-hangcheck i915_hangcheck_elapsed [i915]
[  100.130128]  0000000000000000 0000000000204020 ffffffff81795847 0000000000000000
[  100.130135]  ffffffff810d56d2 00000000ffffffff ffff88017fffac00 0000000000000001
[  100.130140]  0000000000000001 ffff88017fffa3e0 0000000000000000 0000000000000000
[  100.130146] Call Trace:
[  100.130159]  [<ffffffff81795847>] ? dump_stack+0x40/0x50
[  100.130167]  [<ffffffff810d56d2>] ? warn_alloc_failed+0x109/0x11b
[  100.130174]  [<ffffffff810d7c56>] ? __alloc_pages_nodemask+0x59e/0x6fc
[  100.130181]  [<ffffffff81104767>] ? alloc_pages_current+0xad/0xca
[  100.130188]  [<ffffffff81107fdd>] ? new_slab+0xba/0x278
[  100.130194]  [<ffffffff8179360a>] ? __slab_alloc+0x33c/0x44d
[  100.130223]  [<ffffffffa0107a42>] ? i915_error_object_create+0x160/0x359 [i915]
[  100.130252]  [<ffffffffa0107a42>] ? i915_error_object_create+0x160/0x359 [i915]
[  100.130259]  [<ffffffff811090c6>] ? kmem_cache_alloc_trace+0x5d/0xfb
[  100.130287]  [<ffffffffa0107a42>] ? i915_error_object_create+0x160/0x359 [i915]
[  100.130321]  [<ffffffffa00a7568>] ? i915_capture_error_state+0x1005/0x1471 [i915]
[  100.130357]  [<ffffffffa00aeeed>] ? i915_handle_error+0x7a/0x587 [i915]
[  100.130394]  [<ffffffffa00af75d>] ? i915_hangcheck_elapsed+0x325/0x3bf [i915]
[  100.130402]  [<ffffffff8104ca7f>] ? process_one_work+0x1b2/0x31d
[  100.130409]  [<ffffffff8104d278>] ? worker_thread+0x24d/0x339
[  100.130415]  [<ffffffff8104d02b>] ? cancel_delayed_work_sync+0xa/0xa
[  100.130421]  [<ffffffff81050b25>] ? kthread+0xce/0xd6
[  100.130427]  [<ffffffff81050a57>] ? kthread_create_on_node+0x162/0x162
[  100.130433]  [<ffffffff8179b048>] ? ret_from_fork+0x58/0x90
[  100.130439]  [<ffffffff81050a57>] ? kthread_create_on_node+0x162/0x162
[  100.130443] Mem-Info:
[  100.130446] Node 0 DMA per-cpu:
[  100.130450] CPU    0: hi:    0, btch:   1 usd:   0
[  100.130454] CPU    1: hi:    0, btch:   1 usd:   0
[  100.130457] CPU    2: hi:    0, btch:   1 usd:   0
[  100.130461] CPU    3: hi:    0, btch:   1 usd:   0
[  100.130463] Node 0 DMA32 per-cpu:
[  100.130468] CPU    0: hi:  186, btch:  31 usd:   8
[  100.130471] CPU    1: hi:  186, btch:  31 usd:  30
[  100.130475] CPU    2: hi:  186, btch:  31 usd: 181
[  100.130478] CPU    3: hi:  186, btch:  31 usd:  38
[  100.130481] Node 0 Normal per-cpu:
[  100.130485] CPU    0: hi:  186, btch:  31 usd:  54
[  100.130488] CPU    1: hi:  186, btch:  31 usd:  46
[  100.130492] CPU    2: hi:  186, btch:  31 usd: 185
[  100.130495] CPU    3: hi:  186, btch:  31 usd:  35
[  100.130504] active_anon:10899 inactive_anon:852118 isolated_anon:0
[  100.130504]  active_file:4448 inactive_file:3561 isolated_file:0

==Reproduce steps==
---------------------------- 
1. ./gem_exec_big
Comment 1 Chris Wilson 2015-04-20 20:14:39 UTC
The oom here is mostly irrelevant. It is a victim of the kernel's too-small-to-fail rule, but the failure is the error capture *after* the unexpected GPU hang. Is that GPU hang being tracked? I suspect it is just the same as the thousand or so other such reports we have for BSW already...
Comment 2 lu hua 2015-05-13 02:52:20 UTC
bug 90045 is about gpu hang.
Comment 3 Humberto Israel Perez Rodriguez 2015-09-23 18:02:20 UTC
still happen on SKL-Y with the latest kernel :

Kernel commit log:
commit b4c4542ba1abfb0d3d6913504502573bf2c62b12
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Fri Aug 28 15:51:30 2015 +0200
drm-intel-nightly: 2015y-08m-28d-13h-50m-34s UTC integration manifest
Comment 4 Chris Wilson 2015-09-23 18:04:39 UTC
(In reply to Humberto Israel Perez Rodriguez from comment #3)
> still happen on SKL-Y with the latest kernel :
> 
> Kernel commit log:
> commit b4c4542ba1abfb0d3d6913504502573bf2c62b12
> Author: Daniel Vetter <daniel.vetter@ffwll.ch>
> Date:   Fri Aug 28 15:51:30 2015 +0200
> drm-intel-nightly: 2015y-08m-28d-13h-50m-34s UTC integration manifest

What failure mode?
Comment 5 Elio 2015-10-01 15:27:27 UTC
This issue is reproducible on BDW-U with the following configuration:

Kernel 4.3.0-rc8-drm-intel-testing-2015-08-28
Mesa: mesa-10.6.7 from http://cgit.freedesktop.org/mesa/mesa/
Xf86_video_intel: 2.99.917 from
http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm: libdrm-2.4.64 from http://cgit.freedesktop.org/mesa/drm/
Cairo: 1.14.2 from http://cgit.freedesktop.org/cairo
libva: libva-1.6.0 from http://cgit.freedesktop.org/libva/
intel-driver: 1.6.1. from http://cgit.freedesktop.org/vaapi/intel-driver
xorg: 1.17.99 installed with script git_xorg.sh
Xserver: xorg-server-1.17.2 from http://cgit.freedesktop.org/xorg/xserver
Intel-gpu-tools: 1.12 from http://cgit.freedesktop.org/xorg/app/intel-gpu
Comment 6 Chris Wilson 2015-10-01 15:34:08 UTC
Which issue? This bug report contains at least two. Without providing details of the test failure we cannot make any progress on the bug or even confirm if you are seeing the same issue as the original report.
Comment 7 cprigent 2015-11-01 16:28:28 UTC
gem_exec_big is Pass on BSW-M with last setup.
Let's check on the other platforms

Platform: Braswell M
CPU : Intel(R) Celeron N3060 1.60GHz @ 1.6 GHz (family: 6, model: 76 stepping: 4)
SoC : BSW D0
QDF : K6XC
CRB : BRASWELL RVP Fab2
Mandatory Reworks : All 
Feature Reworks: F28, F32, F33, F35, F37
Optional reworks : O-01a; O-02, O-03
Software
BIOS : BRAS.X64.B084.R00.1508310642
TXE FW : 2.0.0.2073
Ksc : 1.08
Linux distribution: Ubuntu 14.04 LTS 64 bits
kernel 4.3.0-rc6-drm-intel-nightly+ (ab48d30) from git://anongit.freedesktop.org/drm-intel
commit ab48d308f2fe5a8ad3d2fd57d313e74995cf1c20 Author: Daniel Vetter <daniel.vetter@ffwll.ch> Date: Fri Oct 23 12:03:04 2015 +0200 drm-intel-nightly: 2015y-10m-23d-10h-02m-25s UTC integration manifest
Mesa 11.0.4 from http://cgit.freedesktop.org/mesa/mesa/
xf86-video-intel - 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm - 2.4.65 from http://cgit.freedesktop.org/mesa/drm/
Libva - 1.6.1 from http://cgit.freedesktop.org/libva/
vaapi intel-driver - 1.6.1 from http://cgit.freedesktop.org/vaapi/intel-driver
Cairo - 1.14.2 from http://cgit.freedesktop.org/cairo
Xorg Xserver - 1.17.2 from http://cgit.freedesktop.org/xorg/xserver
Comment 8 Rami 2015-11-19 10:23:17 UTC
reproduced on BSW-M with last setup:

setup:

Hardware:
Platform: Braswell M 
CPU : Intel(R) Celeron N3060 1.60GHz @ 1.6 GHz (family: 6, model: 76 stepping: 4)
SoC : BSW C0
QDF : K6XC
CRB : BRASWELL RVP Fab2
Mandatory Reworks : All Feature Reworks: F28, F32, F33, F35, F37
Optional reworks : O-01a; O-02, O-03 

Software:
Linux distribution: Ubuntu 15.04 LTS 64 bits 
BIOS : BRAS.X64.B084.R00.1508310642
TXE FW : 2.0.0.2073
Ksc : 1.08
kernel  drm-intel-nightly: 2015y-11m-12d-15h-35m-53s UTC integration manifest
commit 4c2531304c0a2f36f6b2cce2add5b5b2bd3fd893
Author: Jani Nikula <jani.nikula@intel.com>
Date:   Thu Nov 12 17:36:12 2015 +0200
cairo: (HEAD, tag: 1.14.2) 93422b3cb5e0ef8104b8194c8873124ce2f5ea2d from git://git.freedesktop.org/git/cairo
drm: (HEAD, tag: libdrm-2.4.65, tag: 2.4.65) c3496167637e35cf8a52d5e7e53a412e79d80db0 from git://git.freedesktop.org/git/mesa/drm
intel-driver: (HEAD, tag: 1.6.1, origin/v1.6-branch) 35858c69166b845c59ca32e19a3dbb0b758df209 from git://git.freedesktop.org/git/vaapi/intel-driver
libva: (HEAD, tag: libva-1.6.1, origin/v1.6-branch) 613eb962b45fbbd1526d751e88e0d8897af6c0e0 from git://git.freedesktop.org/git/vaapi/libva
mesa: (HEAD, tag: mesa-11.0.5) ee57c22141c42d9b511a7dfa5971c4428cd1c6e7 from git://git.freedesktop.org/git/mesa/mesa
xf86-video-intel: (HEAD, tag: 2.99.917) baec802b21387d04aebb10ac29e719a1800c5aa0 from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel
xserver: (HEAD, tag: xorg-server-1.17.2) 2123f7682d522619f101b05fb75efa75dabbe371 from git://git.freedesktop.org/git/xorg/xserver

* Tools *
intel-gpu-tools: (HEAD, origin/master, origin/HEAD, master) e42936d86b52c6804da41755df7155cafded5eb2 from git://git.freedesktop.org/git/xorg/app/intel-gpu-tools
Comment 9 Chris Wilson 2015-11-19 10:34:24 UTC
(In reply to Rami from comment #8)
> reproduced on BSW-M with last setup:

What was? Can you please attach the test output and supporting log files?
Comment 10 cprigent 2016-02-25 17:48:41 UTC
Tested with last setup.
BDW-U: test is skip
BSW: test is fail, it is tracked by https://bugs.freedesktop.org/show_bug.cgi?id=88392

Platform: Braswell M
CPU : Intel(R) Celeron N3060 1.60GHz @ 1.6 GHz (family: 6, model: 76 stepping: 4)
SoC : BSW D0
QDF : K6XC
CRB : BRASWELL RVP Fab2

WSB5 : Wilson Beach PVT2
Hardware 
Platform:  Wilson Beach PVT2 Ultrabook 
CPU: Intel(R) Core (TM) i5-5300U CPU @ 2.30GHz  (family: 6, model: 61 stepping: 4)
SoC: Broadwell 2+2 F0 (QDF : QH3F)
Chipset PCH: Wildcat Point B2

Software
Linux OS: Ubuntu 15.04 64 bits
BIOS : BRAS.X64.B088.R00.1510270350
TXE FW : 2.0.0.2093
Ksc : 1.08
  kernel 4.5.0-rc5 08fc1b1 from from http://cgit.freedesktop.org/drm-intel
  commit 08fc1b101049694778bff7559e1d05250d2e7072
  Author: Ville Syrjälä <ville.syrjala@linux.intel.com>
  Date: Mon Feb 22 19:31:15 2016 +0200
  drm-intel-nightly: 2016y-02m-22d-17h-30m-27s UTC integration manifest
drm: tag libdrm-2.4.66 9b77443 from http://cgit.freedesktop.org/mesa/drm/
IGT 1.13 babcf40 from https://cgit.freedesktop.org/xorg/app/intel-gpu-tools/
One eDP connected

*** This bug has been marked as a duplicate of bug 88392 ***
Comment 11 cprigent 2016-02-25 17:48:57 UTC
So closed
Comment 12 Chris Wilson 2016-02-25 18:04:46 UTC
Please note this is not a duplicate of the old failure case, but a GPU hang associated with gem_exec_big as well as an oom.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.