Bug 88845 - [BSW ppgtt Bisected]igt/gem_exec_nop/render sporadically causes system hang
Status: CLOSED DUPLICATE of bug 88652
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: unspecified
Hardware: All Linux (All)
Importance: high critical
Assignee: Nick Hoath
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-01-28 08:25 UTC by lu hua
Modified: 2017-10-06 14:31 UTC

See Also:
i915 platform:
i915 features:


Attachments
dmesg (18.67 KB, text/plain)
2015-01-28 08:25 UTC, lu hua

Description lu hua 2015-01-28 08:25:06 UTC
Created attachment 112904
dmesg

==System Environment==
--------------------------
Regression: not sure. Fail rate: 2/4.

Non-working platforms: BSW

==kernel==
--------------------------
drm-intel-nightly/70438b51bdd44180b83f10591c3bdc1832d11dc2
commit 70438b51bdd44180b83f10591c3bdc1832d11dc2
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Tue Jan 27 12:11:03 2015 +0100

    drm-intel-nightly: 2015y-01m-27d-11h-10m-41s UTC integration manifest

==Bug detailed description==
-----------------------------
The test sporadically hangs the system (fail rate: 2/4). Tested with the -nightly and -next-queued kernels.

output:
IGT-Version: 1.9-gebd8b32 (x86_64) (Linux: 3.19.0-rc6_drm-intel-nightly_70438b_20150128+ x86_64)
Time to exec x 1:               375.000µs (ring=render)
Time to exec x 2:               102.000µs (ring=render)
Time to exec x 4:                27.250µs (ring=render)
Time to exec x 8:                18.625µs (ring=render)
Time to exec x 16:               13.812µs (ring=render)
Time to exec x 32:               10.344µs (ring=render)
Time to exec x 64:                8.109µs (ring=render)
Time to exec x 128:               7.000µs (ring=render)
Time to exec x 256:               6.312µs (ring=render)
Time to exec x 512:               5.424µs (ring=render)
Time to exec x 1024:              5.168µs (ring=render)
Time to exec x 2048:              5.700µs (ring=render)
Time to exec x 4096:              4.883µs (ring=render)
Time to exec x 8192:              4.754µs (ring=render)
Time to exec x 16384:             4.821µs (ring=render)
(gem_exec_nop:4427) CRITICAL: Test assertion failure function loop, file gem_exec_nop.c:100:
(gem_exec_nop:4427) CRITICAL: Failed assertion: exec(fd, handle, count, ring_id) == 0
Subtest render: FAIL (66.469s)


dmesg:
[  131.802039] ------------[ cut here ]------------
[  131.802092] WARNING: CPU: 3 PID: 4427 at drivers/gpu/drm/i915/intel_lrc.c:506 intel_logical_ring_advance_and_submit+0x73/0x254 [i915]()
[  131.802096] execlist context submission without request
[  131.802099] Modules linked in: ipv6 dm_mod snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support snd_hda_codec_realtek snd_hda_codec_generic serio_raw pcspkr snd_hda_intel snd_hda_controller snd_hda_codec snd_hwdep snd_pcm lpc_ich snd_timer mfd_core snd soundcore i2c_i801 battery ac acpi_cpufreq joydev i915 button video drm_kms_helper drm cfbfillrect cfbimgblt cfbcopyarea
[  131.802147] CPU: 3 PID: 4427 Comm: gem_exec_nop Not tainted 3.19.0-rc6_drm-intel-nightly_70438b_20150128+ #1026
[  131.802151]  0000000000000000 0000000000000009 ffffffff81799da8 ffff880175213a78
[  131.802158]  ffffffff8103bdec 0000000000012480 ffffffffa00ba963 0000000000000000
[  131.802164]  ffff8801754518b8 0000000000000000 ffff880175450000 ffff8800028e4900
[  131.802171] Call Trace:
[  131.802184]  [<ffffffff81799da8>] ? dump_stack+0x40/0x50
[  131.802193]  [<ffffffff8103bdec>] ? warn_slowpath_common+0x98/0xb0
[  131.802228]  [<ffffffffa00ba963>] ? intel_logical_ring_advance_and_submit+0x73/0x254 [i915]
[  131.802236]  [<ffffffff8103be9c>] ? warn_slowpath_fmt+0x45/0x4a
[  131.802271]  [<ffffffffa00ba963>] ? intel_logical_ring_advance_and_submit+0x73/0x254 [i915]
[  131.802297]  [<ffffffffa010c407>] ? logical_ring_wait_for_space+0xc2/0x150 [i915]
[  131.802333]  [<ffffffffa00bae2a>] ? intel_logical_ring_begin+0xea/0x1ea [i915]
[  131.802341]  [<ffffffff8110d587>] ? kmem_cache_free+0xf6/0x134
[  131.802376]  [<ffffffffa00bb006>] ? gen8_emit_flush_render+0x3d/0xe1 [i915]
[  131.802411]  [<ffffffffa00bb482>] ? intel_execlists_submission+0x230/0x34b [i915]
[  131.802441]  [<ffffffffa009e347>] ? i915_gem_do_execbuffer.isra.12+0xca1/0xd88 [i915]
[  131.802449]  [<ffffffff8105d25f>] ? check_preempt_wakeup+0x117/0x169
[  131.802455]  [<ffffffff8105eafc>] ? enqueue_task_fair+0x441/0x88d
[  131.802462]  [<ffffffff8110d964>] ? __kmalloc+0x66/0x151
[  131.802491]  [<ffffffffa009f381>] ? i915_gem_execbuffer2+0x172/0x209 [i915]
[  131.802520]  [<ffffffffa009f20f>] ? i915_gem_execbuffer+0x350/0x350 [i915]
[  131.802538]  [<ffffffffa001070a>] ? drm_ioctl+0x279/0x3bc [drm]
[  131.802544]  [<ffffffff8105ca3d>] ? set_next_entity+0x14/0x37
[  131.802574]  [<ffffffffa009f20f>] ? i915_gem_execbuffer+0x350/0x350 [i915]
[  131.802590]  [<ffffffffa0010491>] ? drm_copy_field+0x50/0x50 [drm]
[  131.802598]  [<ffffffff81122409>] ? do_vfs_ioctl+0x412/0x459
[  131.802605]  [<ffffffff81129786>] ? __fdget+0x5/0x7
[  131.802612]  [<ffffffff81122499>] ? SyS_ioctl+0x49/0x78
[  131.802620]  [<ffffffff8179f792>] ? system_call_fastpath+0x12/0x17
[  131.802625] ---[ end trace 26c26963615ce067 ]---
[  131.802628] ------------[ cut here ]------------
[  131.802663] WARNING: CPU: 3 PID: 4427 at include/linux/kref.h:47 intel_logical_ring_advance_and_submit+0xed/0x254 [i915]()
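
When a log contains several back-to-back splats like the two above, it can help to list just the WARNING sites before reading the full traces. The following is a minimal sketch, not part of the original report: the function name warning_sites and the SAMPLE string are made up for illustration, and the regular expression assumes the header layout seen in this dmesg ("WARNING: CPU: N PID: N at file:line function+offset").

```python
import re

# Assumed header layout, taken from the dmesg excerpt in this report:
#   WARNING: CPU: 3 PID: 4427 at <file>:<line> <function>+<offset>/<size> [module]()
WARN_RE = re.compile(
    r"WARNING: CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) at "
    r"(?P<file>\S+):(?P<line>\d+) (?P<func>[^\s+]+)\+"
)


def warning_sites(dmesg_text):
    """Return (file, line, function) for each WARNING header, in log order."""
    return [
        (m.group("file"), int(m.group("line")), m.group("func"))
        for m in WARN_RE.finditer(dmesg_text)
    ]


# The two WARNING headers quoted in this report, used as sample input.
SAMPLE = """\
[  131.802092] WARNING: CPU: 3 PID: 4427 at drivers/gpu/drm/i915/intel_lrc.c:506 intel_logical_ring_advance_and_submit+0x73/0x254 [i915]()
[  131.802096] execlist context submission without request
[  131.802663] WARNING: CPU: 3 PID: 4427 at include/linux/kref.h:47 intel_logical_ring_advance_and_submit+0xed/0x254 [i915]()
"""

for path, line, func in warning_sites(SAMPLE):
    print(f"{path}:{line} in {func}")
```

Both warnings are raised from intel_logical_ring_advance_and_submit, the first at intel_lrc.c:506 and the second at kref.h:47.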


==Reproduce steps==
---------------------------- 
1. ./gem_exec_nop --run-subtest render
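
Because the failure is sporadic, a single run says little. A rough sketch of a repeat runner is shown below; it is an illustration only and not the script used for this report. The helper name count_failures and the 300-second timeout in the usage comment are assumptions. A full system hang will of course take the script down with it, so this only counts runs that exit non-zero or exceed the timeout.

```python
import subprocess


def count_failures(cmd, runs, timeout_s=None):
    """Run cmd `runs` times; count non-zero exits and timeouts as failures."""
    failures = 0
    for _ in range(runs):
        try:
            completed = subprocess.run(cmd, timeout=timeout_s)
        except subprocess.TimeoutExpired:
            # subprocess.run kills the child once timeout_s has elapsed.
            failures += 1
            continue
        if completed.returncode != 0:
            failures += 1
    return failures


# Hypothetical usage, from the directory holding the IGT test binary
# (the timeout value is an assumption, not taken from this report):
#   failed = count_failures(["./gem_exec_nop", "--run-subtest", "render"], 4, timeout_s=300)
#   print(f"fail rate: {failed}/4")
```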
Comment 1 lu hua 2015-02-06 02:39:26 UTC
With i915.enable_ppgtt=0 set, the test passed in all 10 cycles.
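
As a rough sanity check on how much weight 10 clean cycles carry, the sketch below computes the chance of seeing that many passes in a row if the bug were still present. It assumes the runs are independent and takes the reported 2/4 fail rate at face value, which a sample of four runs cannot really establish. The function name is made up for illustration.

```python
def chance_of_clean_streak(fail_rate, runs):
    """Probability that `runs` independent runs all pass at the given fail rate."""
    return (1.0 - fail_rate) ** runs


# Reported fail rate 2/4, followed by 10 passing cycles with ppgtt disabled.
p = chance_of_clean_streak(2 / 4, 10)
print(f"chance of 10 lucky passes: {p:.4%}")
```

Under those assumptions the chance is 1 in 1024, about 0.1%, so the 10 passing cycles are unlikely to be luck.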
Comment 2 lu hua 2015-02-10 06:04:24 UTC
Bisect shows: 6d3d8274bc45de4babb62d64562d92af984dd238 is the first bad commit.
commit 6d3d8274bc45de4babb62d64562d92af984dd238
Author:     Nick Hoath <nicholas.hoath@intel.com>
AuthorDate: Thu Jan 15 13:10:39 2015 +0000
Commit:     Daniel Vetter <daniel.vetter@ffwll.ch>
CommitDate: Tue Jan 27 09:50:53 2015 +0100

    drm/i915: Subsume intel_ctx_submit_request in to drm_i915_gem_request

    Move all remaining elements that were unique to execlists queue items
    in to the associated request.

    Issue: VIZ-4274

    v2: Rebase. Fixed issue of overzealous freeing of request.
    v3: Removed re-addition of cleanup work queue (found by Daniel Vetter)
    v4: Rebase.
    v5: Actual removal of intel_ctx_submit_request. Update both tail and postfix
    pointer in __i915_add_request (found by Thomas Daniel)
    v6: Removed unrelated changes

    Signed-off-by: Nick Hoath <nicholas.hoath@intel.com>
    Reviewed-by: Thomas Daniel <thomas.daniel@intel.com>
    [danvet: Reformat comment with strange linebreaks.]
    Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Comment 3 Jani Nikula 2015-02-10 08:18:04 UTC

*** This bug has been marked as a duplicate of bug 88652 ***
Comment 4 lu hua 2015-03-13 06:53:25 UTC
Verified. Fixed.
Comment 5 Elizabeth 2017-10-06 14:31:45 UTC
Closing old verified bug.

