Bug 78283

Summary: [BDW Regression]igt/gem_cpu_reloc stops at 1% and Call Trace in dmesg
Product: DRI Reporter: Guo Jinxian <jinxianx.guo>
Component: DRM/IntelAssignee: Mika Kuoppala <mika.kuoppala>
Status: CLOSED FIXED QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal    
Priority: high CC: intel-gfx-bugs, yk
Version: unspecified   
Hardware: Other   
OS: All   
Whiteboard:
i915 platform: i915 features:
Attachments:
Description Flags
dmesg none

Description Guo Jinxian 2014-05-05 07:45:22 UTC
Created attachment 98453 [details]
dmesg

*System Environment:
--------------------------
Platform: BDW
kernel: 
-nightly: 08ce6614d07dd1e426109672a5e323317c8d6ec7(fails)
-queued: e5c03ca362819ba8ffbe5674340b61b9cd75de8f (fails)
-fixes: 9bbfd20abe5025adbb0ac75160bd2e41158a9e83 (works)


 *Bug detailed description:
-----------------------------
igt/gem_cpu_reloc stops at 1% and Call Trace in dmesg

It's a regression bug
good commit: c79057922ed6c2c6df1214e6ab4414fea1b23db2
bad commit: e5c03ca362819ba8ffbe5674340b61b9cd75de8f 
We will bisect it later

Dmesg shows:
[   27.313507] Oops: 0002 [#1] SMP
[   27.313530] Modules linked in: dm_mod snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support ppdev pcspkr lpc_ich mfd_core snd_hda_intel snd_hda_controller i2c_i801 snd_hda_codec snd_hwdep snd_pcm snd_timer snd soundcore battery parport_pc parport ac acpi_cpufreq i915 video button drm_kms_helper drm
[   27.313730] CPU: 0 PID: 4331 Comm: gem_cpu_reloc Not tainted 3.15.0-rc2_drm-intel-nightly_08ce66_20140504+ #2286
[   27.313786] task: ffff88014483f620 ti: ffff880149ace000 task.ti: ffff880149ace000
[   27.313826] RIP: 0010:[<ffffffff812e6723>]  [<ffffffff812e6723>] iowrite32+0xe/0x28
[   27.313872] RSP: 0018:ffff880149acfc00  EFLAGS: 00010292
[   27.313902] RAX: 0000000011000001 RBX: ffff880145629780 RCX: 000000000001ffe0
[   27.313940] RDX: 000000000001fff0 RSI: ffffc900110a0000 RDI: ffffc900110a0000
[   27.313978] RBP: 0000000000000002 R08: ffff880147558578 R09: ffff880147558578
[   27.314016] R10: 0000000000000000 R11: 0000000000000000 R12: ffff880145628000
[   27.314054] R13: 0000000000000000 R14: 0000000000022040 R15: ffff8801492f4de0
[   27.314093] FS:  00007fa1904238c0(0000) GS:ffff88014f400000(0000) knlGS:0000000000000000
[   27.314136] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   27.314167] CR2: ffffc900110a0000 CR3: 000000014833f000 CR4: 00000000003407f0
[   27.314205] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   27.314243] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   27.314281] Stack:
[   27.314294]  ffffffffa008b5ea 000000000081c000 0000000000000004 ffff880145629780
[   27.314342]  ffff880145628000 ffff88000242be00 0000000000000000 ffff880145629780
[   27.314389]  ffffffffa008a9a8 ffff88000242be00 ffffffffa008d3a1 0000000000000000
[   27.314437] Call Trace:
[   27.314467]  [<ffffffffa008b5ea>] ? gen6_signal+0x7b/0x10c [i915]
[   27.314511]  [<ffffffffa008a9a8>] ? gen6_add_request+0x13/0x88 [i915]
[   27.314556]  [<ffffffffa008d3a1>] ? intel_ring_flush_all_caches+0x1c/0x29 [i915]
[   27.314605]  [<ffffffffa007a17f>] ? __i915_add_request+0x1c4/0x1e3 [i915]
[   27.314650]  [<ffffffffa0075eb1>] ? i915_gem_do_execbuffer.isra.16+0xff1/0x1142 [i915]
[   27.314694]  [<ffffffff812e3c23>] ? __sg_page_iter_next+0x2b/0x58
[   27.314736]  [<ffffffffa00764d0>] ? i915_gem_execbuffer2+0x177/0x1fb [i915]
[   27.314779]  [<ffffffffa0002f22>] ? drm_ioctl+0x25c/0x3ad [drm]
[   27.314819]  [<ffffffffa0076359>] ? i915_gem_execbuffer+0x357/0x357 [i915]
[   27.314859]  [<ffffffff81725de3>] ? __schedule+0x638/0x77e
[   27.314893]  [<ffffffff810ee9b3>] ? do_vfs_ioctl+0x3ec/0x435
[   27.314926]  [<ffffffff810eea45>] ? SyS_ioctl+0x49/0x78
[   27.314958]  [<ffffffff8172d622>] ? system_call_fastpath+0x16/0x1b
[   27.314991] Code: 48 89 f7 76 05 e9 21 5c d4 ff 48 81 fe 00 00 01 00 77 09 48 c7 c6 15 9f 9b 81 eb aa c3 48 81 fe ff ff 03 00 89 f8 48 89 f7 76 03 <89> 06 c3 48 81 fe 00 00 01 00 76 05 0f b7 d6 ef c3 48 c7 c6 49
[   27.315227] RIP  [<ffffffff812e6723>] iowrite32+0xe/0x28
[   27.315261]  RSP <ffff880149acfc00>
[   27.315281] CR2: ffffc900110a0000
[   27.315301] ---[ end trace de5b068c0f112569 ]---


Output:
IGT-Version: 1.6-gc864279 (x86_64) (Linux: 3.15.0-rc2_drm-intel-nightly_08ce66_20140504+ x86_64)
gem_cpu_reloc:   1%


 *Reproduce steps:
---------------------------- 
1. ./gem_cpu_reloc
Comment 1 Chris Wilson 2014-05-15 10:46:27 UTC
commit d1533379584f8edcfcabb024dffc1b334db8da0f
Author: Oscar Mateo <oscar.mateo@intel.com>
Date:   Fri May 9 13:44:59 2014 +0100

    drm/i915: Ringbuffer signal func for the second BSD ring
    
    This is missing in:
    
    commit 78325f2d270897c9ee0887125b7abb963eb8efea
    Author: Ben Widawsky <benjamin.widawsky@intel.com>
    Date:   Tue Apr 29 14:52:29 2014 -0700
    
        drm/i915: Virtualize the ringbuffer signal func
    
    Looks to me like a rebase side-effect...
    
    Signed-off-by: Oscar Mateo <oscar.mateo@intel.com>
    Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Comment 2 Chris Wilson 2014-05-15 16:09:48 UTC
Mika is on the trail.
Comment 3 Chris Wilson 2014-05-16 10:28:34 UTC
commit 6e450ab24dc645d776e65bbb91fc5f6788087c32
Author: Mika Kuoppala <mika.kuoppala@linux.intel.com>
Date:   Thu May 15 20:58:07 2014 +0300

    drm/i915: Bail out early on gen6_signal if no semaphores
    
    If we dont have semaphores enabled, we allocate 4
    dwords for signalling. But end up emitting more regardless.
    
    Fix this by bailing out early if semaphores are not enabled.
    
    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=78274
    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=78283
    Signed-off-by: Mika Kuoppala <mika.kuoppala@intel.com>
    Reviewed-by: Ville Syrjälä <ville.syrjala@linux.intel.com>
    Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Comment 4 liulei 2014-05-26 08:29:09 UTC
Verified 
[root@x-bdw opt]# ./gem_cpu_reloc
IGT-Version: 1.6-gff3c122 (x86_64) (Linux: 3.15.0-rc6_drm-intel-nightly_c37f30_20140526+ x86_64)
gem_cpu_reloc: 100%
Test suceeded, cleanup up - this might take a while.
Comment 5 Elizabeth 2017-10-06 14:38:18 UTC
Closing old verified.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.