Bug 82740

Summary: [BDW/BSW Bisected] System unable to boot up, with execlist enabled
Product: DRI Reporter: Guo Jinxian <jinxianx.guo>
Component: DRM/IntelAssignee: Mika Kuoppala <mika.kuoppala>
Status: CLOSED FIXED QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: blocker    
Priority: highest CC: intel-gfx-bugs, lei.a.liu, przanoni
Version: unspecified   
Hardware: Other   
OS: All   
Whiteboard:
i915 platform: i915 features:

Description Guo Jinxian 2014-08-18 02:12:42 UTC
==System Environment==
--------------------------
Regression: Yes.
Good commit on -next-queued: 4ed91096881449012b14b1e879f40b4a37533e0e

Non-working platforms: BSW

==kernel==
--------------------------
origin/drm-intel-nightly: 186631131a9289dad22f51315d78b9b6ac5b425f(fails)
    drm-intel-nightly: 2014y-08m-15d-14h-55m-27s integration manifest
origin/drm-intel-next-queued: ecca3fd0ea82c45b558f68b1ff3bf6685b443ab1(fails)
    drm/i915/bdw: Enable Logical Ring Contexts (hence, Execlists)
origin/drm-intel-fixes: 103ae732ad26141515f109c80b5c1ced16e457c3(System is able to boot up, can log in by ssh, and testdisplay is able to detected monitors like eDP and HDMI, but the monitors unable to light up)
    drm/i915: Don't try to enable cursor from setplane when crtc is disabled

==Bug detailed description==
-----------------------------
System unable to boot up

Reproduce steps:
-------------------------
1. Boot the device.
Comment 1 Guo Jinxian 2014-08-18 02:44:01 UTC
ecca3fd0ea82c45b558f68b1ff3bf6685b443ab1 is the first bad commit
commit ecca3fd0ea82c45b558f68b1ff3bf6685b443ab1
Author:     Oscar Mateo <oscar.mateo@intel.com>
AuthorDate: Thu Jul 24 17:04:49 2014 +0100
Commit:     Daniel Vetter <daniel.vetter@ffwll.ch>
CommitDate: Fri Aug 15 14:48:25 2014 +0200


    drm/i915/bdw: Enable Logical Ring Contexts (hence, Execlists)

    The time has come, the Walrus said, to talk of many things.

    Signed-off-by: Oscar Mateo <oscar.mateo@intel.com>
    Reviewed-by: Damien Lespiau <damien.lespiau@intel.com>
    Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>

:040000 040000 c7e0f1164921eff4b758dddd07240cfe3de7e1e8 744b0adac0c50596a7a2d28eb5c921f6522ba3e5 M      drivers
Comment 2 Jani Nikula 2014-08-18 08:30:48 UTC
Please try http://patchwork.freedesktop.org/patch/31837
Comment 3 Jani Nikula 2014-08-18 08:41:21 UTC
*** Bug 82750 has been marked as a duplicate of this bug. ***
Comment 4 Damien Lespiau 2014-08-18 09:41:22 UTC
Hi,

Can we have the dmesg with drm.debug=6 to see where it stops booting? We're in the first week of the merge window, a great time for kernel-wide regressions.

Thanks,
Comment 5 Damien Lespiau 2014-08-18 09:44:12 UTC
The other bug has a dmesg with a backtrace. Let's see if they are indeed the same bug:

   1.237439] BUG: unable to handle kernel NULL pointer dereference at 00000000000001bc
[    1.237447] IP: [<ffffffffa009ca13>] intel_lr_context_deferred_create+0x4f1/0x64e [i915]
[    1.237483] PGD 0 
[    1.237486] Oops: 0000 [#1] SMP 
[    1.237489] Modules linked in: i915(+) button video drm_kms_helper drm
[    1.237497] CPU: 0 PID: 1433 Comm: udevd Not tainted 3.16.0_liuleill_ecca3f_20140818+ #2
[    1.237501] task: ffff8801497fe660 ti: ffff8800a7fb8000 task.ti: ffff8800a7fb8000
[    1.237504] RIP: 0010:[<ffffffffa009ca13>]  [<ffffffffa009ca13>] intel_lr_context_deferred_create+0x4f1/0x64e [i915]
[    1.237535] RSP: 0018:ffff8800a7fbb9a0  EFLAGS: 00010202
[    1.237538] RAX: 0000000000002270 RBX: ffff880149f21000 RCX: 0000000000000000
[    1.237540] RDX: ffffea000527c840 RSI: ffff880002cf7a10 RDI: ffff880002c14380
[    1.237543] RBP: ffff880002c141c0 R08: 0000000000000000 R09: ffff88014a00e000
[    1.237545] R10: ffff88014a00f428 R11: 0000000000000001 R12: ffff8800a7e96e40
[    1.237548] R13: 0000000000000000 R14: 0000000000000000 R15: ffff880002cf17d8
[    1.237551] FS:  00007f9ef5682840(0000) GS:ffff88014ec00000(0000) knlGS:0000000000000000
[    1.237554] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    1.237556] CR2: 00000000000001bc CR3: 00000000a7ee2000 CR4: 00000000003407f0
[    1.237558] Stack:
[    1.237560]  ffff8801493eb840 ffff880002c14380 ffffea000527c840 ffff8800a7dc1420
[    1.237564]  0000001300000000 0000000000000001 ffff880002cf17d8 ffff880144e3a800
[    1.237568]  ffff8801493eb840 ffff880144e3a800 0000000000000048 ffff880002cf0000
[    1.237572] Call Trace:
[    1.237599]  [<ffffffffa009cc00>] ? logical_ring_init+0x90/0x13d [i915]
[    1.237625]  [<ffffffffa009cd78>] ? intel_logical_rings_init+0xcb/0x3f3 [i915]
[    1.237653]  [<ffffffffa008a4af>] ? i915_gem_init_hw+0x109/0x1bc [i915]
[    1.237678]  [<ffffffffa008a6ea>] ? i915_gem_init+0x188/0x1e3 [i915]
[    1.237713]  [<ffffffffa00dfc9a>] ? i915_driver_load+0xc0b/0xde0 [i915]
[    1.237719]  [<ffffffff81060beb>] ? __wake_up+0x33/0x44
[    1.237724]  [<ffffffff81046172>] ? call_usermodehelper_exec+0xf0/0xfb
[    1.237728]  [<ffffffff81040000>] ? __dequeue_signal+0xcd/0xfc
[    1.237733]  [<ffffffff812dc908>] ? kobject_uevent_env+0x47a/0x4b2
[    1.237738]  [<ffffffff81380837>] ? get_device+0xe/0x16
[    1.237743]  [<ffffffff8170f7d1>] ? klist_add_tail+0x1b/0x3c
[    1.237747]  [<ffffffff81381ba4>] ? device_add+0x4ec/0x4fd
[    1.237751]  [<ffffffff812da708>] ? idr_replace+0x2d/0x93
[    1.237760]  [<ffffffffa0007570>] ? drm_dev_register+0x72/0xe3 [drm]
[    1.237769]  [<ffffffffa0009492>] ? drm_get_pci_dev+0xf4/0x1b0 [drm]
[    1.237773]  [<ffffffff8138a059>] ? __pm_runtime_resume+0x58/0x64
[    1.237778]  [<ffffffff81300aa2>] ? local_pci_probe+0x34/0x77
[    1.237782]  [<ffffffff81300ba6>] ? pci_device_probe+0xc1/0xe5
[    1.237786]  [<ffffffff81383bdd>] ? driver_probe_device+0x9b/0x1cf
[    1.237789]  [<ffffffff81383d95>] ? __driver_attach+0x53/0x73
[    1.237793]  [<ffffffff81383d42>] ? __device_attach+0x31/0x31
[    1.237796]  [<ffffffff813824fa>] ? bus_for_each_dev+0x6e/0x78
[    1.237799]  [<ffffffff813834d2>] ? bus_add_driver+0xfb/0x1c4
[    1.237803]  [<ffffffff81384289>] ? driver_register+0x83/0xbb
[    1.237806]  [<ffffffffa011e000>] ? 0xffffffffa011dfff
[    1.237809]  [<ffffffff810002fa>] ? do_one_initcall+0xe3/0x165
[    1.237814]  [<ffffffff810822e7>] ? load_module+0x163b/0x1b37
[    1.237818]  [<ffffffff8107fd2d>] ? store_uevent+0x30/0x30
[    1.237822]  [<ffffffff81082871>] ? SyS_init_module+0x8e/0x99
[    1.237826]  [<ffffffff8171bc12>] ? system_call_fastpath+0x16/0x1b
[    1.237828] Code: 47 0c 05 78 02 00 00 89 83 b8 00 00 00 41 8b 47 0c 05 74 02 00 00 89 83 c0 00 00 00 41 8b 47 0c 05 70 02 00 00 89 83 c8 00 00 00 <41> 8b 86 bc 01 00 00 89 83 94 00 00 00 49 8b 86 b8 01 00 00 89 
[    1.237859] RIP  [<ffffffffa009ca13>] intel_lr_context_deferred_create+0x4f1/0x64e [i915]
[    1.237888]  RSP <ffff8800a7fbb9a0>
[    1.237890] CR2: 00000000000001bc
[    1.237893] ---[ end trace 351738523b7064ec ]---
Comment 6 Paulo Zanoni 2014-08-18 14:48:27 UTC
I can't boot BDW on today's -nightly unless I pass i915.enable_execlists=0.

I see the exact same bakctrace as menitoned by Damien in comment #5.
Comment 7 liulei 2014-08-19 01:50:22 UTC
(In reply to comment #2)
> Please try http://patchwork.freedesktop.org/patch/31837
This patch works.
Comment 8 liulei 2014-08-20 05:31:45 UTC
I tested latest -nightly (49ceddc7b3b0e3f7eb21b9fb4c9b840cff630047). System can boot up successfully.
Comment 9 Chris Wilson 2014-08-20 05:59:09 UTC
Problem deferred.
Comment 10 Guo Jinxian 2014-08-20 06:18:09 UTC
> I tested latest -nightly (49ceddc7b3b0e3f7eb21b9fb4c9b840cff630047). System
> can boot up successfully.

Verified.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.