Bug 90229

Summary: [PNV/SNB/IVB/HSW/BSW/SKL regression]Call trace during system boot up progress then reboot fail
Product: DRI Reporter: ye.tian <yex.tian>
Component: DRM/IntelAssignee: Keyon <yang.jie>
Status: CLOSED NOTOURBUG QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: major    
Priority: high CC: intel-gfx-bugs, yang.jie
Version: unspecified   
Hardware: All   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:
Attachments:
Description Flags
dmesg info with the nightly kernel
none
dmesg info with the fixes kernel
none
patch to fix the issue none

Description ye.tian 2015-04-29 08:34:29 UTC
Created attachment 115431 [details]
dmesg info with the nightly kernel

==System Environment==       
-----------------------------------------------------
Regression: Yes, 
Non-working platforms: BSW

==Kernel==
--------------------------------------------------
commit 9a4da5ec419897b78c1d4a3949c0c049db5402c5
Author: Jani Nikula <jani.nikula@intel.com>
Date:   Tue Apr 28 17:41:30 2015 +0300

    drm-intel-nightly: 2015y-04m-28d-14h-41m-15s UTC integration manifest

==Bug detailed description==
--------------------------------------------------
Call trace during system boot up progress then reboot fail on the latest nightly kernel, It does not exists on the latest next-queued kernel.
On latest fixes kernel, also have call trace when system boot up, but different with –nightly branch, and reboot is working.

==dmesg info==
[    4.019355] BUG: unable to handle kernel NULL pointer dereference at 0000000000000028
[    4.019365] IP: [<ffffffff81689d8d>] input_event+0x12/0x6b
[    4.019368] PGD 28de067 PUD 28f4067 PMD 0
[    4.019371] Oops: 0000 [#1] SMP
[    4.019392] Modules linked in: snd_hda_codec_realtek(+) iTCO_wdt iTCO_vendor_support snd_hda_codec_generic snd_hda_intel snd_hda_codec snd_hda_core pcspkr snd_hwdep serio_raw snd_pcm snd_timer lpc_ich i2c_i801 mfd_core snd soundcore battery ac acpi_cpufreq i915 button video drm_kms_helper drm
[    4.019397] CPU: 2 PID: 2343 Comm: modprobe Not tainted 4.1.0-rc1_drm-intel-nightly_9a4da5_20150429+ #140
[    4.019399] task: ffff88017a2c4180 ti: ffff880002c90000 task.ti: ffff880002c90000
[    4.019404] RIP: 0010:[<ffffffff81689d8d>]  [<ffffffff81689d8d>] input_event+0x12/0x6b
[    4.019406] RSP: 0018:ffff880002c93a48  EFLAGS: 00010286
[    4.019408] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[    4.019409] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[    4.019411] RBP: 0000000000000006 R08: 0000000000000012 R09: ffff88007ba05c58
[    4.019412] R10: ffff88017a41ef00 R11: 00000000fffffffe R12: 0000000000000002
[    4.019414] R13: 0000000000000001 R14: ffff880002c93afc R15: 0000000000000002
[    4.019416] FS:  00007fea24c70740(0000) GS:ffff88017fd00000(0000) knlGS:0000000000000000
[    4.019418] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[    4.019420] CR2: 0000000000000028 CR3: 0000000002e95000 CR4: 00000000001006e0
[    4.019420] Stack:
[    4.019425]  0000000000000001 ffffffffa018f8cc ffff88017a41b840 0000000000000006
[    4.019428]  0000000000000002 ffffffffa018fc1f ffff88007ba05980 ffff880002e558c8
[    4.019431]  ffff88007ba06800 0000000000000012 0000000000000004 ffffffffa01f5f36
[    4.019432] Call Trace:
[    4.019443]  [<ffffffffa018f8cc>] ? snd_jack_new+0x8e/0x140 [snd]
[    4.019450]  [<ffffffffa018fc1f>] ? snd_jack_report+0x9d/0xd2 [snd]
[    4.019459]  [<ffffffffa01f5f36>] ? __snd_hda_jack_add_kctl+0xd2/0xe0 [snd_hda_codec]
[    4.019466]  [<ffffffffa01f6021>] ? add_jack_kctl+0xd3/0xf9 [snd_hda_codec]
[    4.019474]  [<ffffffffa01f60b2>] ? snd_hda_jack_add_kctls+0x6b/0x170 [snd_hda_codec]
[    4.019481]  [<ffffffffa021952d>] ? snd_hda_gen_build_controls+0x194/0x1a6 [snd_hda_codec_generic]
[    4.019492]  [<ffffffffa0232d39>] ? alc_build_controls+0x13/0x50 [snd_hda_codec_realtek]
[    4.019501]  [<ffffffffa01f5430>] ? snd_hda_codec_build_controls+0x34/0x1b6 [snd_hda_codec]
[    4.019508]  [<ffffffffa01f5832>] ? snd_hda_codec_build_pcms+0xff/0x175 [snd_hda_codec]
[    4.019515]  [<ffffffffa01f1417>] ? hda_codec_driver_probe+0xcb/0x118 [snd_hda_codec]
[    4.019521]  [<ffffffff813fc81b>] ? pm_runtime_barrier+0x59/0x85
[    4.019526]  [<ffffffff813f4fa0>] ? driver_probe_device+0xcd/0x225
[    4.019529]  [<ffffffff813f5180>] ? __driver_attach+0x53/0x73
[    4.019532]  [<ffffffff813f512d>] ? __device_attach+0x35/0x35
[    4.019535]  [<ffffffff813f3825>] ? bus_for_each_dev+0x6e/0x78
[    4.019539]  [<ffffffff813f4820>] ? bus_add_driver+0x101/0x1cb
[    4.019542]  [<ffffffff813f5775>] ? driver_register+0x83/0xbb
[    4.019545]  [<ffffffffa0240000>] ? 0xffffffffa0240000
[    4.019550]  [<ffffffff810002fd>] ? do_one_initcall+0xe2/0x161
[    4.019555]  [<ffffffff817a9be2>] ? preempt_schedule_common+0x1a/0x32
[    4.019558]  [<ffffffff817a9c0d>] ? _cond_resched+0x13/0x1a
[    4.019563]  [<ffffffff8110ebc3>] ? kmem_cache_alloc_trace+0x2a/0xfb
[    4.019566]  [<ffffffff817a32d1>] ? do_init_module+0x55/0x1b5
[    4.019572]  [<ffffffff810939ab>] ? load_module+0x149b/0x1973
[    4.019576]  [<ffffffff810914dc>] ? store_uevent+0x36/0x36
[    4.019583]  [<ffffffff817adc32>] ? page_fault+0x22/0x30
[    4.019587]  [<ffffffff81093f3e>] ? SyS_init_module+0xbb/0xea
[    4.019591]  [<ffffffff817ac217>] ? system_call_fastpath+0x12/0x6a
[    4.019624] Code: 01 00 00 bd 03 00 00 00 e9 06 fd ff ff 58 5b 5d 41 5c 41 5d 41 5e 41 5f c3 83 fe 1f 77 65 41 54 89 f0 55 53 48 89 fb 48 83 ec 10 <48> 0f a3 47 28 19 c0 85 c0 74 45 48 8d af f8 01 00 00 89 4c 24
[    4.019628] RIP  [<ffffffff81689d8d>] input_event+0x12/0x6b
[    4.019629]  RSP <ffff880002c93a48>
[    4.019630] CR2: 0000000000000028
[    4.019633] ---[ end trace 2ea81e31c7b5f5ad ]--

==Reproduce steps==
----------------------------
1, boot machine
Comment 1 ye.tian 2015-04-29 08:46:17 UTC
Created attachment 115432 [details]
dmesg info with the fixes kernel
Comment 2 Jani Nikula 2015-04-29 09:51:33 UTC
So nightly has one problem (NULL ptr dereference) and fixes has another (invalid opcode)?

The problem in nightly is not an i915 issue. Please try the sound-for-next branch of http://cgit.freedesktop.org/~jani/drm/. It countains v4.1-rc1 plus new audio related stuff that we merge to nightly. Please repeat a few times and see which problem we end up with.
Comment 3 ye.tian 2015-04-30 08:30:01 UTC
(In reply to Jani Nikula from comment #2)
> So nightly has one problem (NULL ptr dereference) and fixes has another
> (invalid opcode)?
> 
> The problem in nightly is not an i915 issue. Please try the sound-for-next
> branch of http://cgit.freedesktop.org/~jani/drm/. It countains v4.1-rc1 plus
> new audio related stuff that we merge to nightly. Please repeat a few times
> and see which problem we end up with.

Test sound-for-next branch of http://cgit.freedesktop.org/~jani/drm/ kernel, Test result is the same as the result of testing on nightly kernel.
Comment 4 Jani Nikula 2015-04-30 09:51:25 UTC
I dropped sound-next from drm-intel-nightly for now, closing.

Please reopen if you can still reproduce the NULL ptr dereference problem.

(In reply to ye.tian from comment #1)
> Created attachment 115432 [details]
> dmesg info with the fixes kernel

If this issue on drm-intel-fixes is still reproducible, please file a new bug.
Comment 5 ye.tian 2015-04-30 10:22:23 UTC
(In reply to Jani Nikula from comment #4)
> I dropped sound-next from drm-intel-nightly for now, closing.
> 
> Please reopen if you can still reproduce the NULL ptr dereference problem.
> 
> (In reply to ye.tian from comment #1)
> > Created attachment 115432 [details]
> > dmesg info with the fixes kernel
> 
> If this issue on drm-intel-fixes is still reproducible, please file a new
> bug.

Retest it on the latest nightly(014851e3) and latest fixes(a04f90a3), this problem does not exists.
So verified.
Comment 6 Jani Nikula 2015-04-30 10:49:30 UTC
For anyone stumbling here due to the backtrace, I reported this on alsa-devel: http://mid.gmane.org/87fv7i9bgu.fsf@intel.com
Comment 7 Keyon 2015-04-30 11:23:32 UTC
assign it to me, please.
Comment 8 Keyon 2015-04-30 12:21:55 UTC
Created attachment 115474 [details] [review]
patch to fix the issue

I just worked out a patch to fix it. no chance to test it yet.
will submit it to alsa once it passed testing.
Comment 9 Keyon 2015-05-04 00:46:19 UTC
thanks for reporting it, it is fixed now.
please merge latest alsa kernel to fix it.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.