Bug 105863 - [CI] igt@kms_flip@2x-modeset-vs-vblank-race - dmesg-fail - BUG: unable to handle kernel NULL pointer dereference -IP: snd_hda_spdif_ctls_unassign+0x30/0x40 [snd_hda_codec]
Summary: [CI] igt@kms_flip@2x-modeset-vs-vblank-race - dmesg-fail - BUG: unable to han...
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: Other All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-04-03 13:05 UTC by Marta Löfstedt
Modified: 2018-10-05 14:11 UTC (History)
1 user (show)

See Also:
i915 platform: KBL
i915 features: display/audio


Attachments

Description Marta Löfstedt 2018-04-03 13:05:08 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_8/fi-kbl-7500u/igt@kms_flip@2x-modeset-vs-vblank-race.html

<7>[  507.508081] [drm:i915_audio_component_get_eld [i915]] Not valid for port C
<4>[  507.508127] irq event stamp: 5350905
<1>[  507.508129] BUG: unable to handle kernel NULL pointer dereference
<4>[  507.508154] hardirqs last  enabled at (5350905): [<00000000e7f8017c>] ktime_get_with_offset+0xff/0x120
<4>[  507.508161]  at 000000000000000c
<1>[  507.508184] IP: snd_hda_spdif_ctls_unassign+0x30/0x40 [snd_hda_codec]
<6>[  507.508189] PGD 0 P4D 0 
<4>[  507.508216] hardirqs last disabled at (5350904): [<000000003522daac>] ktime_get_with_offset+0x36/0x120
<4>[  507.508234] softirqs last  enabled at (5350878): [<00000000812e220d>] __do_softirq+0x32b/0x4e1
<4>[  507.508252] softirqs last disabled at (5350857): [<00000000ae03fd02>] irq_exit+0xa4/0xb0
<4>[  507.508287] Oops: 0002 [#1] PREEMPT SMP PTI
<0>[  507.508321] Dumping ftrace buffer:
<0>[  507.508333] ---------------------------------
<0>[  507.508567] CPU:1 [LOST 54908 EVENTS]
                    <idle>-0       1d.s2 482286194us : __i915_request_submit: bcs0 fence 286e:125 -> global_seqno 125
...
<0>[  507.567946] drv_modu-4022    2.... 506842908us : i915_request_retire: vcs0(11) fence 29ed:9, global_seqno 11
<0>[  507.567957] ---------------------------------
<4>[  507.567965] Modules linked in: snd_hda_intel i915 vgem snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic btusb btrtl x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul btbcm crc32_pclmul ghash_clmulni_intel snd_hda_codec btintel snd_hwdep snd_hda_core bluetooth snd_pcm ecdh_generic e1000e mei_me mei prime_numbers [last unloaded: i915]
<4>[  507.568071] CPU: 3 PID: 4167 Comm: kms_flip Tainted: G     U           4.16.0-rc7-g0539b52e05cd-drmtip_8+ #1
<4>[  507.568080] Hardware name: GIGABYTE GB-BKi7(H)A-7500/MFLP7AP-00, BIOS F7 06/28/2017
<4>[  507.568103] RIP: 0010:snd_hda_spdif_ctls_unassign+0x30/0x40 [snd_hda_codec]
<4>[  507.568113] RSP: 0018:ffffaaf9c044f958 EFLAGS: 00010206
<4>[  507.568125] RAX: 0000000000000000 RBX: 000000000000000c RCX: 0000000000000001
<4>[  507.568134] RDX: 00000000ffffffff RSI: 0000000000000000 RDI: ffff9868de40b9c0
<4>[  507.568143] RBP: ffff9868de40b3f8 R08: 000000002ca1bc00 R09: 0000000000000000
<4>[  507.568151] R10: ffffaaf9c044f958 R11: 0000000000000004 R12: ffff9868de40b9c0
<4>[  507.568160] R13: ffff986897931bf8 R14: ffff9868de40b3f8 R15: 0000000000000001
<4>[  507.568170] FS:  00007f6472561980(0000) GS:ffff9868edd80000(0000) knlGS:0000000000000000
<4>[  507.568179] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[  507.568188] CR2: 000000000000000c CR3: 0000000217998005 CR4: 00000000003606e0
<4>[  507.568195] Call Trace:
<4>[  507.568218]  update_eld+0x26a/0x480 [snd_hda_codec_hdmi]
<4>[  507.568358]  ? i915_audio_component_get_eld+0x13a/0x140 [i915]
<4>[  507.568379]  hdmi_present_sense+0xdb/0x370 [snd_hda_codec_hdmi]
<4>[  507.568399]  check_presence_and_report+0x59/0x80 [snd_hda_codec_hdmi]
<4>[  507.568536]  intel_audio_codec_disable+0xae/0xe0 [i915]
<4>[  507.568702]  intel_disable_ddi+0x78/0x120 [i915]
<4>[  507.568860]  intel_encoders_disable.isra.63+0x70/0x90 [i915]
<4>[  507.569003]  haswell_crtc_disable+0x42/0x140 [i915]
<4>[  507.569146]  intel_atomic_commit_tail+0x772/0xc50 [i915]
<4>[  507.569293]  intel_atomic_commit+0x240/0x320 [i915]
<4>[  507.569317]  ? drm_mode_getcrtc+0x170/0x170
<4>[  507.569337]  drm_atomic_helper_set_config+0x7b/0x90
<4>[  507.569357]  __drm_mode_set_config_internal+0x62/0x110
<4>[  507.569375]  drm_mode_setcrtc+0x3df/0x570
<4>[  507.569413]  ? drm_mode_getcrtc+0x170/0x170
<4>[  507.569426]  drm_ioctl_kernel+0x56/0xb0
<4>[  507.569442]  drm_ioctl+0x28f/0x330
<4>[  507.569456]  ? drm_mode_getcrtc+0x170/0x170
<4>[  507.569483]  do_vfs_ioctl+0x8c/0x690
<4>[  507.569498]  ? __fget+0xfc/0x1e0
<4>[  507.569513]  SyS_ioctl+0x36/0x70
<4>[  507.569527]  ? do_syscall_64+0x19/0x1b0
<4>[  507.569543]  do_syscall_64+0x6b/0x1b0
<4>[  507.569565]  entry_SYSCALL_64_after_hwframe+0x42/0xb7
<4>[  507.569580] RIP: 0033:0x7f64712af5d7
<4>[  507.569592] RSP: 002b:00007fff34538408 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
<4>[  507.569611] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 00007f64712af5d7
<4>[  507.569623] RDX: 00007fff34538440 RSI: 00000000c06864a2 RDI: 0000000000000003
<4>[  507.569635] RBP: 00007fff34538440 R08: 0000000000000000 R09: 0000000000000000
<4>[  507.569647] R10: 0000000000000000 R11: 0000000000000246 R12: 00000000c06864a2
<4>[  507.569659] R13: 0000000000000003 R14: 00007f647215e6c6 R15: 00007f647215fbb4
<4>[  507.569686] Code: c8 05 00 00 55 53 48 89 fd 89 f3 4c 89 e7 31 f6 e8 76 f5 30 e3 0f af 9d d0 06 00 00 48 8b 85 d8 06 00 00 ba ff ff ff ff 4c 89 e7 <66> 89 14 18 5b 5d 41 5c e9 a3 e9 30 e3 0f 1f 00 41 54 55 48 89 
<1>[  507.569998] RIP: snd_hda_spdif_ctls_unassign+0x30/0x40 [snd_hda_codec] RSP: ffffaaf9c044f958
<4>[  507.570008] CR2: 000000000000000c
<4>[  507.570025] ---[ end trace 531c5152a98f867f ]---
<3>[  507.807666] BUG: sleeping function called from invalid context at ./include/linux/percpu-rwsem.h:34
<3>[  507.807670] in_atomic(): 0, irqs_disabled(): 1, pid: 4167, name: kms_flip
<4>[  507.807671] INFO: lockdep is turned off.
<4>[  507.807673] irq event stamp: 143370
<4>[  507.807677] hardirqs last  enabled at (143369): [<00000000cc0330b8>] console_unlock+0x426/0x640
<4>[  507.807680] hardirqs last disabled at (143370): [<00000000592d91e0>] error_entry+0x78/0xf0
<4>[  507.807682] softirqs last  enabled at (143354): [<00000000812e220d>] __do_softirq+0x32b/0x4e1
<4>[  507.807684] softirqs last disabled at (143333): [<00000000ae03fd02>] irq_exit+0xa4/0xb0
<4>[  507.807687] CPU: 3 PID: 4167 Comm: kms_flip Tainted: G     UD          4.16.0-rc7-g0539b52e05cd-drmtip_8+ #1
<4>[  507.807688] Hardware name: GIGABYTE GB-BKi7(H)A-7500/MFLP7AP-00, BIOS F7 06/28/2017
<4>[  507.807689] Call Trace:
<4>[  507.807693]  dump_stack+0x67/0x95
<4>[  507.807696]  ___might_sleep+0x167/0x250
<4>[  507.807698]  exit_signals+0x1b/0x2a0
<4>[  507.807701]  do_exit+0x93/0xc90
<4>[  507.807704]  ? SyS_ioctl+0x36/0x70
<4>[  507.807706]  ? do_syscall_64+0x19/0x1b0
<4>[  507.807709]  rewind_stack_do_exit+0x17/0x20
Comment 1 Martin Peres 2018-04-16 19:26:57 UTC
Also seen on drmtip_14 on skl: https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_14/fi-skl-6260u/igt@kms_pipe_crc_basic@read-crc-pipe-c-frame-sequence.html

<0>[  119.363828] ---------------------------------
<4>[  119.363830] Modules linked in: snd_hda_intel i915 vgem snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic btusb btrtl btbcm btintel bluetooth x86_pkg_temp_thermal intel_powerclamp coretemp ecdh_generic crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_codec snd_hwdep snd_hda_core e1000e snd_pcm mei_me mei prime_numbers pinctrl_sunrisepoint pinctrl_intel [last unloaded: i915]
<4>[  119.363856] CPU: 2 PID: 1559 Comm: kms_pipe_crc_ba Tainted: G     U           4.16.0-rc7-ge023242a3eba-drmtip_14+ #1
<4>[  119.363857] Hardware name:  /NUC6i5SYB, BIOS SYSKLi35.86A.0057.2017.0119.1758 01/19/2017
<4>[  119.363863] RIP: 0010:snd_hda_spdif_ctls_unassign+0x30/0x40 [snd_hda_codec]
<4>[  119.363865] RSP: 0018:ffffb7f90088f990 EFLAGS: 00010206
<4>[  119.363867] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
<4>[  119.363869] RDX: 00000000ffffffff RSI: 0000000000000000 RDI: ffff93ce5f382870
<4>[  119.363871] RBP: ffff93ce5f3822a8 R08: ffff93ce610bb198 R09: 000000001d49e2c5
<4>[  119.363872] R10: ffffb7f90088f990 R11: ffff93ce610ba840 R12: ffff93ce5f382870
<4>[  119.363874] R13: ffff93ce5f3822a8 R14: ffff93ce5998c330 R15: 0000000000000000
<4>[  119.363876] FS:  00007fe54a6a2980(0000) GS:ffff93ce6ed00000(0000) knlGS:0000000000000000
<4>[  119.363878] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[  119.363880] CR2: 0000000000000000 CR3: 00000002596d2003 CR4: 00000000003606e0
<4>[  119.363881] Call Trace:
<4>[  119.363886]  update_eld+0x32a/0x460 [snd_hda_codec_hdmi]
<4>[  119.363918]  ? i915_audio_component_get_eld+0x141/0x150 [i915]
<4>[  119.363923]  hdmi_present_sense+0x22e/0x380 [snd_hda_codec_hdmi]
<4>[  119.363926]  ? __mutex_unlock_slowpath+0x48/0x2a0
<4>[  119.363930]  check_presence_and_report+0x59/0x80 [snd_hda_codec_hdmi]
<4>[  119.363962]  intel_audio_codec_disable+0xa6/0xd0 [i915]
<4>[  119.363996]  intel_disable_ddi+0xf3/0x120 [i915]
<4>[  119.364029]  intel_encoders_disable.isra.62+0x7a/0x80 [i915]
<4>[  119.364062]  haswell_crtc_disable+0x43/0x130 [i915]
<4>[  119.364096]  intel_atomic_commit_tail+0x20d/0xd10 [i915]
<4>[  119.364129]  intel_atomic_commit+0x24f/0x330 [i915]
<4>[  119.364134]  set_property_atomic+0xee/0x130
<4>[  119.364139]  drm_mode_obj_set_property_ioctl+0xe4/0x1a0
<4>[  119.364143]  ? drm_mode_connector_set_obj_prop+0x70/0x70
<4>[  119.364145]  drm_mode_connector_property_set_ioctl+0x39/0x60
<4>[  119.364148]  drm_ioctl_kernel+0xa3/0xe0
<4>[  119.364152]  drm_ioctl+0x2e2/0x380
<4>[  119.364155]  ? drm_mode_connector_set_obj_prop+0x70/0x70
<4>[  119.364160]  ? remove_vma+0x56/0x60
<4>[  119.364164]  do_vfs_ioctl+0x9a/0x6a0
<4>[  119.364167]  ? __task_pid_nr_ns+0xb9/0x1c0
<4>[  119.364170]  ? entry_SYSCALL_64_after_hwframe+0x52/0xb7
<4>[  119.364173]  SyS_ioctl+0x36/0x70
<4>[  119.364176]  do_syscall_64+0x65/0x1a0
<4>[  119.364180]  entry_SYSCALL_64_after_hwframe+0x42/0xb7
<4>[  119.364182] RIP: 0033:0x7fe549d525d7
<4>[  119.364183] RSP: 002b:00007ffd3b957978 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
<4>[  119.364186] RAX: ffffffffffffffda RBX: 000055d66ece3de0 RCX: 00007fe549d525d7
<4>[  119.364188] RDX: 00007ffd3b9579b0 RSI: 00000000c01064ab RDI: 0000000000000003
<4>[  119.364190] RBP: 00007ffd3b9579b0 R08: 0000000000000000 R09: 0000000000000000
<4>[  119.364191] R10: 0000000000000051 R11: 0000000000000246 R12: 00000000c01064ab
<4>[  119.364193] R13: 0000000000000003 R14: 0000000000000000 R15: 0000000000000000
<4>[  119.364197] Code: c8 05 00 00 55 53 48 89 fd 89 f3 4c 89 e7 31 f6 e8 96 bf 60 c4 0f af 9d d0 06 00 00 48 8b 85 d8 06 00 00 ba ff ff ff ff 4c 89 e7 <66> 89 14 18 5b 5d 41 5c e9 73 b3 60 c4 0f 1f 00 41 54 55 48 89 
<1>[  119.364245] RIP: snd_hda_spdif_ctls_unassign+0x30/0x40 [snd_hda_codec] RSP: ffffb7f90088f990
<4>[  119.364247] CR2: 0000000000000000
<4>[  119.364250] ---[ end trace e5e058b9642421ae ]---
Comment 2 Martin Peres 2018-06-15 08:54:31 UTC
Was seen on two runs (with 6 runs in between), then nothing since drmtip_14 (1 month, 4 weeks / 48 runs ago). Let's close the bug!
Comment 3 Jani Saarinen 2018-06-19 14:25:58 UTC
Closing, thanks.
Comment 4 felix 2018-10-05 13:20:19 UTC
Hi there,
we recently noticed this Bug on Boot on an Intel NUC7i3DNKE with Ubuntu 16.04 and Kernel 4.13.0-36-generic. 
This is nondeterministic and took us around 50 boots to reproduce. If this is really fixed is the driver version in Ubuntu 18.04 sufficient (meaning above drm_tip14)? I can't really make sense of the numbering in respect to versions.
Kind regards Felix

The trace:
 kernel: BUG: unable to handle kernel NULL pointer dereference at           (null)
 kernel: IP: snd_hda_spdif_ctls_unassign+0x3b/0x50 [snd_hda_codec]
 kernel: PGD 0 
 kernel: P4D 0 
 kernel: 
 kernel: Oops: 0002 [#1] SMP PTI
 kernel: Modules linked in: ccm arc4 snd_soc_skl(+) snd_soc_skl_ipc snd_soc_sst_ipc snd_hda_codec_hdmi(+) snd_soc_sst_dsp snd_hda_ext_core snd_soc_sst_match intel_rapl snd_soc_core x86_pkg_temp_thermal intel_powerclamp coretemp snd_compress ac97_bus snd_pcm_dmaengine kvm_intel snd_hda_intel iwlmvm snd_hda_codec kvm snd_hda_core mac80211 snd_hwdep irqbypass snd_pcm crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc snd_seq_midi iwlwifi snd_seq_midi_event snd_rawmidi snd_seq aesni_intel cfg80211 aes_x86_64 snd_seq_device snd_timer crypto_simd glue_helper cryptd intel_cstate intel_rapl_perf snd wmi_bmof btusb mei_me idma64 ucsi_acpi shpchp btrtl virt_dma mei intel_pch_thermal soundcore intel_lpss_pci typec_ucsi hci_uart typec btbcm serdev btqca btintel bluetooth ecdh_generic intel_lpss_acpi acpi_als
 kernel:  intel_lpss kfifo_buf tpm_crb industrialio acpi_pad mac_hid bridge stp llc autofs4 btrfs xor raid6_pq i915 i2c_algo_bit drm_kms_helper syscopyarea e1000e sysfillrect sysimgblt fb_sys_fops ahci ptp drm pps_core libahci wmi i2c_hid hid video pinctrl_sunrisepoint pinctrl_intel
 kernel: CPU: 0 PID: 494 Comm: Xorg Not tainted 4.13.0-36-generic #40~16.04.1-Ubuntu
 kernel: Hardware name: Intel Corporation NUC7i3DNKE/NUC7i3DNB, BIOS DNKBLi30.86A.0032.2017.1218.2051 12/18/2017
 kernel: task: ffff9bfd9e518000 task.stack: ffffaeaec1154000
 kernel: RIP: 0010:snd_hda_spdif_ctls_unassign+0x3b/0x50 [snd_hda_codec]
 kernel: RSP: 0018:ffffaeaec1157858 EFLAGS: 00010206
 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
 kernel: RDX: 00000000ffffffff RSI: 0000000000000000 RDI: ffff9bfda6745470
 kernel: RBP: ffffaeaec1157870 R08: ffff9bfda6745a10 R09: 0000000000000100
 kernel: R10: ffffffffc05db650 R11: ffff9bfda6745a10 R12: ffff9bfda6745000
 kernel: R13: ffff9bfda6745470 R14: ffff9bfda6745800 R15: ffff9bfda6745000
 kernel: FS:  00007fce8f5f7a00(0000) GS:ffff9bfdaec00000(0000) knlGS:0000000000000000
 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
 kernel: CR2: 0000000000000000 CR3: 0000000166bd6006 CR4: 00000000003606f0
 kernel: Call Trace:
 kernel:  update_eld+0x269/0x480 [snd_hda_codec_hdmi]
 kernel:  hdmi_present_sense+0x215/0x390 [snd_hda_codec_hdmi]
 kernel:  check_presence_and_report+0x60/0x90 [snd_hda_codec_hdmi]
 kernel:  intel_pin_eld_notify+0x6e/0x80 [snd_hda_codec_hdmi]
 kernel:  intel_audio_codec_disable+0xb1/0xe0 [i915]
 kernel:  intel_disable_ddi+0x40/0xd0 [i915]
 kernel:  intel_encoders_disable.isra.95+0x8e/0xa0 [i915]
 kernel:  haswell_crtc_disable+0x5a/0x170 [i915]
 kernel:  intel_atomic_commit_tail+0x153/0x10b0 [i915]
 kernel:  ? intel_atomic_commit_ready+0x75/0x7c [i915]
 kernel:  ? __i915_sw_fence_complete+0x101/0x1b0 [i915]
 kernel:  intel_atomic_commit+0x413/0x4b0 [i915]
 kernel:  ? drm_atomic_check_only+0x3d0/0x580 [drm]
 kernel:  ? drm_atomic_set_crtc_for_connector+0xe9/0x100 [drm]
 kernel:  drm_atomic_commit+0x51/0x60 [drm]
 kernel:  drm_atomic_helper_set_config+0x63/0xa0 [drm_kms_helper]
 kernel:  ? drm_modeset_lock+0x2f/0xa0 [drm]
 kernel:  __drm_mode_set_config_internal+0x6b/0x120 [drm]
 kernel:  drm_mode_setcrtc+0x4f6/0x640 [drm]
 kernel:  ? sock_recvmsg+0x43/0x50
 kernel:  ? drm_mode_getcrtc+0x180/0x180 [drm]
 kernel:  drm_ioctl_kernel+0x6b/0xb0 [drm]
 kernel:  drm_ioctl+0x3e4/0x450 [drm]
 kernel:  ? drm_mode_getcrtc+0x180/0x180 [drm]
 kernel:  do_vfs_ioctl+0xa4/0x600
 kernel:  ? __sys_recvmsg+0x51/0x90
 kernel:  ? __sys_recvmsg+0x80/0x90
 kernel:  SyS_ioctl+0x79/0x90
 kernel:  entry_SYSCALL_64_fastpath+0x24/0xab
 kernel: RIP: 0033:0x7fce8d441f47
 kernel: RSP: 002b:00007ffd069a67c8 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
 kernel: RAX: ffffffffffffffda RBX: 00007ffd069a57d0 RCX: 00007fce8d441f47
 kernel: RDX: 00007ffd069a6800 RSI: 00000000c06864a2 RDI: 000000000000000a
 kernel: RBP: 000055869cec5630 R08: 0000000000000001 R09: 0000000000000000
 kernel: R10: 0000000000000000 R11: 0000000000003246 R12: 000000000000001a
 kernel: R13: 0000000000000013 R14: 000055869cec5628 R15: 0000000000000012
 kernel: Code: 00 00 41 54 53 49 89 fc 4c 89 ef 89 f3 e8 1e e4 f3 f7 41 0f af 9c 24 b8 04 00 00 49 8b 84 24 c0 04 00 00 ba ff ff ff ff 4c 89 ef <66> 89 14 18 e8 3c de f3 f7 5b 41 5c 41 5d 5d c3 0f 1f 44 00 00 
 kernel: RIP: snd_hda_spdif_ctls_unassign+0x3b/0x50 [snd_hda_codec] RSP: ffffaeaec1157858
 kernel: CR2: 0000000000000000
 kernel: ---[ end trace a5378192d12b75de ]---
Comment 5 Lakshmi 2018-10-05 14:11:37 UTC
We haven't seen this issue since dmtip_14.
Please try to reproduce the error using drm-tip (https://cgit.freedesktop.org/drm-tip) with kernel parameters drm.debug=0x1e log_buf_len=4M, and if the problem persists attach the full dmesg from boot.

If you see the same issue with latest drmtip, you can reopen this issue.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.