Bug 109964 - [CI][BAT] igt@i915_module_load@reload-with-fault-injection - incomplete - RIP: 0010:drm_setup_crtcs
Summary: [CI][BAT] igt@i915_module_load@reload-with-fault-injection - incomplete - RIP...
Status: RESOLVED MOVED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: Other All
: highest normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2019-03-11 17:01 UTC by Martin Peres
Modified: 2019-11-29 18:10 UTC (History)
2 users (show)

See Also:
i915 platform: KBL
i915 features: display/Other


Attachments

Description Martin Peres 2019-03-11 17:01:48 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5718/fi-kbl-7560u/igt@i915_module_load@reload-with-fault-injection.html

<7> [352.833329] [drm:drm_helper_probe_single_connector_modes] [CONNECTOR:85:eDP-1] probed modes :
<7> [352.833337] [drm:drm_mode_debug_printmodeline] Modeline "3200x1800": 60 373250 3200 3248 3280 3360 1800 1803 1808 1852 0x48 0xa
<7> [352.833345] [drm:drm_mode_debug_printmodeline] Modeline "3200x1800": 48 298600 3200 3248 3280 3360 1800 1803 1808 1852 0x40 0xa
<4> [352.833357] general protection fault: 0000 [#1] PREEMPT SMP PTI
<4> [352.833363] CPU: 1 PID: 47 Comm: kworker/1:2 Tainted: G     U            5.0.0-CI-CI_DRM_5718+ #1
<4> [352.833369] Hardware name: Dell Inc. XPS 13 9360/0823VW, BIOS 2.3.1 10/03/2017
<4> [352.833393] Workqueue: events output_poll_execute
<4> [352.833399] RIP: 0010:drm_setup_crtcs+0x13c/0xc90
<4> [352.833403] Code: 00 00 45 8b 4f 58 45 85 c9 0f 8e f6 01 00 00 44 8b 64 24 20 44 8b 6c 24 28 31 db 31 ed 49 8b 47 68 48 63 d5 44 89 e6 83 c5 01 <48> 8b 04 d0 44 89 ea 48 8b 38 48 8b 87 b0 01 00 00 48 8b 40 20 e8
<4> [352.833415] RSP: 0018:ffffc90000237d40 EFLAGS: 00010202
<4> [352.833420] RAX: 6b6b6b6b6b6b6b6b RBX: 0000000000000002 RCX: 0000000000000000
<4> [352.833440] RDX: 0000000000000001 RSI: 0000000000000c80 RDI: 00000000ffffffff
<4> [352.833445] RBP: 0000000000000002 R08: 0000000000000000 R09: 0000000000000000
<4> [352.833451] R10: 0000000000000c80 R11: ffffffff82249a38 R12: 0000000000000c80
<4> [352.833456] R13: 0000000000000708 R14: ffff8882650f0390 R15: ffff8882624eafd8
<4> [352.833461] FS:  0000000000000000(0000) GS:ffff888276280000(0000) knlGS:0000000000000000
<4> [352.833467] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [352.833472] CR2: 000055724006ae70 CR3: 000000025cecc005 CR4: 00000000003606e0
<4> [352.833476] Call Trace:
<4> [352.833482]  ? drm_fb_helper_is_bound.isra.12+0x71/0xa0
<4> [352.833487]  drm_fb_helper_hotplug_event.part.24+0x84/0xb0
<4> [352.833492]  drm_kms_helper_hotplug_event+0x21/0x30
<4> [352.833496]  output_poll_execute+0x9d/0x1a0
<4> [352.833520]  process_one_work+0x245/0x610
<4> [352.833525]  worker_thread+0x37/0x380
<4> [352.833529]  ? process_one_work+0x610/0x610
<4> [352.833534]  kthread+0x119/0x130
<4> [352.833538]  ? kthread_park+0x80/0x80
<4> [352.833543]  ret_from_fork+0x3a/0x50
Comment 2 Chris Wilson 2019-06-04 16:32:07 UTC
commit 3b295cb1a411d9c82bbfaa66bc17a8508716ed07
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Mon Jun 3 14:58:57 2019 +0100

    drm: Flush output polling on shutdown
    
    We need to mark the output polling as disabled to prevent concurrent
    irqs from queuing new work as shutdown the probe -- causing that work to
    execute after we have freed the structs:
    
    <4> [341.846490] DEBUG_LOCKS_WARN_ON(mutex_is_locked(lock))
    <4> [341.846497] WARNING: CPU: 3 PID: 3300 at kernel/locking/mutex-debug.c:103 mutex_destroy+0x49/0x50
    <4> [341.846508] Modules linked in: i915(-) vgem thunderbolt snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic mei_hdcp x86_pkg_temp_thermal coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm mcs7830 btusb usbnet btrtl mii btbcm btintel bluetooth ecdh_generic ecc mei_me mei prime_numbers i2c_hid pinctrl_sunrisepoint pinctrl_intel [last unloaded: i915]
    <4> [341.846546] CPU: 3 PID: 3300 Comm: i915_module_loa Tainted: G     U            5.2.0-rc2-CI-CI_DRM_6175+ #1
    <4> [341.846553] Hardware name: Dell Inc. XPS 13 9360/0823VW, BIOS 2.9.0 07/09/2018
    <4> [341.846560] RIP: 0010:mutex_destroy+0x49/0x50
    <4> [341.846565] Code: 00 00 5b c3 e8 a8 9f 3b 00 85 c0 74 ed 8b 05 3e 55 23 01 85 c0 75 e3 48 c7 c6 00 d0 08 82 48 c7 c7 a8 aa 07 82 e8 e7 08 fa ff <0f> 0b eb cc 0f 1f 00 48 b8 11 11 11 11 11 11 11 11 48 89 76 20 48
    <4> [341.846578] RSP: 0018:ffffc900006cfdb0 EFLAGS: 00010286
    <4> [341.846583] RAX: 0000000000000000 RBX: ffff88826759a168 RCX: 0000000000000000
    <4> [341.846589] RDX: 0000000000000002 RSI: 0000000000000000 RDI: ffffffff8112844c
    <4> [341.846595] RBP: ffff8882708fa548 R08: 0000000000000000 R09: 0000000000039600
    <4> [341.846601] R10: 0000000000000000 R11: 0000000000000ce4 R12: ffffffffa07de1e0
    <4> [341.846607] R13: 0000000000000000 R14: 0000000000000000 R15: ffffffffa07de2d0
    <4> [341.846613] FS:  00007f62b5ae0e40(0000) GS:ffff888276380000(0000) knlGS:0000000000000000
    <4> [341.846620] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    <4> [341.846626] CR2: 000055a4e064f4a0 CR3: 0000000266b16006 CR4: 00000000003606e0
    <4> [341.846632] Call Trace:
    <4> [341.846639]  drm_fb_helper_fini.part.17+0xb3/0x100
    <4> [341.846682]  intel_fbdev_fini+0x20/0x80 [i915]
    <4> [341.846722]  intel_modeset_cleanup+0x9a/0x140 [i915]
    <4> [341.846750]  i915_driver_unload+0xa3/0x100 [i915]
    <4> [341.846778]  i915_pci_remove+0x19/0x30 [i915]
    <4> [341.846784]  pci_device_remove+0x36/0xb0
    <4> [341.846790]  device_release_driver_internal+0xd3/0x1b0
    <4> [341.846795]  driver_detach+0x3f/0x80
    <4> [341.846800]  bus_remove_driver+0x53/0xd0
    <4> [341.846805]  pci_unregister_driver+0x25/0xa0
    <4> [341.846843]  i915_exit+0x16/0x1c [i915]
    <4> [341.846849]  __se_sys_delete_module+0x162/0x210
    <4> [341.846855]  ? trace_hardirqs_off_thunk+0x1a/0x1c
    <4> [341.846859]  ? do_syscall_64+0xd/0x1c0
    <4> [341.846864]  do_syscall_64+0x55/0x1c0
    <4> [341.846869]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
    <4> [341.846875] RIP: 0033:0x7f62b51871b7
    <4> [341.846881] Code: 73 01 c3 48 8b 0d d1 8c 2c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 b8 b0 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a1 8c 2c 00 f7 d8 64 89 01 48
    <4> [341.846897] RSP: 002b:00007ffe7a227138 EFLAGS: 00000206 ORIG_RAX: 00000000000000b0
    <4> [341.846904] RAX: ffffffffffffffda RBX: 00007ffe7a2272b0 RCX: 00007f62b51871b7
    <4> [341.846910] RDX: 0000000000000001 RSI: 0000000000000800 RDI: 0000557cd6b55948
    <4> [341.846916] RBP: 0000557cd6b558e0 R08: 0000557cd6b5594c R09: 00007ffe7a227160
    <4> [341.846922] R10: 00007ffe7a226134 R11: 0000000000000206 R12: 0000000000000000
    <4> [341.846927] R13: 00007ffe7a227820 R14: 0000000000000000 R15: 0000000000000000
    <4> [341.846936] irq event stamp: 3547847
    <4> [341.846940] hardirqs last  enabled at (3547847): [<ffffffff819aad2c>] _raw_spin_unlock_irqrestore+0x4c/0x60
    <4> [341.846949] hardirqs last disabled at (3547846): [<ffffffff819aab9d>] _raw_spin_lock_irqsave+0xd/0x50
    <4> [341.846957] softirqs last  enabled at (3547376): [<ffffffff81c0033a>] __do_softirq+0x33a/0x4b9
    <4> [341.846966] softirqs last disabled at (3547367): [<ffffffff810b6379>] irq_exit+0xa9/0xc0
    <4> [341.846973] WARNING: CPU: 3 PID: 3300 at kernel/locking/mutex-debug.c:103 mutex_destroy+0x49/0x50
    <4> [341.846980] ---[ end trace ba94ca8952ba970e ]---
    <7> [341.866547] [drm:intel_dp_detect [i915]] MST support? port A: no, sink: no, modparam: yes
    <7> [341.890480] [drm:drm_add_display_info] non_desktop set to 0
    <7> [341.890530] [drm:drm_add_edid_modes] ELD: no CEA Extension found
    <7> [341.890537] [drm:drm_add_display_info] non_desktop set to 0
    <7> [341.890578] [drm:drm_helper_probe_single_connector_modes] [CONNECTOR:86:eDP-1] probed modes :
    <7> [341.890589] [drm:drm_mode_debug_printmodeline] Modeline "3200x1800": 60 373250 3200 3248 3280 3360 1800 1803 1808 1852 0x48 0xa
    <7> [341.890602] [drm:drm_mode_debug_printmodeline] Modeline "3200x1800": 48 298600 3200 3248 3280 3360 1800 1803 1808 1852 0x40 0xa
    <4> [341.890628] general protection fault: 0000 [#1] PREEMPT SMP PTI
    <4> [341.890636] CPU: 0 PID: 508 Comm: kworker/0:4 Tainted: G     U  W         5.2.0-rc2-CI-CI_DRM_6175+ #1
    <4> [341.890646] Hardware name: Dell Inc. XPS 13 9360/0823VW, BIOS 2.9.0 07/09/2018
    <4> [341.890655] Workqueue: events output_poll_execute
    <4> [341.890663] RIP: 0010:drm_setup_crtcs+0x13e/0xbe0
    <4> [341.890669] Code: 00 41 8b 44 24 58 85 c0 0f 8e f9 01 00 00 44 8b 6c 24 20 44 8b 74 24 28 31 db 31 ed 49 8b 44 24 60 48 63 d5 44 89 ee 83 c5 01 <48> 8b 04 d0 44 89 f2 48 8b 38 48 8b 87 88 01 00 00 48 8b 40 20 e8
    <4> [341.890686] RSP: 0018:ffffc9000033fd40 EFLAGS: 00010202
    <4> [341.890692] RAX: 6b6b6b6b6b6b6b6b RBX: 0000000000000002 RCX: 0000000000000000
    <4> [341.890700] RDX: 0000000000000001 RSI: 0000000000000c80 RDI: 00000000ffffffff
    <4> [341.890707] RBP: 0000000000000002 R08: 0000000000000000 R09: 0000000000000000
    <4> [341.890715] R10: 0000000000000c80 R11: 0000000000000000 R12: ffff888267599fe8
    <4> [341.890722] R13: 0000000000000c80 R14: 0000000000000708 R15: 0000000000000007
    <4> [341.890730] FS:  0000000000000000(0000) GS:ffff888276200000(0000) knlGS:0000000000000000
    <4> [341.890739] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    <4> [341.890745] CR2: 000055a4e064f4a0 CR3: 000000026d234003 CR4: 00000000003606f0
    <4> [341.890752] Call Trace:
    <4> [341.890760]  drm_fb_helper_hotplug_event.part.24+0x89/0xb0
    <4> [341.890768]  drm_kms_helper_hotplug_event+0x21/0x30
    <4> [341.890774]  output_poll_execute+0x9d/0x1a0
    <4> [341.890782]  process_one_work+0x245/0x610
    <4> [341.890790]  worker_thread+0x37/0x380
    <4> [341.890796]  ? process_one_work+0x610/0x610
    <4> [341.890802]  kthread+0x119/0x130
    <4> [341.890808]  ? kthread_park+0x80/0x80
    <4> [341.890815]  ret_from_fork+0x3a/0x50
    
    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109964
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Reviewed-by: Imre Deak <imre.deak@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20190603135910.15979-2-chris@chris-wilson.co.uk
Comment 3 Lakshmi 2019-11-18 09:48:00 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5273/fi-kbl-7560u/igt@i915_module_load@reload-with-fault-injection.html
<4> [473.885162] DEBUG_LOCKS_WARN_ON(mutex_is_locked(lock))
<4> [473.885168] WARNING: CPU: 2 PID: 3264 at kernel/locking/mutex-debug.c:103 mutex_destroy+0x47/0x50
<4> [473.885178] Modules linked in: i915(-) vgem cdc_ether usbnet r8152 mii snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic mei_hdcp thunderbolt x86_pkg_temp_thermal coretemp crct10dif_pclmul snd_intel_dspcfg crc32_pclmul snd_hda_codec snd_hwdep snd_hda_core ghash_clmulni_intel snd_pcm btusb btrtl btbcm btintel bluetooth ecdh_generic ecc mei_me mei prime_numbers i2c_hid pinctrl_sunrisepoint pinctrl_intel [last unloaded: i915]
<4> [473.885216] CPU: 2 PID: 3264 Comm: i915_module_loa Tainted: G     U            5.4.0-rc6-CI-CI_DRM_7317+ #1
<4> [473.885223] Hardware name: Dell Inc. XPS 13 9360/0823VW, BIOS 2.9.0 07/09/2018
<4> [473.885230] RIP: 0010:mutex_destroy+0x47/0x50
<4> [473.885234] Code: 84 c0 74 ed e8 0a 1e 3d 00 85 c0 74 e4 8b 05 30 2a 23 01 85 c0 75 da 48 c7 c6 bd a9 09 82 48 c7 c7 c0 7e 08 82 e8 39 e7 f9 ff <0f> 0b eb c3 0f 1f 44 00 00 48 b8 11 11 11 11 11 11 11 11 48 89 76
<4> [473.885247] RSP: 0018:ffffc900001bbdb0 EFLAGS: 00010286
<4> [473.885252] RAX: 0000000000000000 RBX: ffff8881c57f11e8 RCX: 0000000000000001
<4> [473.885257] RDX: 0000000080000001 RSI: 0000000000000000 RDI: ffffffff8112ac5c
<4> [473.885263] RBP: ffff888253113800 R08: 0000000000000000 R09: 0000000000038840
<4> [473.885269] R10: 0000000000000000 R11: 0000000000000cc0 R12: ffffffffa04731e0
<4> [473.885274] R13: 0000000000000000 R14: 0000000000000000 R15: ffffffffa04732d8
<4> [473.885280] FS:  00007fdb15028e40(0000) GS:ffff888276300000(0000) knlGS:0000000000000000
<4> [473.885287] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [473.885291] CR2: 0000555b806605c0 CR3: 0000000253ab0004 CR4: 00000000003606e0
<4> [473.885297] Call Trace:
<4> [473.885302]  drm_fb_helper_fini.part.8+0xb7/0x110
<4> [473.885350]  intel_fbdev_fini+0x20/0x60 [i915]
<4> [473.885393]  intel_modeset_driver_remove+0xa6/0x160 [i915]
<4> [473.885423]  i915_driver_remove+0xad/0x110 [i915]
<4> [473.885452]  i915_pci_remove+0x19/0x40 [i915]
<4> [473.885458]  pci_device_remove+0x36/0xb0
<4> [473.885464]  device_release_driver_internal+0xe0/0x1c0
<4> [473.885469]  driver_detach+0x42/0x80
<4> [473.885474]  bus_remove_driver+0x56/0xd0
<4> [473.885479]  pci_unregister_driver+0x25/0xa0
<4> [473.885519]  i915_exit+0x16/0x1c [i915]
<4> [473.885525]  __x64_sys_delete_module+0x115/0x1d0
<4> [473.885530]  ? trace_hardirqs_off_thunk+0x1a/0x20
<4> [473.885535]  ? do_syscall_64+0xd/0x210
<4> [473.885539]  do_syscall_64+0x4f/0x210
<4> [473.885544]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [473.885549] RIP: 0033:0x7fdb144c41b7
<4> [473.885554] Code: 73 01 c3 48 8b 0d d1 8c 2c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 b8 b0 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a1 8c 2c 00 f7 d8 64 89 01 48
<4> [473.885567] RSP: 002b:00007ffcc9bc7268 EFLAGS: 00000206 ORIG_RAX: 00000000000000b0
<4> [473.885573] RAX: ffffffffffffffda RBX: 00007ffcc9bc73e0 RCX: 00007fdb144c41b7
<4> [473.885579] RDX: 0000000000000001 RSI: 0000000000000800 RDI: 000055a1c8361168
<4> [473.885585] RBP: 000055a1c8361100 R08: 000055a1c836116c R09: 00007ffcc9bc7290
<4> [473.885590] R10: 00007ffcc9bc6264 R11: 0000000000000206 R12: 0000000000000000
<4> [473.885596] R13: 00007ffcc9bc7950 R14: 0000000000000000 R15: 0000000000000000
<4> [473.885604] irq event stamp: 4122737
<4> [473.885610] hardirqs last  enabled at (4122737): [<ffffffff814f9e37>] __free_object+0x257/0x310
<4> [473.885618] hardirqs last disabled at (4122736): [<ffffffff814f9c09>] __free_object+0x29/0x310
<4> [473.885625] softirqs last  enabled at (4122326): [<ffffffff81c00385>] __do_softirq+0x385/0x47f
<4> [473.885633] softirqs last disabled at (4122319): [<ffffffff810b7faa>] irq_exit+0xba/0xc0

After a gap of 5 months, it's happening again.
Comment 4 Jose Roberto de Souza 2019-11-27 02:18:40 UTC
Fix for the report above:
https://patchwork.freedesktop.org/series/70081/
Comment 5 Martin Peres 2019-11-29 18:10:22 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/intel/issues/243.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.