Bug 112156 - [CI][SHARDS]igt@i915_selftest@mock_requests - incomplete - GEM_BUG_ON(!list_empty(&timelines->hwsp_free_list))
Summary: [CI][SHARDS]igt@i915_selftest@mock_requests - incomplete - GEM_BUG_ON(!list_e...
Status: RESOLVED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: Other All
: high major
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2019-10-28 16:00 UTC by Lakshmi
Modified: 2019-11-20 17:26 UTC (History)
1 user (show)

See Also:
i915 platform: SKL
i915 features: GEM/Other


Attachments

Description Lakshmi 2019-10-28 16:00:40 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7187/shard-skl7/igt@i915_selftest@mock_requests.html
<0> [102.469071] i915_sel-1024    1.... 93323517us : intel_timelines_fini: timelines_fini:570 GEM_BUG_ON(!list_empty(&timelines->hwsp_free_list))
<0> [102.469149] ---------------------------------
<4> [102.470776] ---[ end trace 191ff9a68b9f6389 ]---
<3> [102.549017] BUG: sleeping function called from invalid context at kernel/sched/completion.c:99
<3> [102.549120] in_atomic(): 0, irqs_disabled(): 0, non_block: 0, pid: 1024, name: i915_selftest
<4> [102.549211] INFO: lockdep is turned off.
<3> [102.549262] Preemption disabled at:
<4> [102.549275] [<0000000000000000>] 0x0
<4> [102.549369] CPU: 0 PID: 1024 Comm: i915_selftest Tainted: G     UD           5.4.0-rc4-CI-CI_DRM_7187+ #1
<4> [102.549466] Hardware name: Google Caroline/Caroline, BIOS MrChromebox 08/27/2018
<4> [102.549545] Call Trace:
<4> [102.549602]  dump_stack+0x67/0x9b
<4> [102.549656]  ___might_sleep+0x178/0x260
<4> [102.549718]  wait_for_completion+0x37/0x1a0
<4> [102.549794]  virt_efi_query_variable_info+0x161/0x1b0
<4> [102.549862]  efi_query_variable_store+0xb3/0x1a0
<4> [102.549941]  ? efivar_entry_set_safe+0x19c/0x220
<4> [102.549997]  ? efi_delete_dummy_variable+0x90/0x90
<4> [102.550060]  efivar_entry_set_safe+0x19c/0x220
<4> [102.550139]  ? efi_pstore_write+0x10b/0x150
<4> [102.550199]  efi_pstore_write+0x10b/0x150
<4> [102.550304]  pstore_dump+0x127/0x340
<4> [102.550393]  kmsg_dump+0x87/0x1c0
<4> [102.550455]  oops_end+0x3e/0x90
<4> [102.550503]  do_trap+0x80/0x100
<4> [102.550961]  ? intel_timelines_fini+0xb4/0xc0 [i915]
<4> [102.551033]  do_invalid_op+0x23/0x30
<4> [102.551523]  ? intel_timelines_fini+0xb4/0xc0 [i915]
<4> [102.551590]  invalid_op+0x23/0x30
<4> [102.551995] RIP: 0010:intel_timelines_fini+0xb4/0xc0 [i915]
<4> [102.552067] Code: 9b 48 ec e0 48 8b 35 23 aa 23 00 49 c7 c0 08 c4 3c a0 b9 3a 02 00 00 48 c7 c2 90 1d 3a a0 48 c7 c7 5f 67 26 a0 e8 6c 44 f3 e0 <0f> 0b 66 2e 0f 1f 84 00 00 00 00 00 55 48 8d 6f 10 53 48 c7 c2 70
<4> [102.552238] RSP: 0018:ffffc9000033fc08 EFLAGS: 00010286
<4> [102.552308] RAX: 000000000000000f RBX: ffff88816aa20978 RCX: 0000000000000000
<4> [102.552381] RDX: 0000000000000001 RSI: 0000000000000008 RDI: ffff88817a33cc00
<4> [102.552452] RBP: ffff88816aa2d3e0 R08: 0000000000001ecc R09: ffff888155f59000
<4> [102.552521] R10: 0000000000000000 R11: ffff88817a33cc00 R12: ffff88816aa2d3e0
<4> [102.552591] R13: 00000000ffffffff R14: ffff88816f0cb0e8 R15: ffffc9000033fe88
<4> [102.553080]  ? intel_timelines_fini+0xb4/0xc0 [i915]
<4> [102.553544]  mock_device_release+0x92/0x110 [i915]
<4> [102.554016]  i915_request_mock_selftests+0x33/0x80 [i915]
<4> [102.554508]  __run_selftests+0x134/0x170 [i915]
<4> [102.554575]  ? 0xffffffffa0525000
<4> [102.555029]  i915_mock_selftests+0x27/0x50 [i915]
<4> [102.555426]  i915_init+0x12/0x73 [i915]
<4> [102.555485]  ? 0xffffffffa0525000
<4> [102.555535]  do_one_initcall+0x58/0x2ff
<4> [102.555597]  ? rcu_read_lock_sched_held+0x4d/0x80
<4> [102.555665]  ? kmem_cache_alloc_trace+0x290/0x2c0
<4> [102.555736]  do_init_module+0x56/0x1f8
<4> [102.555795]  load_module+0x243e/0x29f0
<4> [102.555902]  ? __do_sys_finit_module+0xe9/0x110
<4> [102.555959]  __do_sys_finit_module+0xe9/0x110
<4> [102.556047]  do_syscall_64+0x4f/0x210
<4> [102.556108]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [102.556172] RIP: 0033:0x7f32eb4e6839
<4> [102.556226] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [102.556392] RSP: 002b:00007ffe37bdd308 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [102.556484] RAX: ffffffffffffffda RBX: 000055fa252b8d90 RCX: 00007f32eb4e6839
<4> [102.556555] RDX: 0000000000000000 RSI: 000055fa252c37a0 RDI: 0000000000000006
<4> [102.556626] RBP: 000055fa252c37a0 R08: 0000000000000004 R09: 000055fa252bd400
<4> [102.556695] R10: 00007ffe37bdd450 R11: 0000000000000246 R12: 0000000000000000
<4> [102.556766] R13: 000055fa252b5000 R14: 0000000000000020 R15: 0000000000000047
<4> [102.556974] ------------[ cut here ]------------
<4> [102.557133] WARNING: CPU: 0 PID: 1024 at kernel/rcu/tree_plugin.h:293 rcu_note_context_switch+0x7e/0x650
<4> [102.557227] Modules linked in: i915(+) snd_hda_codec_hdmi snd_intel_nhlt x86_pkg_temp_thermal snd_hda_codec coretemp crct10dif_pclmul crc32_pclmul snd_hwdep snd_hda_core ghash_clmulni_intel btusb btrtl btbcm btintel snd_pcm cdc_ether usbnet bluetooth r8152 mii ecdh_generic ecc i2c_hid pinctrl_sunrisepoint pinctrl_intel prime_numbers [last unloaded: i915]
<4> [102.557634] CPU: 0 PID: 1024 Comm: i915_selftest Tainted: G     UD W         5.4.0-rc4-CI-CI_DRM_7187+ #1
<4> [102.557721] Hardware name: Google Caroline/Caroline, BIOS MrChromebox 08/27/2018
<4> [102.557800] RIP: 0010:rcu_note_context_switch+0x7e/0x650
<4> [102.557858] Code: 74 17 65 48 8b 04 25 00 5f 01 00 8b 88 8c 08 00 00 85 c9 0f 84 96 03 00 00 45 84 ed 41 8b 84 24 80 03 00 00 75 69 85 c0 7e 11 <0f> 0b 41 80 bc 24 84 03 00 00 00 0f 84 a2 01 00 00 4c 89 e7 e8 89
<4> [102.558019] RSP: 0018:ffffc9000033f678 EFLAGS: 00010002
<4> [102.558078] RAX: 0000000000000001 RBX: ffff88817aa39ec0 RCX: 0000000000000000
<4> [102.558146] RDX: 0000000000000002 RSI: 0000000000000000 RDI: 0000000000000000
<4> [102.558216] RBP: ffffc9000033f710 R08: 0000000000000000 R09: 0000000000000000
<4> [102.558287] R10: 0000000000000000 R11: 0000000000895440 R12: ffff88816c90d040
<4> [102.558363] R13: 0000000000000000 R14: ffff88817aa39198 R15: ffff8881771d4000
<4> [102.558444] FS:  00007f32ebe2fe40(0000) GS:ffff88817aa00000(0000) knlGS:0000000000000000
<4> [102.558529] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [102.558593] CR2: 00007fd25ad5b090 CR3: 0000000155aac002 CR4: 00000000003606f0
<4> [102.558670] Call Trace:
<4> [102.558733]  __schedule+0xd0/0x7f0
<4> [102.558794]  ? x2apic_send_IPI_allbutself+0x10/0x10
<4> [102.558855]  ? check_preempt_wakeup+0x119/0x230
<4> [102.558917]  ? wait_for_completion+0x108/0x1a0
<4> [102.558975]  schedule+0x34/0xc0
<4> [102.559025]  schedule_timeout+0x225/0x3f0
<4> [102.559088]  ? wait_for_completion+0x3f/0x1a0
<4> [102.559156]  ? wait_for_completion+0x108/0x1a0
<4> [102.559211]  wait_for_completion+0x130/0x1a0
<4> [102.559269]  ? wake_up_q+0x70/0x70
<4> [102.559330]  virt_efi_set_variable+0x151/0x1a0
<4> [102.559395]  efivar_entry_set_safe+0x115/0x220
<4> [102.559469]  ? efi_pstore_write+0x10b/0x150
<4> [102.559522]  efi_pstore_write+0x10b/0x150
<4> [102.559618]  pstore_dump+0x127/0x340
<4> [102.559704]  kmsg_dump+0x87/0x1c0
<4> [102.559765]  oops_end+0x3e/0x90
<4> [102.559816]  do_trap+0x80/0x100
<4> [102.560252]  ? intel_timelines_fini+0xb4/0xc0 [i915]
<4> [102.560325]  do_invalid_op+0x23/0x30
<4> [102.560753]  ? intel_timelines_fini+0xb4/0xc0 [i915]
<4> [102.560811]  invalid_op+0x23/0x30
<4> [102.561204] RIP: 0010:intel_timelines_fini+0xb4/0xc0 [i915]
<4> [102.561275] Code: 9b 48 ec e0 48 8b 35 23 aa 23 00 49 c7 c0 08 c4 3c a0 b9 3a 02 00 00 48 c7 c2 90 1d 3a a0 48 c7 c7 5f 67 26 a0 e8 6c 44 f3 e0 <0f> 0b 66 2e 0f 1f 84 00 00 00 00 00 55 48 8d 6f 10 53 48 c7 c2 70
<4> [102.561433] RSP: 0018:ffffc9000033fc08 EFLAGS: 00010286
<4> [102.561493] RAX: 000000000000000f RBX: ffff88816aa20978 RCX: 0000000000000000
<4> [102.561562] RDX: 0000000000000001 RSI: 0000000000000008 RDI: ffff88817a33cc00
<4> [102.561633] RBP: ffff88816aa2d3e0 R08: 0000000000001ecc R09: ffff888155f59000
<4> [102.561703] R10: 0000000000000000 R11: ffff88817a33cc00 R12: ffff88816aa2d3e0
<4> [102.561772] R13: 00000000ffffffff R14: ffff88816f0cb0e8 R15: ffffc9000033fe88
<4> [102.562270]  ? intel_timelines_fini+0xb4/0xc0 [i915]
<4> [102.562748]  mock_device_release+0x92/0x110 [i915]
<4> [102.563240]  i915_request_mock_selftests+0x33/0x80 [i915]
<4> [102.563744]  __run_selftests+0x134/0x170 [i915]
<4> [102.563811]  ? 0xffffffffa0525000
<4> [102.564265]  i915_mock_selftests+0x27/0x50 [i915]
<4> [102.564669]  i915_init+0x12/0x73 [i915]
<4> [102.564723]  ? 0xffffffffa0525000
<4> [102.564774]  do_one_initcall+0x58/0x2ff
<4> [102.564837]  ? rcu_read_lock_sched_held+0x4d/0x80
<4> [102.564908]  ? kmem_cache_alloc_trace+0x290/0x2c0
<4> [102.564978]  do_init_module+0x56/0x1f8
<4> [102.565036]  load_module+0x243e/0x29f0
<4> [102.565144]  ? __do_sys_finit_module+0xe9/0x110
<4> [102.565201]  __do_sys_finit_module+0xe9/0x110
<4> [102.565289]  do_syscall_64+0x4f/0x210
<4> [102.565346]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [102.565405] RIP: 0033:0x7f32eb4e6839
<4> [102.565454] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [102.565613] RSP: 002b:00007ffe37bdd308 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [102.565698] RAX: ffffffffffffffda RBX: 000055fa252b8d90 RCX: 00007f32eb4e6839
<4> [102.565767] RDX: 0000000000000000 RSI: 000055fa252c37a0 RDI: 0000000000000006
<4> [102.565840] RBP: 000055fa252c37a0 R08: 0000000000000004 R09: 000055fa252bd400
<4> [102.565919] R10: 00007ffe37bdd450 R11: 0000000000000246 R12: 0000000000000000
<4> [102.565997] R13: 000055fa252b5000 R14: 0000000000000020 R15: 0000000000000047
<4> [102.566103] irq event stamp: 455395
<4> [102.566156] hardirqs last  enabled at (455395): [<ffffffff8101dcc5>] do_error_trap+0xa5/0x100
<4> [102.566246] hardirqs last disabled at (455394): [<ffffffff81001c0a>] trace_hardirqs_off_thunk+0x1a/0x20
<4> [102.566342] softirqs last  enabled at (455344): [<ffffffff81c00385>] __do_softirq+0x385/0x47f
<4> [102.566433] softirqs last disabled at (455333): [<ffffffff810b7efa>] irq_exit+0xba/0xc0
<4> [102.566511] ---[ end trace 191ff9a68b9f638a ]---
Comment 1 CI Bug Log 2019-10-28 16:02:10 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* SKL: igt@i915_selftest@mock_requests - incomplete - GEM_BUG_ON(!list_empty(&amp;timelines-&gt;hwsp_free_list))
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7187/shard-skl7/igt@i915_selftest@mock_requests.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7192/shard-skl5/igt@i915_selftest@mock_requests.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_5223/shard-skl8/igt@i915_selftest@mock_requests.html
Comment 2 Caz.Yokoyama 2019-10-29 15:02:25 UTC
<0> [102.469071] i915_sel-1024    1.... 93323517us : intel_timelines_fini: timelines_fini:570 GEM_BUG_ON(!list_empty(&timelines->hwsp_free_list))
-------
This is actually the following code in drivers/gpu/drm/i915/gt/intel_timeline.c.

static void timelines_fini(struct intel_gt *gt)
{
	struct intel_gt_timelines *timelines = &gt->timelines;

	GEM_BUG_ON(!list_empty(&timelines->active_list));
	GEM_BUG_ON(!list_empty(&timelines->hwsp_free_list));
Comment 3 Chris Wilson 2019-11-20 17:26:18 UTC
Again, should be fixed by updates to intel_gt_wait_for_idle.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.