Bug 110546 - [CI][SHARDS]gt@i915_selftest@mock_contexts - incomplete - WARN_ON(debug_locks && !lock_is_held(&(&request->i915->drm.struct_mutex)->dep_map))
Summary: [CI][SHARDS]gt@i915_selftest@mock_contexts - incomplete - WARN_ON(debug_locks...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: Other All
: highest normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2019-04-29 10:50 UTC by Lakshmi
Modified: 2019-09-30 13:07 UTC (History)
1 user (show)

See Also:
i915 platform: BXT, SKL
i915 features: GEM/Other


Attachments

Description Lakshmi 2019-04-29 10:50:46 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6008/shard-skl9/igt@i915_selftest@mock_contexts.html

<6> [170.531385] [IGT] i915_selftest: executing
<6> [170.597109] [IGT] i915_selftest: starting subtest mock_contexts
<5> [171.141892] Setting dangerous option enable_psr - tainting kernel
<5> [171.141932] Setting dangerous option mock_selftests - tainting kernel
<6> [171.315893] i915: Performing mock selftests with st_random_seed=0xce114e90 st_timeout=1000
<6> [171.315917] i915: Running contexts
<6> [171.329177] i915: Running i915_gem_context_mock_selftests/mock_context_barrier
<3> [171.335672] context_barrier_task():"Did not retire immediately for all inactive engines\n"
<3> [171.335700] i915/i915_gem_context_mock_selftests: mock_context_barrier failed with error -22
<4> [171.339672] ------------[ cut here ]------------
<4> [171.339694] WARN_ON(debug_locks && !lock_is_held(&(&request->i915->drm.struct_mutex)->dep_map))
<4> [171.340268] WARNING: CPU: 2 PID: 1121 at drivers/gpu/drm/i915/i915_request.c:251 i915_request_retire+0x660/0xb40 [i915]
<4> [171.340285] Modules linked in: i915(+) snd_hda_codec_hdmi snd_hda_codec x86_pkg_temp_thermal coretemp snd_hwdep crct10dif_pclmul btusb crc32_pclmul btrtl snd_hda_core btbcm btintel ghash_clmulni_intel bluetooth snd_pcm cdc_ether usbnet r8152 mii ecdh_generic i2c_hid pinctrl_sunrisepoint pinctrl_intel prime_numbers [last unloaded: i915]
<4> [171.340417] CPU: 2 PID: 1121 Comm: i915_selftest Tainted: G     U            5.1.0-rc6-CI-CI_DRM_6008+ #1
<4> [171.340433] Hardware name: Google Caroline/Caroline, BIOS MrChromebox 08/27/2018
<4> [171.340904] RIP: 0010:i915_request_retire+0x660/0xb40 [i915]
<4> [171.340927] Code: ff ff ff 48 8d b8 c8 00 00 00 e8 eb 42 b6 e0 85 c0 0f 85 7c fa ff ff 48 c7 c6 18 c6 6b a0 48 c7 c7 ea 46 6e a0 e8 a0 d8 af e0 <0f> 0b 8b 83 38 01 00 00 85 c0 0f 8e 67 fa ff ff 48 c7 c1 98 c6 6b
<4> [171.340946] RSP: 0018:ffffc90000477b58 EFLAGS: 00010286
<4> [171.340968] RAX: 0000000000000000 RBX: ffff88816b25b658 RCX: 0000000000000002
<4> [171.340985] RDX: 0000000000000006 RSI: ffff888173aa58b8 RDI: ffffffff82120401
<4> [171.341003] RBP: ffffc90000477b90 R08: 0000000031028bf8 R09: 0000000000000000
<4> [171.341021] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88815e6ebcd8
<4> [171.341039] R13: ffffffffa05b0578 R14: 000000002444c748 R15: ffffc90000477e98
<4> [171.341059] FS:  00007f19492db980(0000) GS:ffff88817ab00000(0000) knlGS:0000000000000000
<4> [171.341076] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [171.341092] CR2: 00007f0545051dc0 CR3: 000000015ffd2002 CR4: 00000000003606e0
<4> [171.341107] Call Trace:
<4> [171.341586]  ? ring_retire_requests+0xa8/0x160 [i915]
<4> [171.342067]  ring_retire_requests+0x86/0x160 [i915]
<4> [171.342567]  i915_retire_requests+0x58/0xc0 [i915]
<4> [171.343038]  mock_device_flush+0x44/0x80 [i915]
<4> [171.343513]  mock_device_release+0x26/0x1c0 [i915]
<4> [171.343983]  i915_gem_context_mock_selftests+0x34/0x40 [i915]
<4> [171.344486]  __run_selftests+0x134/0x170 [i915]
<4> [171.344519]  ? 0xffffffffa07e4000
<4> [171.345020]  i915_mock_selftests+0x27/0x50 [i915]
<4> [171.345414]  i915_init+0x12/0x73 [i915]
<4> [171.345443]  ? 0xffffffffa07e4000
<4> [171.345468]  do_one_initcall+0x58/0x2e0
<4> [171.345494]  ? do_init_module+0x1d/0x1ea
<4> [171.345529]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [171.345551]  ? kmem_cache_alloc_trace+0x261/0x290
<4> [171.345599]  do_init_module+0x56/0x1ea
<4> [171.345634]  load_module+0x2701/0x29e0
<4> [171.345798]  ? __se_sys_finit_module+0xd3/0xf0
<4> [171.345820]  __se_sys_finit_module+0xd3/0xf0
<4> [171.345919]  do_syscall_64+0x55/0x190
<4> [171.345955]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [171.345977] RIP: 0033:0x7f1948b8f839
<4> [171.346017] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [171.346036] RSP: 002b:00007ffe2c28bce8 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [171.346058] RAX: ffffffffffffffda RBX: 00005600522d6d10 RCX: 00007f1948b8f839
<4> [171.346073] RDX: 0000000000000000 RSI: 00005600522dbdb0 RDI: 0000000000000006
<4> [171.346088] RBP: 00005600522dbdb0 R08: 0000000000000004 R09: 0000000000000000
<4> [171.346104] R10: 00007ffe2c28be60 R11: 0000000000000246 R12: 0000000000000000
<4> [171.346121] R13: 00005600522dae20 R14: 0000000000000020 R15: 0000000000000047
<4> [171.346207] irq event stamp: 285042
<4> [171.346237] hardirqs last  enabled at (285041): [<ffffffff811269c4>] vprintk_emit+0x124/0x320
<4> [171.346261] hardirqs last disabled at (285042): [<ffffffff810019b0>] trace_hardirqs_off_thunk+0x1a/0x1c
<4> [171.346283] softirqs last  enabled at (285030): [<ffffffff81c0033a>] __do_softirq+0x33a/0x4b9
<4> [171.346309] softirqs last disabled at (285023): [<ffffffff810b5519>] irq_exit+0xa9/0xc0
<4> [171.346792] WARNING: CPU: 2 PID: 1121 at drivers/gpu/drm/i915/i915_request.c:251 i915_request_retire+0x660/0xb40 [i915]
<4> [171.346815] ---[ end trace e82a1ff33eb2ab5b ]---
<3> [171.347017] i915_request_retire:252 GEM_BUG_ON(!i915_sw_fence_signaled(&request->submit))
<4> [171.347773] ------------[ cut here ]------------
<2> [171.347790] kernel BUG at drivers/gpu/drm/i915/i915_request.c:252!
<4> [171.347991] invalid opcode: 0000 [#1] PREEMPT SMP PTI
<4> [171.348051] CPU: 2 PID: 1121 Comm: i915_selftest Tainted: G     U  W         5.1.0-rc6-CI-CI_DRM_6008+ #1
<4> [171.348137] Hardware name: Google Caroline/Caroline, BIOS MrChromebox 08/27/2018
<4> [171.348672] RIP: 0010:i915_request_retire+0x6b5/0xb40 [i915]
<4> [171.348736] Code: da 70 b7 e0 48 8b 35 f2 4c 1c 00 49 c7 c0 98 c6 6b a0 b9 fc 00 00 00 48 c7 c2 80 45 69 a0 48 c7 c7 00 00 5b a0 e8 3b 17 be e0 <0f> 0b 48 8b 43 40 49 89 c6 e8 2d 66 b8 e0 68 47 00 5b a0 45 31 c9
<4> [171.348888] RSP: 0018:ffffc90000477b58 EFLAGS: 00010282
<4> [171.348947] RAX: 0000000000000011 RBX: ffff88816b25b658 RCX: 0000000000000000
<4> [171.349013] RDX: 0000000000000001 RSI: 0000000000000008 RDI: ffff88817a1a8ff8
<4> [171.349080] RBP: ffffc90000477b90 R08: 0000000000003bef R09: ffff888179a64000
<4> [171.349148] R10: 0000000000000000 R11: ffff88817a1a8ff8 R12: ffff88815e6ebcd8
<4> [171.349215] R13: ffffffffa05b0578 R14: 000000002444c748 R15: ffffc90000477e98
<4> [171.349289] FS:  00007f19492db980(0000) GS:ffff88817ab00000(0000) knlGS:0000000000000000
<4> [171.349368] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [171.349426] CR2: 00007f0545051dc0 CR3: 000000015ffd2002 CR4: 00000000003606e0
<4> [171.349492] Call Trace:
<4> [171.349983]  ? ring_retire_requests+0xa8/0x160 [i915]
<4> [171.350487]  ring_retire_requests+0x86/0x160 [i915]
<4> [171.351000]  i915_retire_requests+0x58/0xc0 [i915]
<4> [171.351499]  mock_device_flush+0x44/0x80 [i915]
<4> [171.352011]  mock_device_release+0x26/0x1c0 [i915]
<4> [171.352484]  i915_gem_context_mock_selftests+0x34/0x40 [i915]
<4> [171.353014]  __run_selftests+0x134/0x170 [i915]
<4> [171.353073]  ? 0xffffffffa07e4000
<4> [171.353584]  i915_mock_selftests+0x27/0x50 [i915]
<4> [171.354013]  i915_init+0x12/0x73 [i915]
<4> [171.354063]  ? 0xffffffffa07e4000
<4> [171.354110]  do_one_initcall+0x58/0x2e0
<4> [171.354163]  ? do_init_module+0x1d/0x1ea
<4> [171.354218]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [171.354275]  ? kmem_cache_alloc_trace+0x261/0x290
<4> [171.354341]  do_init_module+0x56/0x1ea
<4> [171.354396]  load_module+0x2701/0x29e0
<4> [171.354498]  ? __se_sys_finit_module+0xd3/0xf0
<4> [171.354551]  __se_sys_finit_module+0xd3/0xf0
<4> [171.354632]  do_syscall_64+0x55/0x190
<4> [171.354687]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [171.354745] RIP: 0033:0x7f1948b8f839
<4> [171.354791] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [171.354945] RSP: 002b:00007ffe2c28bce8 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [171.355025] RAX: ffffffffffffffda RBX: 00005600522d6d10 RCX: 00007f1948b8f839
<4> [171.355096] RDX: 0000000000000000 RSI: 00005600522dbdb0 RDI: 0000000000000006
<4> [171.355163] RBP: 00005600522dbdb0 R08: 0000000000000004 R09: 0000000000000000
<4> [171.355231] R10: 00007ffe2c28be60 R11: 0000000000000246 R12: 0000000000000000
<4> [171.355299] R13: 00005600522dae20 R14: 0000000000000020 R15: 0000000000000047
<4> [171.355396] Modules linked in: i915(+) snd_hda_codec_hdmi snd_hda_codec x86_pkg_temp_thermal coretemp snd_hwdep crct10dif_pclmul btusb crc32_pclmul btrtl snd_hda_core btbcm btintel ghash_clmulni_intel bluetooth snd_pcm cdc_ether usbnet r8152 mii ecdh_generic i2c_hid pinctrl_sunrisepoint pinctrl_intel prime_numbers [last unloaded: i915]
<0> [171.355708] Dumping ftrace buffer:
<0> [171.355752] ---------------------------------
<0> [171.355866] CPU:3 [LOST 6285 EVENTS]
Comment 1 CI Bug Log 2019-04-29 10:53:43 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* SKL: igt@i915_selftest@mock_contexts - incomplete - WARN_ON(debug_locks &amp;&amp; !lock_is_held(&amp;(&amp;request-&gt;i915-&gt;drm.struct_mutex)-&gt;dep_map))
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6008/shard-skl9/igt@i915_selftest@mock_contexts.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4222/shard-skl4/igt@i915_selftest@mock_contexts.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6010/shard-skl9/igt@i915_selftest@mock_contexts.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6011/shard-skl9/igt@i915_selftest@mock_contexts.html
Comment 2 Chris Wilson 2019-04-29 10:57:06 UTC
commit 62c8e423450d7140a640651227ec563d6b0141ce (HEAD -> drm-intel-next-queued, drm-intel/drm-intel-next-queued)
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Mon Apr 29 10:07:35 2019 +0100

    drm/i915: Skip unused contexts for context_barrier_task()
    
    If the context has not been used yet, it needs no barrier, and in the
    process fix up the selftest in mock_contexts.
    
    Testcase: igt/gem_ctx_clone/vm
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Cc: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
    Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20190429090735.326-1-chris@chris-wilson.co.uk
Comment 3 CI Bug Log 2019-04-29 14:46:10 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SKL: igt@i915_selftest@mock_contexts - incomplete - WARN_ON(debug_locks &amp;&amp; !lock_is_held(&amp;(&amp;request-&gt;i915-&gt;drm.struct_mutex)-&gt;dep_map)) -}
{+ APL SKL: igt@i915_selftest@mock_contexts - incomplete - WARN_ON(debug_locks &amp;&amp; !lock_is_held(&amp;(&amp;request-&gt;i915-&gt;drm.struct_mutex)-&gt;dep_map)) +}


  No new failures caught with the new filter
Comment 4 CI Bug Log 2019-04-29 14:48:31 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* APL: igt@runner@aborted - fail - Previous test: i915_selftest (mock_contexts)
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6007/shard-apl4/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_12886/shard-apl6/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4214/shard-apl3/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4215/shard-apl4/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4216/shard-apl7/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/TrybotIGT_6/shard-apl6/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6008/shard-apl5/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4220/shard-apl8/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4222/shard-apl1/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6009/shard-apl5/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6010/shard-apl1/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6011/shard-apl5/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_2931/shard-apl1/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4223/shard-apl2/igt@runner@aborted.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_2932/shard-apl8/igt@runner@aborted.html
Comment 5 Lakshmi 2019-04-29 14:49:13 UTC
(In reply to CI Bug Log from comment #4)
> The CI Bug Log issue associated to this bug has been updated.
> 
> ### New filters associated
> 
> * APL: igt@runner@aborted - fail - Previous test: i915_selftest
> (mock_contexts)
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6007/shard-apl4/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_12886/shard-apl6/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4214/shard-apl3/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4215/shard-apl4/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4216/shard-apl7/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/TrybotIGT_6/shard-apl6/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6008/shard-apl5/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4220/shard-apl8/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4222/shard-apl1/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6009/shard-apl5/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6010/shard-apl1/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6011/shard-apl5/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_2931/shard-apl1/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4223/shard-apl2/
> igt@runner@aborted.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_2932/shard-apl8/
> igt@runner@aborted.html

Also seen on APL.
Comment 6 Lakshmi 2019-09-30 13:07:41 UTC
(In reply to Chris Wilson from comment #2)
> commit 62c8e423450d7140a640651227ec563d6b0141ce (HEAD ->
> drm-intel-next-queued, drm-intel/drm-intel-next-queued)
> Author: Chris Wilson <chris@chris-wilson.co.uk>
> Date:   Mon Apr 29 10:07:35 2019 +0100
> 
>     drm/i915: Skip unused contexts for context_barrier_task()
>     
>     If the context has not been used yet, it needs no barrier, and in the
>     process fix up the selftest in mock_contexts.
>     
>     Testcase: igt/gem_ctx_clone/vm
>     Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
>     Cc: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
>     Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
>     Link:
> https://patchwork.freedesktop.org/patch/msgid/20190429090735.326-1-
> chris@chris-wilson.co.uk

This issue used to reproduce 100% until CI_DRM_6011 but not seen later. Current CI_DRM run is 6973, closing and archiving this issue.
Comment 7 CI Bug Log 2019-09-30 13:07:50 UTC
The CI Bug Log issue associated to this bug has been archived.

New failures matching the above filters will not be associated to this bug anymore.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.