Bug 111944

Summary: [CI][BAT] igt@i915_selftest@live_coherency - timeout - GEM_BUG_ON(gt->awake)
Product: DRI
Reporter: Lakshmi <lakshminarayana.vudum>
Component: DRM/Intel
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: RESOLVED FIXED
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal
Priority: low
CC: intel-gfx-bugs
Version: DRI git
Hardware: Other
OS: All
Whiteboard:
i915 platform: GLK, SKL
i915 features: GEM/Other

Description Lakshmi 2019-10-09 18:25:44 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7026/fi-glk-dsi/igt@i915_selftest@live_coherency.html

[574.456268] BUG: sleeping function called from invalid context at kernel/sched/completion.c:99
<3> [574.456274] in_atomic(): 0, irqs_disabled(): 0, non_block: 0, pid: 4474, name: i915_selftest
<4> [574.456276] INFO: lockdep is turned off.
<3> [574.456591] Preemption disabled at:
<4> [574.456595] [<0000000000000000>] 0x0
<4> [574.456601] CPU: 0 PID: 4474 Comm: i915_selftest Tainted: G     UD           5.4.0-rc2-CI-CI_DRM_7026+ #1
<4> [574.456603] Hardware name: Intel Corp. Geminilake/GLK RVP2 LP4SD (07), BIOS GELKRVPA.X64.0062.B30.1708222146 08/22/2017
<4> [574.456606] Call Trace:
<4> [574.456615]  dump_stack+0x67/0x9b
<4> [574.456620]  ___might_sleep+0x178/0x260
<4> [574.456625]  wait_for_completion+0x37/0x1a0
<4> [574.456632]  virt_efi_query_variable_info+0x161/0x1b0
<4> [574.456637]  efi_query_variable_store+0xb3/0x1a0
<4> [574.456643]  ? efivar_entry_set_safe+0x19c/0x220
<4> [574.456646]  ? efi_delete_dummy_variable+0x90/0x90
<4> [574.456649]  efivar_entry_set_safe+0x19c/0x220
<4> [574.456654]  ? efi_pstore_write+0x10b/0x150
<4> [574.456657]  efi_pstore_write+0x10b/0x150
<4> [574.456668]  pstore_dump+0x127/0x340
<4> [574.456677]  kmsg_dump+0x87/0x1c0
<4> [574.456682]  oops_end+0x3e/0x90
<4> [574.456685]  do_trap+0x80/0x100
<4> [574.456807]  ? intel_gt_driver_remove+0x54/0x60 [i915]
<4> [574.456811]  do_invalid_op+0x23/0x30
<4> [574.456884]  ? intel_gt_driver_remove+0x54/0x60 [i915]
<4> [574.456887]  invalid_op+0x23/0x30
<4> [574.456958] RIP: 0010:intel_gt_driver_remove+0x54/0x60 [i915]
<4> [574.456962] Code: bb 90 7b e0 48 8b 35 b3 0d 25 00 49 c7 c0 f9 a1 b2 a0 b9 7f 01 00 00 48 c7 c2 50 7d ac a0 48 c7 c7 cf 1e 97 a0 e8 7c 8b 82 e0 <0f> 0b 66 2e 0f 1f 84 00 00 00 00 00 e9 6b 87 fd ff 90 66 2e 0f 1f
<4> [574.456964] RSP: 0018:ffffc9000026fac0 EFLAGS: 00010282
<4> [574.456967] RAX: 000000000000000a RBX: ffff88816d800000 RCX: 0000000000000000
<4> [574.456970] RDX: 0000000000000001 RSI: 0000000000000008 RDI: ffff88817b097a38
<4> [574.456972] RBP: ffff88816d80a080 R08: 00000000003a9663 R09: ffff88815a5cf000
<4> [574.456974] R10: 0000000000000000 R11: ffff88817b097a38 R12: 0000000000000003
<4> [574.456976] R13: ffffffffa0bb2250 R14: ffffffffa0bb21e0 R15: ffffc9000026fe88
<4> [574.457060]  i915_gem_driver_remove+0x34/0xc0 [i915]
<4> [574.457131]  i915_driver_remove+0xe9/0x110 [i915]
<4> [574.457200]  i915_pci_remove+0x19/0x40 [i915]
<4> [574.457269]  i915_pci_probe+0xa3/0x1b0 [i915]
<4> [574.457275]  pci_device_probe+0x9e/0x120
<4> [574.457282]  really_probe+0xea/0x420
<4> [574.457286]  driver_probe_device+0x10b/0x120
<4> [574.457290]  device_driver_attach+0x4a/0x50
<4> [574.457294]  __driver_attach+0x97/0x130
<4> [574.457297]  ? device_driver_attach+0x50/0x50
<4> [574.457300]  bus_for_each_dev+0x74/0xc0
<4> [574.457305]  bus_add_driver+0x142/0x220
<4> [574.457309]  ? 0xffffffffa0135000
<4> [574.457312]  driver_register+0x56/0xf0
<4> [574.457315]  ? 0xffffffffa0135000
<4> [574.457319]  do_one_initcall+0x58/0x2ff
<4> [574.457324]  ? rcu_read_lock_sched_held+0x4d/0x80
<4> [574.457329]  ? kmem_cache_alloc_trace+0x290/0x2c0
<4> [574.457334]  do_init_module+0x56/0x1f8
<4> [574.457338]  load_module+0x243e/0x29f0
<4> [574.457350]  ? __do_sys_finit_module+0xe9/0x110
<4> [574.457353]  __do_sys_finit_module+0xe9/0x110
<4> [574.457360]  do_syscall_64+0x4f/0x210
<4> [574.457365]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [574.457368] RIP: 0033:0x7fe9df6f9839
<4> [574.457371] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [574.457374] RSP: 002b:00007ffe59c88c28 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [574.457377] RAX: ffffffffffffffda RBX: 000055605726e680 RCX: 00007fe9df6f9839
<4> [574.457379] RDX: 0000000000000000 RSI: 0000556057267240 RDI: 0000000000000006
<4> [574.457381] RBP: 0000556057267240 R08: 6173696420312d3d R09: 000055605726fe20
<4> [574.457383] R10: 7374736574666c65 R11: 0000000000000246 R12: 0000000000000000
<4> [574.457385] R13: 0000556057260000 R14: 0000000000000020 R15: 0000000000000048
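
For triage, the faulting kernel function can be pulled out of a trace like the one above by looking for the kernel-mode RIP line. The following is only a minimal sketch: the helper name `faulting_symbols` and the regular expression are illustrative assumptions, not part of any CI tooling, and the pattern assumes the x86-64 log layout seen in this report (kernel code segment 0010, optional module name in square brackets).

```python
import re

# Hypothetical helper for illustration only; not part of IGT or CI Bug Log.
# Matches kernel-mode RIP lines such as:
#   <4> [574.456958] RIP: 0010:intel_gt_driver_remove+0x54/0x60 [i915]
# User-mode RIP lines (segment 0033) are deliberately not matched.
RIP_RE = re.compile(
    r"RIP: 0010:(?P<symbol>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<module>\w+)\])?"
)


def faulting_symbols(dmesg_lines):
    """Return (symbol, module) pairs for each kernel-mode RIP line found."""
    found = []
    for line in dmesg_lines:
        match = RIP_RE.search(line)
        if match:
            found.append((match.group("symbol"), match.group("module")))
    return found


# Sample lines copied from the trace in this report.
sample = [
    "<4> [574.456887]  invalid_op+0x23/0x30",
    "<4> [574.456958] RIP: 0010:intel_gt_driver_remove+0x54/0x60 [i915]",
    "<4> [574.457368] RIP: 0033:0x7fe9df6f9839",
]

print(faulting_symbols(sample))
```

On the sample lines this reports `intel_gt_driver_remove` in the `i915` module, which is the function where the GEM_BUG_ON(gt->awake) from the summary fired.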
Comment 2 Chris Wilson 2019-10-09 18:31:17 UTC
The missing chunk of dmesg does not help, but basically the GEM_BUG_ON(gt->awake) is complaining that a request went missing.
Comment 3 CI Bug Log 2019-10-09 18:33:27 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SKL GLK: igt@i915_selftest@live_coherency - timeout - BUG: sleeping function called from invalid context at kernel/sched/completion.c:99 -}
{+ SKL GLK: igt@i915_selftest@live_coherency - timeout -  GEM_BUG_ON(gt->awake) +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7026/fi-skl-lmem/igt@i915_selftest@live_coherency.html
Comment 4 Chris Wilson 2019-10-18 21:57:39 UTC
*** Bug 112062 has been marked as a duplicate of this bug. ***
Comment 5 CI Bug Log 2019-10-19 06:40:41 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SKL GLK: igt@i915_selftest@live_coherency - timeout -  GEM_BUG_ON(gt->awake) -}
{+ SKL GLK: igt@i915_selftest@live_coherency - timeout -  GEM_BUG_ON(gt->awake) +}


  No new failures caught with the new filter
Comment 6 CI Bug Log 2019-10-19 06:46:20 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SKL GLK: igt@i915_selftest@live_coherency - timeout -  GEM_BUG_ON(gt->awake) -}
{+ SKL KBL GLK: igt@i915_selftest@live_coherency - timeout -  GEM_BUG_ON(gt->awake) +}


  No new failures caught with the new filter
Comment 7 CI Bug Log 2019-10-19 06:47:49 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* KBL: igt@runner@aborted - fail - Previous test: i915_selftest (live_coherency)
  (No new failures associated)
Comment 8 CI Bug Log 2019-10-22 11:10:04 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SKL KBL GLK: igt@i915_selftest@live_coherency - timeout -  GEM_BUG_ON(gt->awake) -}
{+ SKL KBL GLK CFL: igt@i915_selftest@live_coherency - timeout -  GEM_BUG_ON(gt->awake) +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7142/fi-cfl-8109u/igt@i915_selftest@live_coherency.html
Comment 9 CI Bug Log 2019-10-22 11:10:59 UTC
A CI Bug Log filter associated to this bug has been updated:

{- KBL: igt@runner@aborted - fail - Previous test: i915_selftest (live_coherency) -}
{+ KBL CFL: igt@runner@aborted - fail - Previous test: i915_selftest (live_coherency) +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7142/fi-cfl-8109u/igt@runner@aborted.html
Comment 11 Chris Wilson 2019-11-20 17:32:47 UTC
commit a6edbca74b305adc165e67065d7ee766006e6a48
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Wed Nov 20 16:55:13 2019 +0000

    drm/i915/gt: Close race between engine_park and intel_gt_retire_requests
