Bug 111074

Summary: [CI][BAT] igt@i915_selftest@live_hangcheck - dmesg-warn - Expected domain [12] to be held awake by caller, count=1
Product: DRI Reporter: Martin Peres <martin.peres>
Component: DRM/IntelAssignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: RESOLVED FIXED QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal    
Priority: medium CC: intel-gfx-bugs
Version: DRI git   
Hardware: Other   
OS: All   
Whiteboard:
i915 platform: i915 features:

Description Martin Peres 2019-07-05 07:24:27 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6418/fi-kbl-7500u/igt@i915_selftest@live_hangcheck.html

<4> [452.427098] ------------[ cut here ]------------
<4> [452.427102] Expected domain 2 to be held awake by caller, count=1
<4> [452.427175] WARNING: CPU: 1 PID: 4762 at drivers/gpu/drm/i915/intel_uncore.c:770 assert_forcewakes_active+0xd3/0x170 [i915]
<4> [452.427177] Modules linked in: i915(+) amdgpu gpu_sched ttm vgem snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic mei_hdcp snd_hda_codec x86_pkg_temp_thermal coretemp snd_hwdep snd_hda_core crct10dif_pclmul snd_pcm crc32_pclmul btusb btrtl btbcm btintel bluetooth ghash_clmulni_intel e1000e ptp pps_core ecdh_generic ecc mei_me mei prime_numbers [last unloaded: i915]
<4> [452.427195] CPU: 1 PID: 4762 Comm: i915_selftest Tainted: G     U            5.2.0-rc7-CI-CI_DRM_6418+ #1
<4> [452.427197] Hardware name: GIGABYTE GB-BKi7(H)A-7500/MFLP7AP-00, BIOS F7 06/28/2017
<4> [452.427247] RIP: 0010:assert_forcewakes_active+0xd3/0x170 [i915]
<4> [452.427250] Code: 8b 54 24 10 39 ca 72 11 85 db 75 af e8 a6 62 7b e0 fb 5b 5d 41 5c 41 5d c3 41 8b 74 24 08 48 c7 c7 88 1a b6 a0 e8 cd e9 6c e0 <0f> 0b eb de f3 c3 89 de 48 c7 c7 48 1a b6 a0 e8 b9 e9 6c e0 0f 0b
<4> [452.427252] RSP: 0018:ffffc900004b7778 EFLAGS: 00010086
<4> [452.427254] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
<4> [452.427255] RDX: 0000000000000007 RSI: 0000000000000000 RDI: 00000000ffffffff
<4> [452.427257] RBP: 00000000ffffffff R08: 0000000000000000 R09: 0000000000000001
<4> [452.427259] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8882472ed798
<4> [452.427261] R13: ffff8881d9fc0d38 R14: 0000000000000001 R15: 00000000000003e9
<4> [452.427263] FS:  00007efdc5ec4e40(0000) GS:ffff888265a80000(0000) knlGS:0000000000000000
<4> [452.427265] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [452.427268] CR2: 00005649b7888d18 CR3: 0000000255712005 CR4: 00000000003606e0
<4> [452.427269] Call Trace:
<4> [452.427324]  intel_mocs_init_engine+0x34/0xd0 [i915]
<4> [452.427329]  ? _raw_spin_unlock_irqrestore+0x4c/0x60
<4> [452.427381]  execlists_resume+0x2d/0x2b0 [i915]
<4> [452.427385]  ? _raw_spin_unlock_irqrestore+0x39/0x60
<4> [452.427431]  i915_reset_engine+0xf4/0x1f0 [i915]
<4> [452.427474]  __igt_reset_engines+0x3e6/0x7a0 [i915]
<4> [452.427532]  igt_reset_engines+0x30/0x50 [i915]
<4> [452.427589]  __i915_subtests+0xb8/0x210 [i915]
<4> [452.427646]  ? __i915_nop_teardown+0x10/0x10 [i915]
<4> [452.427695]  ? __i915_live_setup+0x10/0x10 [i915]
<4> [452.427743]  intel_hangcheck_live_selftests+0xa2/0xf0 [i915]
<4> [452.427796]  __run_selftests+0x112/0x170 [i915]
<4> [452.427847]  i915_live_selftests+0x2c/0x60 [i915]
<4> [452.427889]  i915_pci_probe+0x83/0x1a0 [i915]
<4> [452.427892]  ? _raw_spin_unlock_irqrestore+0x39/0x60
<4> [452.427898]  pci_device_probe+0x9e/0x120
<4> [452.427902]  really_probe+0xea/0x3c0
<4> [452.427905]  driver_probe_device+0x10b/0x120
<4> [452.427908]  device_driver_attach+0x4a/0x50
<4> [452.427911]  __driver_attach+0x97/0x130
<4> [452.427915]  ? device_driver_attach+0x50/0x50
<4> [452.427918]  bus_for_each_dev+0x74/0xc0
<4> [452.427922]  bus_add_driver+0x13f/0x210
<4> [452.427924]  ? 0xffffffffa0249000
<4> [452.427927]  driver_register+0x56/0xe0
<4> [452.427929]  ? 0xffffffffa0249000
<4> [452.427931]  do_one_initcall+0x58/0x300
<4> [452.427934]  ? do_init_module+0x1d/0x1f6
<4> [452.427937]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [452.427940]  ? kmem_cache_alloc_trace+0x261/0x290
<4> [452.427945]  do_init_module+0x56/0x1f6
<4> [452.427949]  load_module+0x24d1/0x2990
<4> [452.427962]  ? __se_sys_finit_module+0xd3/0xf0
<4> [452.427964]  __se_sys_finit_module+0xd3/0xf0
<4> [452.427975]  do_syscall_64+0x55/0x1c0
<4> [452.427977]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [452.427979] RIP: 0033:0x7efdc557f839
<4> [452.427982] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [452.427983] RSP: 002b:00007ffcf2676f78 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [452.427985] RAX: ffffffffffffffda RBX: 000055c1c3317530 RCX: 00007efdc557f839
<4> [452.427987] RDX: 0000000000000000 RSI: 000055c1c3310990 RDI: 0000000000000006
<4> [452.427988] RBP: 000055c1c3310990 R08: 0000000000000004 R09: 000055c1c1954c1b
<4> [452.427990] R10: 00007ffcf26771c0 R11: 0000000000000246 R12: 0000000000000000
<4> [452.427992] R13: 000055c1c33116a0 R14: 0000000000000020 R15: 0000000000000048
<4> [452.428001] irq event stamp: 49561256
<4> [452.428003] hardirqs last  enabled at (49561255): [<ffffffff819a907c>] _raw_spin_unlock_irqrestore+0x4c/0x60
<4> [452.428046] hardirqs last disabled at (49561256): [<ffffffffa09dfdfb>] assert_forcewakes_active+0x5b/0x170 [i915]
<4> [452.428048] softirqs last  enabled at (49561220): [<ffffffff81c0033a>] __do_softirq+0x33a/0x4b9
<4> [452.428051] softirqs last disabled at (49561213): [<ffffffff81a00e1a>] do_softirq_own_stack+0x2a/0x40
<4> [452.428091] WARNING: CPU: 1 PID: 4762 at drivers/gpu/drm/i915/intel_uncore.c:770 assert_forcewakes_active+0xd3/0x170 [i915]
<4> [452.428093] ---[ end trace 592e7b9cadfcd79d ]---
Comment 4 Chris Wilson 2019-07-05 10:51:50 UTC
commit badf1f2724d430c8a310be549b49f8333817dbeb (HEAD -> drm-intel-next-queued, drm-intel/for-linux-next, drm-intel/drm-intel-next-queued)
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Fri Jul 5 08:45:57 2019 +0100

    drm/i915: Order assert forcewake test
    
    Read the current value before computing the expected to ensure that if
    the timer does complete early (against our will), it should not cause a
    false positive.
    
    v2: The local irq disable did not prevent the timer from running.
    
    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111074
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Cc: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
    Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20190705074604.16496-1-chris@chris-wilson.co.uk

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.