Bug 111055

Summary: [CI][SHARDS] igt@gem_eio@in-flight-10ms - dmesg-warn - i915_reset_device timed out, cancelling all in-flight rendering.
Product: DRI Reporter: Lakshmi <lakshminarayana.vudum>
Component: DRM/IntelAssignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: RESOLVED MOVED QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal    
Priority: medium CC: chris, intel-gfx-bugs
Version: DRI git   
Hardware: Other   
OS: All   
Whiteboard: ReadyForDev
i915 platform: GLK, SKL i915 features: GEM/Other

Description Lakshmi 2019-07-04 05:46:39 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6398/shard-glk5/igt@gem_eio@in-flight-10ms.html

<6> [767.021215] Console: switching to colour dummy device 80x25
<6> [767.021315] [IGT] gem_eio: executing
<5> [767.035566] Setting dangerous option reset - tainting kernel
<5> [767.035673] Setting dangerous option reset - tainting kernel
<7> [767.041448] [drm:i915_reset_device [i915]] resetting chip
<5> [767.059160] i915 0000:00:02.0: Resetting chip for Manually set wedged engine mask = ffffffffffffffff
<6> [767.066293] [IGT] gem_eio: starting subtest in-flight-10ms
<7> [767.071388] [drm:vgem_gem_dumb_create [vgem]] Created object of size 1
<7> [767.104428] [drm:vgem_gem_dumb_create [vgem]] Created object of size 1
<7> [767.137528] [drm:vgem_gem_dumb_create [vgem]] Created object of size 1
<7> [767.161311] [drm:vgem_gem_dumb_create [vgem]] Created object of size 1
<5> [767.191524] Setting dangerous option reset - tainting kernel
<7> [767.220952] [drm:i915_reset_device [i915]] resetting chip
<5> [767.233712] i915 0000:00:02.0: Resetting chip for Manually set wedged engine mask = ffffffffffffffff
<7> [767.238935] [drm:i915_reset [i915]] GPU reset disabled
<7> [767.238945] __i915_gem_set_wedged rcs0
<7> [767.238949] __i915_gem_set_wedged 	Awake? 3
<7> [767.238952] __i915_gem_set_wedged 	Hangcheck: 44 ms ago
<7> [767.238956] __i915_gem_set_wedged 	Reset count: 3 (global 6)
<7> [767.238960] __i915_gem_set_wedged 	Requests:
<7> [767.238975] __i915_gem_set_wedged 		active  e46:2*  prio=3 @ 44ms: gem_eio[4065]
<7> [767.238978] __i915_gem_set_wedged 		ring->start:  0x00802000
<7> [767.238982] __i915_gem_set_wedged 		ring->head:   0x00000000
<7> [767.238985] __i915_gem_set_wedged 		ring->tail:   0x00002ca8
<7> [767.238989] __i915_gem_set_wedged 		ring->emit:   0x00002cb0
<7> [767.238992] __i915_gem_set_wedged 		ring->space:  0x00001310
<7> [767.238995] __i915_gem_set_wedged 		ring->hwsp:   0xffffe200
<7> [767.238999] __i915_gem_set_wedged [head 0000, postfix 0060, tail 00b0, batch 0x00000000_00040000]:
<7> [767.239026] __i915_gem_set_wedged [0000] 7a000004 00000000 00000000 00000000 00000000 00000000 7a000004 01144c1c
<7> [767.239031] __i915_gem_set_wedged [0020] fffff080 00000000 00000000 00000000 02800000 00000000 10400002 ffffe200
<7> [767.239035] __i915_gem_set_wedged [0040] 00000000 00000001 04000001 18800101 00040000 00000000 04000000 00000000
<7> [767.239040] __i915_gem_set_wedged [0060] 7a000004 01005021 ffffe200 00000000 00000002 00000000 7a000004 00100080
<7> [767.239044] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 01000000 04000001 0e40c002 00000000
<7> [767.239048] __i915_gem_set_wedged [00a0] 007ea0c8 00000000 02800000 00000000
<7> [767.239071] __i915_gem_set_wedged 	RING_START: 0x00802000
<7> [767.239077] __i915_gem_set_wedged 	RING_HEAD:  0x00000058
<7> [767.239083] __i915_gem_set_wedged 	RING_TAIL:  0x00002ca8
<7> [767.239090] __i915_gem_set_wedged 	RING_CTL:   0x00003001
<7> [767.239098] __i915_gem_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [767.239104] __i915_gem_set_wedged 	RING_IMR: fffffefe
<7> [767.239114] __i915_gem_set_wedged 	ACTHD:  0x00000000_00040040
<7> [767.239124] __i915_gem_set_wedged 	BBADDR: 0x00000000_00040041
<7> [767.239134] __i915_gem_set_wedged 	DMA_FADDR: 0x00000000_00802058
<7> [767.239140] __i915_gem_set_wedged 	IPEIR: 0x00000000
<7> [767.239145] __i915_gem_set_wedged 	IPEHR: 0x18800101
<7> [767.239153] __i915_gem_set_wedged 	Execlist status: 0x00044052 00000002, entries 6
<7> [767.239157] __i915_gem_set_wedged 	Execlist CSB read 2, write 2, tasklet queued? no (disabled)
<7> [767.239165] __i915_gem_set_wedged 		Active[0: ring:{start:00802000, hwsp:ffffe200, seqno:00000001}, rq:  e46:82-  prio=3 @ 37ms: gem_eio[4065]
<7> [767.239173] __i915_gem_set_wedged 		E  e46:2*  prio=3 @ 44ms: gem_eio[4065]
<7> [767.239177] __i915_gem_set_wedged 		E  e46:4  prio=3 @ 44ms: gem_eio[4065]
<7> [767.239182] __i915_gem_set_wedged 		E  e46:6  prio=3 @ 44ms: gem_eio[4065]
<7> [767.239186] __i915_gem_set_wedged 		E  e46:8  prio=3 @ 44ms: gem_eio[4065]
<7> [767.239190] __i915_gem_set_wedged 		E  e46:a  prio=3 @ 44ms: gem_eio[4065]
<7> [767.239195] __i915_gem_set_wedged 		E  e46:c  prio=3 @ 43ms: gem_eio[4065]
<7> [767.239199] __i915_gem_set_wedged 		E  e46:e  prio=3 @ 43ms: gem_eio[4065]
<7> [767.239211] __i915_gem_set_wedged 		...skipping 57 executing requests...
<7> [767.239215] __i915_gem_set_wedged 		E  e46:82-  prio=3 @ 37ms: gem_eio[4065]
<7> [767.239218] __i915_gem_set_wedged 		Queue priority hint: 3
<7> [767.239221] __i915_gem_set_wedged HWSP:
<7> [767.239226] __i915_gem_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [767.239229] __i915_gem_set_wedged *
<7> [767.239233] __i915_gem_set_wedged [0040] 00008002 00000002 00008002 00000002 00008002 00000002 00008002 00000002
<7> [767.239238] __i915_gem_set_wedged [0060] 00008002 00000002 00008002 00000002 00000000 00000000 00000000 00000002
<7> [767.239242] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [767.239245] __i915_gem_set_wedged *
<7> [767.239250] __i915_gem_set_wedged Idle? no
<7> [767.239253] __i915_gem_set_wedged Signals:
<7> [767.239259] __i915_gem_set_wedged 	[e46:82] @ 37ms
<7> [767.239263] __i915_gem_set_wedged bcs0
<7> [767.239266] __i915_gem_set_wedged 	Awake? 0
<7> [767.239269] __i915_gem_set_wedged 	Hangcheck: 103 ms ago
<7> [767.239272] __i915_gem_set_wedged 	Reset count: 0 (global 6)
<7> [767.239276] __i915_gem_set_wedged 	Requests:
<7> [767.239287] __i915_gem_set_wedged 	RING_START: 0x007ef000
<7> [767.239293] __i915_gem_set_wedged 	RING_HEAD:  0x000003b8
<7> [767.239298] __i915_gem_set_wedged 	RING_TAIL:  0x000003b8
<7> [767.239306] __i915_gem_set_wedged 	RING_CTL:   0x00000000
<7> [767.239314] __i915_gem_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [767.239319] __i915_gem_set_wedged 	RING_IMR: feffffff
<7> [767.239329] __i915_gem_set_wedged 	ACTHD:  0x00000000_000003b8
<7> [767.239339] __i915_gem_set_wedged 	BBADDR: 0x00000000_00000000
<7> [767.239349] __i915_gem_set_wedged 	DMA_FADDR: 0x00000000_00000000
<7> [767.239355] __i915_gem_set_wedged 	IPEIR: 0x00000000
<7> [767.239360] __i915_gem_set_wedged 	IPEHR: 0x00000000
<7> [767.239368] __i915_gem_set_wedged 	Execlist status: 0x00000301 00000000, entries 6
<7> [767.239371] __i915_gem_set_wedged 	Execlist CSB read 3, write 3, tasklet queued? no (disabled)
<7> [767.239376] __i915_gem_set_wedged HWSP:
<7> [767.239381] __i915_gem_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [767.239384] __i915_gem_set_wedged *
<7> [767.239388] __i915_gem_set_wedged [0040] 00000001 00000000 00000818 00000001 00000001 00000000 00000818 00000000
<7> [767.239392] __i915_gem_set_wedged [0060] 00000001 00000000 00000818 00000001 00000000 00000000 00000000 00000003
<7> [767.239397] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [767.239399] __i915_gem_set_wedged *
<7> [767.239404] __i915_gem_set_wedged Idle? yes
<7> [767.239407] __i915_gem_set_wedged vcs0
<7> [767.239410] __i915_gem_set_wedged 	Awake? 0
<7> [767.239413] __i915_gem_set_wedged 	Hangcheck: 136 ms ago
<7> [767.239416] __i915_gem_set_wedged 	Reset count: 1 (global 6)
<7> [767.239419] __i915_gem_set_wedged 	Requests:
<7> [767.239430] __i915_gem_set_wedged 	RING_START: 0x007f0000
<7> [767.239436] __i915_gem_set_wedged 	RING_HEAD:  0x00000178
<7> [767.239441] __i915_gem_set_wedged 	RING_TAIL:  0x00000178
<7> [767.239449] __i915_gem_set_wedged 	RING_CTL:   0x00000000
<7> [767.239457] __i915_gem_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [767.239462] __i915_gem_set_wedged 	RING_IMR: fffffeff
<7> [767.239472] __i915_gem_set_wedged 	ACTHD:  0x00000000_00000178
<7> [767.239482] __i915_gem_set_wedged 	BBADDR: 0x00000000_00000000
<7> [767.239492] __i915_gem_set_wedged 	DMA_FADDR: 0x00000000_00000000
<7> [767.239498] __i915_gem_set_wedged 	IPEIR: 0x00000000
<7> [767.239503] __i915_gem_set_wedged 	IPEHR: 0x00000000
<7> [767.239511] __i915_gem_set_wedged 	Execlist status: 0x00000301 00000000, entries 6
<7> [767.239515] __i915_gem_set_wedged 	Execlist CSB read 3, write 3, tasklet queued? no (disabled)
<7> [767.239519] __i915_gem_set_wedged HWSP:
<7> [767.239524] __i915_gem_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [767.239527] __i915_gem_set_wedged *
<7> [767.239531] __i915_gem_set_wedged [0040] 00000001 00000000 00000818 00000001 00000001 00000000 00000818 00000000
<7> [767.239535] __i915_gem_set_wedged [0060] 00000001 00000000 00000818 00000001 00000000 00000000 00000000 00000003
<7> [767.239539] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [767.239542] __i915_gem_set_wedged *
<7> [767.239546] __i915_gem_set_wedged Idle? yes
<7> [767.239549] __i915_gem_set_wedged vecs0
<7> [767.239552] __i915_gem_set_wedged 	Awake? 0
<7> [767.239556] __i915_gem_set_wedged 	Hangcheck: 79 ms ago
<7> [767.239559] __i915_gem_set_wedged 	Reset count: 0 (global 6)
<7> [767.239562] __i915_gem_set_wedged 	Requests:
<7> [767.239573] __i915_gem_set_wedged 	RING_START: 0x007f1000
<7> [767.239579] __i915_gem_set_wedged 	RING_HEAD:  0x00000178
<7> [767.239584] __i915_gem_set_wedged 	RING_TAIL:  0x00000178
<7> [767.239592] __i915_gem_set_wedged 	RING_CTL:   0x00000000
<7> [767.239599] __i915_gem_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [767.239604] __i915_gem_set_wedged 	RING_IMR: fffffeff
<7> [767.239614] __i915_gem_set_wedged 	ACTHD:  0x00000000_00000178
<7> [767.239624] __i915_gem_set_wedged 	BBADDR: 0x00000000_00000000
<7> [767.239634] __i915_gem_set_wedged 	DMA_FADDR: 0x00000000_00000000
<7> [767.239639] __i915_gem_set_wedged 	IPEIR: 0x00000000
<7> [767.239645] __i915_gem_set_wedged 	IPEHR: 0x00000000
<7> [767.239652] __i915_gem_set_wedged 	Execlist status: 0x00000301 00000000, entries 6
<7> [767.239656] __i915_gem_set_wedged 	Execlist CSB read 3, write 3, tasklet queued? no (disabled)
<7> [767.239661] __i915_gem_set_wedged HWSP:
<7> [767.239665] __i915_gem_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [767.239668] __i915_gem_set_wedged *
<7> [767.239672] __i915_gem_set_wedged [0040] 00000001 00000000 00000818 00000001 00000001 00000000 00000818 00000000
<7> [767.239676] __i915_gem_set_wedged [0060] 00000001 00000000 00000818 00000001 00000000 00000000 00000000 00000003
<7> [767.239681] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [767.239684] __i915_gem_set_wedged *
<7> [767.239687] __i915_gem_set_wedged Idle? yes
<3> [772.406453] i915 0000:00:02.0: i915_reset_device timed out, cancelling all in-flight rendering.
<7> [772.597938] hangcheck rcs0
<7> [772.597944] hangcheck 	Awake? 4
<7> [772.597948] hangcheck 	Hangcheck: 5403 ms ago
<7> [772.597952] hangcheck 	Reset count: 3 (global 6)
<7> [772.597955] hangcheck 	Requests:
<7> [772.598031] hangcheck 	RING_START: 0x00000000
<7> [772.598036] hangcheck 	RING_HEAD:  0x00000000
<7> [772.598041] hangcheck 	RING_TAIL:  0x00000000
<7> [772.598049] hangcheck 	RING_CTL:   0x00000000
<7> [772.598056] hangcheck 	RING_MODE:  0x00000200 [idle]
<7> [772.598062] hangcheck 	RING_IMR: ffffffff
<7> [772.598071] hangcheck 	ACTHD:  0x00000000_00000000
<7> [772.598081] hangcheck 	BBADDR: 0x00000000_00000000
<7> [772.598091] hangcheck 	DMA_FADDR: 0x00000000_00000000
<7> [772.598096] hangcheck 	IPEIR: 0x00000000
<7> [772.598101] hangcheck 	IPEHR: 0x00000000
<7> [772.598109] hangcheck 	Execlist status: 0x00000001 00000000, entries 6
<7> [772.598113] hangcheck 	Execlist CSB read 2, write 2, tasklet queued? no (disabled)
<7> [772.598122] hangcheck 		Active[0: ring:{start:00802000, hwsp:ffffe200, seqno:00000001}, rq:  e46:82-  prio=3 @ 5396ms: gem_eio[4065]
<7> [772.598130] hangcheck 		E  e46:2*  prio=3 @ 5403ms: gem_eio[4065]
<7> [772.598134] hangcheck 		E  e46:4  prio=3 @ 5403ms: gem_eio[4065]
<7> [772.598139] hangcheck 		E  e46:6  prio=3 @ 5403ms: gem_eio[4065]
<7> [772.598143] hangcheck 		E  e46:8  prio=3 @ 5403ms: gem_eio[4065]
<7> [772.598147] hangcheck 		E  e46:a  prio=3 @ 5403ms: gem_eio[4065]
<7> [772.598151] hangcheck 		E  e46:c  prio=3 @ 5402ms: gem_eio[4065]
<7> [772.598155] hangcheck 		E  e46:e  prio=3 @ 5402ms: gem_eio[4065]
<7> [772.598159] hangcheck 		...skipping 57 executing requests...
<7> [772.598163] hangcheck 		E  e46:82-  prio=3 @ 5396ms: gem_eio[4065]
<7> [772.598166] hangcheck 		Queue priority hint: 3
<7> [772.598169] hangcheck HWSP:
<7> [772.598174] hangcheck [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [772.598177] hangcheck *
<7> [772.598182] hangcheck [0040] 00008002 00000002 00008002 00000002 00008002 00000002 00008002 00000002
<7> [772.598186] hangcheck [0060] 00008002 00000002 00008002 00000002 00000000 00000000 00000000 00000002
<7> [772.598190] hangcheck [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [772.598193] hangcheck *
<7> [772.598198] hangcheck Idle? no
<7> [772.598201] hangcheck Signals:
<7> [772.598205] hangcheck 	[e46:82] @ 5396ms
<7> [774.043789] [drm:i915_reset_request [i915]] client gem_eio[4065]: gained 1 ban score, now 1
<5> [774.048587] Setting dangerous option reset - tainting kernel
<7> [774.058957] [drm:i915_reset_device [i915]] resetting chip
<5> [774.076973] i915 0000:00:02.0: Resetting chip for Manually set wedged engine mask = ffffffffffffffff
<5> [774.098094] Setting dangerous option reset - tainting kernel
<7> [774.119214] [drm:i915_reset_device [i915]] resetting chip
<5> [774.128854] i915 0000:00:02.0: Resetting chip for Manually set wedged engine mask = ffffffffffffffff
<7> [774.129013] [drm:i915_reset [i915]] GPU reset disabled
<7> [774.129024] __i915_gem_set_wedged rcs0
<7> [774.129028] __i915_gem_set_wedged 	Awake? 3
<7> [774.129033] __i915_gem_set_wedged 	Hangcheck: 28 ms ago
<7> [774.129037] __i915_gem_set_wedged 	Reset count: 3 (global 8)
<7> [774.129041] __i915_gem_set_wedged 	Requests:
<7> [774.129058] __i915_gem_set_wedged 		active  e4b:2*  prio=3 @ 28ms: gem_eio[4065]
<7> [774.129062] __i915_gem_set_wedged 		ring->start:  0x00802000
<7> [774.129067] __i915_gem_set_wedged 		ring->head:   0x00000000
<7> [774.129071] __i915_gem_set_wedged 		ring->tail:   0x00002ca8
<7> [774.129075] __i915_gem_set_wedged 		ring->emit:   0x00002cb0
<7> [774.129079] __i915_gem_set_wedged 		ring->space:  0x00001310
<7> [774.129083] __i915_gem_set_wedged 		ring->hwsp:   0xffffe200
<7> [774.129088] __i915_gem_set_wedged [head 0000, postfix 0060, tail 00b0, batch 0x00000000_00040000]:
<7> [774.129111] __i915_gem_set_wedged [0000] 7a000004 00000000 00000000 00000000 00000000 00000000 7a000004 01144c1c
<7> [774.129118] __i915_gem_set_wedged [0020] fffff080 00000000 00000000 00000000 02800000 00000000 10400002 ffffe200
<7> [774.129123] __i915_gem_set_wedged [0040] 00000000 00000001 04000001 18800101 00040000 00000000 04000000 00000000
<7> [774.129129] __i915_gem_set_wedged [0060] 7a000004 01005021 ffffe200 00000000 00000002 00000000 7a000004 00100080
<7> [774.129135] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 01000000 04000001 0e40c002 00000000
<7> [774.129140] __i915_gem_set_wedged [00a0] 007ea0c8 00000000 02800000 00000000
<7> [774.129164] __i915_gem_set_wedged 	RING_START: 0x00802000
<7> [774.129171] __i915_gem_set_wedged 	RING_HEAD:  0x00000058
<7> [774.129180] __i915_gem_set_wedged 	RING_TAIL:  0x00002ca8
<7> [774.129189] __i915_gem_set_wedged 	RING_CTL:   0x00003001
<7> [774.129199] __i915_gem_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [774.129205] __i915_gem_set_wedged 	RING_IMR: fffffefe
<7> [774.129217] __i915_gem_set_wedged 	ACTHD:  0x00000000_00040040
<7> [774.129228] __i915_gem_set_wedged 	BBADDR: 0x00000000_00040041
<7> [774.129240] __i915_gem_set_wedged 	DMA_FADDR: 0x00000000_00802058
<7> [774.129246] __i915_gem_set_wedged 	IPEIR: 0x00000000
<7> [774.129253] __i915_gem_set_wedged 	IPEHR: 0x18800101
<7> [774.129262] __i915_gem_set_wedged 	Execlist status: 0x00044052 00000002, entries 6
<7> [774.129267] __i915_gem_set_wedged 	Execlist CSB read 2, write 2, tasklet queued? no (disabled)
<7> [774.129276] __i915_gem_set_wedged 		Active[0: ring:{start:00802000, hwsp:ffffe200, seqno:00000001}, rq:  e4b:82-  prio=3 @ 21ms: gem_eio[4065]
<7> [774.129285] __i915_gem_set_wedged 		E  e4b:2*  prio=3 @ 28ms: gem_eio[4065]
<7> [774.129291] __i915_gem_set_wedged 		E  e4b:4  prio=3 @ 28ms: gem_eio[4065]
<7> [774.129297] __i915_gem_set_wedged 		E  e4b:6  prio=3 @ 28ms: gem_eio[4065]
<7> [774.129302] __i915_gem_set_wedged 		E  e4b:8  prio=3 @ 28ms: gem_eio[4065]
<7> [774.129307] __i915_gem_set_wedged 		E  e4b:a  prio=3 @ 27ms: gem_eio[4065]
<7> [774.129313] __i915_gem_set_wedged 		E  e4b:c  prio=3 @ 27ms: gem_eio[4065]
<7> [774.129318] __i915_gem_set_wedged 		E  e4b:e  prio=3 @ 27ms: gem_eio[4065]
<7> [774.129331] __i915_gem_set_wedged 		...skipping 57 executing requests...
<7> [774.129337] __i915_gem_set_wedged 		E  e4b:82-  prio=3 @ 21ms: gem_eio[4065]
<7> [774.129340] __i915_gem_set_wedged 		Queue priority hint: 3
<7> [774.129344] __i915_gem_set_wedged HWSP:
<7> [774.129350] __i915_gem_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [774.129355] __i915_gem_set_wedged *
<7> [774.129360] __i915_gem_set_wedged [0040] 00008002 00000002 00008002 00000002 00008002 00000002 00008002 00000002
<7> [774.129366] __i915_gem_set_wedged [0060] 00008002 00000002 00008002 00000002 00000000 00000000 00000000 00000002
<7> [774.129371] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [774.129375] __i915_gem_set_wedged *
<7> [774.129381] __i915_gem_set_wedged Idle? no
<7> [774.129385] __i915_gem_set_wedged Signals:
<7> [774.129392] __i915_gem_set_wedged 	[e4b:82] @ 21ms
<7> [774.129396] __i915_gem_set_wedged bcs0
<7> [774.129400] __i915_gem_set_wedged 	Awake? 0
<7> [774.129404] __i915_gem_set_wedged 	Hangcheck: 50 ms ago
<7> [774.129408] __i915_gem_set_wedged 	Reset count: 0 (global 8)
<7> [774.129412] __i915_gem_set_wedged 	Requests:
<7> [774.129427] __i915_gem_set_wedged 	RING_START: 0x007ef000
<7> [774.129433] __i915_gem_set_wedged 	RING_HEAD:  0x000003f8
<7> [774.129440] __i915_gem_set_wedged 	RING_TAIL:  0x000003f8
<7> [774.129448] __i915_gem_set_wedged 	RING_CTL:   0x00000000
<7> [774.129457] __i915_gem_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [774.129464] __i915_gem_set_wedged 	RING_IMR: feffffff
<7> [774.129475] __i915_gem_set_wedged 	ACTHD:  0x00000000_000003f8
<7> [774.129486] __i915_gem_set_wedged 	BBADDR: 0x00000000_00000000
<7> [774.129498] __i915_gem_set_wedged 	DMA_FADDR: 0x00000000_007ef3f8
<7> [774.129504] __i915_gem_set_wedged 	IPEIR: 0x00000000
<7> [774.129510] __i915_gem_set_wedged 	IPEHR: 0x0e40c002
<7> [774.129520] __i915_gem_set_wedged 	Execlist status: 0x00000301 00000000, entries 6
<7> [774.129524] __i915_gem_set_wedged 	Execlist CSB read 3, write 3, tasklet queued? no (disabled)
<7> [774.129531] __i915_gem_set_wedged HWSP:
<7> [774.129536] __i915_gem_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [774.129540] __i915_gem_set_wedged *
<7> [774.129546] __i915_gem_set_wedged [0040] 00000001 00000000 00000818 00000003 00000001 00000000 00000818 00000000
<7> [774.129551] __i915_gem_set_wedged [0060] 00000001 00000000 00000818 00000001 00000000 00000000 00000000 00000003
<7> [774.129556] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [774.129560] __i915_gem_set_wedged *
<7> [774.129565] __i915_gem_set_wedged Idle? yes
<7> [774.129569] __i915_gem_set_wedged vcs0
<7> [774.129573] __i915_gem_set_wedged 	Awake? 0
<7> [774.129577] __i915_gem_set_wedged 	Hangcheck: 48 ms ago
<7> [774.129581] __i915_gem_set_wedged 	Reset count: 1 (global 8)
<7> [774.129585] __i915_gem_set_wedged 	Requests:
<7> [774.129598] __i915_gem_set_wedged 	RING_START: 0x007f0000
<7> [774.129605] __i915_gem_set_wedged 	RING_HEAD:  0x000001b8
<7> [774.129611] __i915_gem_set_wedged 	RING_TAIL:  0x000001b8
<7> [774.129620] __i915_gem_set_wedged 	RING_CTL:   0x00000000
<7> [774.129629] __i915_gem_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [774.129636] __i915_gem_set_wedged 	RING_IMR: fffffeff
<7> [774.129647] __i915_gem_set_wedged 	ACTHD:  0x00000000_000001b8
<7> [774.129659] __i915_gem_set_wedged 	BBADDR: 0x00000000_00000000
<7> [774.129670] __i915_gem_set_wedged 	DMA_FADDR: 0x00000000_00000000
<7> [774.129676] __i915_gem_set_wedged 	IPEIR: 0x00000000
<7> [774.129683] __i915_gem_set_wedged 	IPEHR: 0x00000000
<7> [774.129692] __i915_gem_set_wedged 	Execlist status: 0x00000301 00000000, entries 6
<7> [774.129697] __i915_gem_set_wedged 	Execlist CSB read 3, write 3, tasklet queued? no (disabled)
<7> [774.129702] __i915_gem_set_wedged HWSP:
<7> [774.129708] __i915_gem_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [774.129712] __i915_gem_set_wedged *
<7> [774.129717] __i915_gem_set_wedged [0040] 00000001 00000000 00000818 00000003 00000001 00000000 00000818 00000000
<7> [774.129722] __i915_gem_set_wedged [0060] 00000001 00000000 00000818 00000001 00000000 00000000 00000000 00000003
<7> [774.129728] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [774.129731] __i915_gem_set_wedged *
<7> [774.129736] __i915_gem_set_wedged Idle? yes
<7> [774.129740] __i915_gem_set_wedged vecs0
<7> [774.129744] __i915_gem_set_wedged 	Awake? 0
<7> [774.129748] __i915_gem_set_wedged 	Hangcheck: 48 ms ago
<7> [774.129752] __i915_gem_set_wedged 	Reset count: 0 (global 8)
<7> [774.129756] __i915_gem_set_wedged 	Requests:
<7> [774.131072] __i915_gem_set_wedged 	RING_START: 0x007f1000
<7> [774.131080] __i915_gem_set_wedged 	RING_HEAD:  0x000001b8
<7> [774.131086] __i915_gem_set_wedged 	RING_TAIL:  0x000001b8
<7> [774.131095] __i915_gem_set_wedged 	RING_CTL:   0x00000000
<7> [774.131104] __i915_gem_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [774.131110] __i915_gem_set_wedged 	RING_IMR: fffffeff
<7> [774.131122] __i915_gem_set_wedged 	ACTHD:  0x00000000_000001b8
<7> [774.131133] __i915_gem_set_wedged 	BBADDR: 0x00000000_00000000
<7> [774.131145] __i915_gem_set_wedged 	DMA_FADDR: 0x00000000_00000000
<7> [774.131151] __i915_gem_set_wedged 	IPEIR: 0x00000000
<7> [774.131158] __i915_gem_set_wedged 	IPEHR: 0x00000000
<7> [774.131167] __i915_gem_set_wedged 	Execlist status: 0x00000301 00000000, entries 6
<7> [774.131171] __i915_gem_set_wedged 	Execlist CSB read 3, write 3, tasklet queued? no (disabled)
<7> [774.131178] __i915_gem_set_wedged HWSP:
<7> [774.131183] __i915_gem_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [774.131187] __i915_gem_set_wedged *
<7> [774.131193] __i915_gem_set_wedged [0040] 00000001 00000000 00000818 00000003 00000001 00000000 00000818 00000000
<7> [774.131198] __i915_gem_set_wedged [0060] 00000001 00000000 00000818 00000001 00000000 00000000 00000000 00000003
<7> [774.131203] __i915_gem_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [774.131207] __i915_gem_set_wedged *
Comment 2 Chris Wilson 2019-07-04 20:22:04 UTC
Inside set-wedged, the only sleep is for synchronize_rcu(). That can be slow, but we already use the expedited version to try and avoid such issues. Still at the mercy of the scheduler, nothing else jumps out from the code as to why we might otherwise be stuck.
Comment 3 Francesco Balestrieri 2019-07-30 04:16:21 UTC
Given Chris' comment and the reproducibility of the issue, I'm setting the priority to Medium.
Comment 4 CI Bug Log 2019-08-05 06:21:25 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* SKL: igt@gem_eio@in-flight-contexts-immediate - dmesg-warn - intel_gt_reset_global timed out, cancelling all in-flight rendering.
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6607/shard-skl3/igt@gem_eio@in-flight-contexts-immediate.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6608/shard-skl6/igt@gem_eio@in-flight-contexts-immediate.html
Comment 5 Lakshmi 2019-08-05 06:24:04 UTC
(In reply to CI Bug Log from comment #4)
> The CI Bug Log issue associated to this bug has been updated.
> 
> ### New filters associated
> 
> * SKL: igt@gem_eio@in-flight-contexts-immediate - dmesg-warn -
> intel_gt_reset_global timed out, cancelling all in-flight rendering.
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6607/shard-skl3/
> igt@gem_eio@in-flight-contexts-immediate.html
>   -
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6608/shard-skl6/
> igt@gem_eio@in-flight-contexts-immediate.html

This is on SKL.

> [2955.440766] i915 0000:00:02.0: intel_gt_reset_global timed out, cancelling all in-flight rendering.
<7> [2961.215952] [drm:__i915_request_reset [i915]] client gem_eio[6017]: gained 1 ban score, now 1
<5> [2961.230930] Setting dangerous option reset - tainting kernel
<7> [2961.231267] [IGT] Forcing GPU reset
<7> [2961.233755] [drm:intel_gt_reset_global [i915]] resetting chip
<5> [2961.235617] i915 0000:00:02.0: Resetting chip for Manually set wedged engine mask = ffffffffffffffff
<7> [2961.238441] [IGT] Checking that the GPU recovered
<7> [2961.370782] [drm:intel_power_well_disable [i915]] disabling DC off
<7> [2961.372644] [drm:skl_enable_dc6 [i915]] Enabling DC6
<7> [2961.373042] [drm:gen9_set_dc_state [i915]] Setting DC state from 00 to 02
<5> [2961.429027] Setting dangerous option reset - tainting kernel
<7> [2961.431420] [drm:intel_power_well_enable [i915]] enabling DC off
<7> [2961.432101] [drm:gen9_set_dc_state [i915]] Setting DC state from 02 to 00
<7> [2961.610669] [drm:intel_gt_reset_global [i915]] resetting chip
<5> [2961.612419] i915 0000:00:02.0: Resetting chip for Manually set wedged engine mask = ffffffffffffffff
<7> [2961.612922] [drm:intel_gt_reset [i915]] GPU reset disabled
<7> [2961.613143] __intel_gt_set_wedged rcs0
<7> [2961.613171] __intel_gt_set_wedged 	Awake? 0
<7> [2961.613195] __intel_gt_set_wedged 	Hangcheck: 371 ms ago
<7> [2961.613215] __intel_gt_set_wedged 	Reset count: 7 (global 55)
<7> [2961.613234] __intel_gt_set_wedged 	Requests:
<7> [2961.613302] __intel_gt_set_wedged 	RING_START: 0x00ea7000
<7> [2961.613323] __intel_gt_set_wedged 	RING_HEAD:  0x00000578
<7> [2961.613344] __intel_gt_set_wedged 	RING_TAIL:  0x00000578
<7> [2961.613369] __intel_gt_set_wedged 	RING_CTL:   0x00000000
<7> [2961.613393] __intel_gt_set_wedged 	RING_MODE:  0x00000300 [idle]
<7> [2961.613411] __intel_gt_set_wedged 	RING_IMR: fffffeff
<7> [2961.614049] __intel_gt_set_wedged 	ACTHD:  0x00000000_00000578
<7> [2961.614095] __intel_gt_set_wedged 	BBADDR: 0x00000000_00000000
<7> [2961.614139] __intel_gt_set_wedged 	DMA_FADDR: 0x00000000_00000000
<7> [2961.614167] __intel_gt_set_wedged 	IPEIR: 0x00000000
<7> [2961.614191] __intel_gt_set_wedged 	IPEHR: 0x00000000
<7> [2961.614227] __intel_gt_set_wedged 	Execlist status: 0x00000301 00000000, entries 6
<7> [2961.614254] __intel_gt_set_wedged 	Execlist CSB read 3, write 3, tasklet queued? no (disabled)
<7> [2961.614292] __intel_gt_set_wedged HWSP:
<7> [2961.614329] __intel_gt_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [2961.614350] __intel_gt_set_wedged *
<7> [2961.614386] __intel_gt_set_wedged [0040] 00000001 00000000 00000818 00000043 00000001 00000000 00000818 00000000
<7> [2961.614420] __intel_gt_set_wedged [0060] 00000001 00000000 00008002 00000002 00000000 00000000 00000000 00000003
<7> [2961.614449] __intel_gt_set_wedged [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [2961.614472] __intel_gt_set_wedged *
Comment 6 CI Bug Log 2019-08-07 06:56:45 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SKL: igt@gem_eio@in-flight-contexts-immediate - dmesg-warn - intel_gt_reset_global timed out, cancelling all in-flight rendering. -}
{+ SKL: igt@gem_eio@in-flight-contexts-immediate|igt@gem_eio@in-flight-10ms - dmesg-warn/dmesg-fail - intel_gt_reset_global timed out, cancelling all in-flight rendering. +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6640/shard-skl6/igt@gem_eio@in-flight-10ms.html
Comment 7 CI Bug Log 2019-08-19 12:04:27 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SKL: igt@gem_eio@in-flight-contexts-immediate|igt@gem_eio@in-flight-10ms - dmesg-warn/dmesg-fail - intel_gt_reset_global timed out, cancelling all in-flight rendering. -}
{+ SKL GLK: igt@gem_eio@in-flight-* - dmesg-warn/dmesg-fail - intel_gt_reset_global timed out, cancelling all in-flight rendering. +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6723/shard-skl1/igt@gem_eio@in-flight-internal-1us.html
  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6733/shard-glk5/igt@gem_eio@in-flight-internal-1us.html
Comment 8 CI Bug Log 2019-09-23 12:52:39 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SKL GLK: igt@gem_eio@in-flight-* - dmesg-warn/dmesg-fail - intel_gt_reset_global timed out, cancelling all in-flight rendering. -}
{+ SKL GLK: igt@gem_eio@* - dmesg-warn/dmesg-fail - intel_gt_reset_global timed out, cancelling all in-flight rendering. +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6935/shard-skl7/igt@gem_eio@unwedge-stress.html
Comment 9 CI Bug Log 2019-10-18 07:36:38 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* SNB:  igt@gem_eio@kms- dmesg-warn - intel_gt_reset_global timed out, cancelling all in-flight rendering.
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7094/shard-snb2/igt@gem_eio@kms.html
Comment 10 CI Bug Log 2019-10-18 07:38:05 UTC
The CI Bug Log issue associated to this bug has been updated.

### Removed filters

* SNB:  igt@gem_eio@kms- dmesg-warn - intel_gt_reset_global timed out, cancelling all in-flight rendering. (added on 2 minutes ago)
Comment 11 Martin Peres 2019-11-29 19:15:48 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/intel/issues/324.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.