Bug 111144

Summary: [CI][SHARDS] igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine bcs0 failed, err=-5
Product: DRI Reporter: Martin Peres <martin.peres>
Component: DRM/IntelAssignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: RESOLVED MOVED QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal    
Priority: medium CC: intel-gfx-bugs
Version: XOrg git   
Hardware: Other   
OS: All   
Whiteboard: ReadyForDev
i915 platform: ICL i915 features: GEM/Other

Description Martin Peres 2019-07-16 09:05:39 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6487/fi-icl-u4/igt@i915_selftest@live_hangcheck.html

<7> [756.897310] __intel_gt_set_wedged rcs0
<7> [756.897344] __intel_gt_set_wedged 	Awake? 1
<7> [756.897347] __intel_gt_set_wedged 	Hangcheck: 5765 ms ago
<7> [756.897350] __intel_gt_set_wedged 	Reset count: 4032 (global 43)
<7> [756.897356] __intel_gt_set_wedged 	Requests:
<7> [756.898056] __intel_gt_set_wedged 	RING_START: 0x0017b000
<7> [756.898064] __intel_gt_set_wedged 	RING_HEAD:  0x000004e0
<7> [756.898084] __intel_gt_set_wedged 	RING_TAIL:  0x000004e0
<7> [756.898101] __intel_gt_set_wedged 	RING_CTL:   0x00000000
<7> [756.898892] __intel_gt_set_wedged 	RING_MODE:  0x00000200 [idle]
<7> [756.898897] __intel_gt_set_wedged 	RING_IMR: 00000000
<7> [756.898908] __intel_gt_set_wedged 	ACTHD:  0x00000000_000004e0
<7> [756.898919] __intel_gt_set_wedged 	BBADDR: 0x00000000_00000000
<7> [756.898944] __intel_gt_set_wedged 	DMA_FADDR: 0x00000000_00000000
<7> [756.900178] __intel_gt_set_wedged 	IPEIR: 0x00000000
<7> [756.900878] __intel_gt_set_wedged 	IPEHR: 0x00000000
<7> [756.902366] __intel_gt_set_wedged 	Execlist status: 0x00018001 00000000, entries 12
<7> [756.902383] __intel_gt_set_wedged 	Execlist CSB read 1, write 1, tasklet queued? no (enabled)
<7> [756.902405] __intel_gt_set_wedged 		E  29003:16!  prio=-4093 @ 4769ms: [i915]
<7> [756.902443] __intel_gt_set_wedged HWSP:
<7> [756.902448] __intel_gt_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.902451] __intel_gt_set_wedged *
<7> [756.902456] __intel_gt_set_wedged [0040] 10000001 00000000 10000018 00000000 10000001 00000000 10000018 00000000
<7> [756.902461] __intel_gt_set_wedged [0060] 10008002 00000020 10000018 00000020 10000001 00000000 10008002 00000020
<7> [756.902465] __intel_gt_set_wedged [0080] 10008002 00000020 10000001 00000000 10008002 00000020 10008002 00000020
<7> [756.902470] __intel_gt_set_wedged [00a0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000001
<7> [756.902474] __intel_gt_set_wedged [00c0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.902477] __intel_gt_set_wedged *
<7> [756.904053] __intel_gt_set_wedged Idle? yes
<7> [756.904067] __intel_gt_set_wedged bcs0
<7> [756.904070] __intel_gt_set_wedged 	Awake? 16
<7> [756.904073] __intel_gt_set_wedged 	Hangcheck: 5766 ms ago
<7> [756.904076] __intel_gt_set_wedged 	Reset count: 5780 (global 43)
<7> [756.904095] __intel_gt_set_wedged 	Requests:
<7> [756.904896] __intel_gt_set_wedged 	RING_START: 0x001f1000
<7> [756.904902] __intel_gt_set_wedged 	RING_HEAD:  0x00001f24
<7> [756.904908] __intel_gt_set_wedged 	RING_TAIL:  0x00001f40
<7> [756.904920] __intel_gt_set_wedged 	RING_CTL:   0x00003001
<7> [756.904954] __intel_gt_set_wedged 	RING_MODE:  0x00000000
<7> [756.906419] __intel_gt_set_wedged 	RING_IMR: 00000000
<7> [756.906855] __intel_gt_set_wedged 	ACTHD:  0x00000000_00001f24
<7> [756.906881] __intel_gt_set_wedged 	BBADDR: 0x00000000_00000000
<7> [756.907711] __intel_gt_set_wedged 	DMA_FADDR: 0x00000000_001f2f40
<7> [756.907721] __intel_gt_set_wedged 	IPEIR: 0x00000000
<7> [756.907737] __intel_gt_set_wedged 	IPEHR: 0x7a000004
<7> [756.908699] __intel_gt_set_wedged 	Execlist status: 0x40202098 60000200, entries 12
<7> [756.908703] __intel_gt_set_wedged 	Execlist CSB read 5, write 5, tasklet queued? no (enabled)
<7> [756.908746] __intel_gt_set_wedged 		Active[0: ring:{start:001f1000, hwsp:fedf9500, seqno:000000f8}, rq:  2903a:fa-  prio=2 @ 5465ms: igt/bcs0[5322]
<7> [756.908752] __intel_gt_set_wedged 		Active[1: ring:{start:001f5000, hwsp:fedf9540, seqno:000000f8}, rq:  2903b:fa  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908757] __intel_gt_set_wedged 		Pending[0] ring:{start:001f5000, hwsp:fedf9540, seqno:000000f8}, rq:  2903b:fa  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908763] __intel_gt_set_wedged 		Pending[1] ring:{start:001f9000, hwsp:fedf9580, seqno:000000f8}, rq:  2903c:fa  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908794] __intel_gt_set_wedged 		E  2903b:fa  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908798] __intel_gt_set_wedged 		E  2903c:fa  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908801] __intel_gt_set_wedged 		Queue priority hint: 3
<7> [756.908805] __intel_gt_set_wedged 		Q  29035:fc  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908810] __intel_gt_set_wedged 		Q  29036:fc  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908814] __intel_gt_set_wedged 		Q  29037:fc  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908819] __intel_gt_set_wedged 		Q  29038:fc  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908823] __intel_gt_set_wedged 		Q  29039:fc  prio=3 @ 5464ms: igt/bcs0[5322]
<7> [756.908827] __intel_gt_set_wedged 		Q  2903a:fa-  prio=2 @ 5465ms: igt/bcs0[5322]
<7> [756.908831] __intel_gt_set_wedged 		Q  2903a:fc  prio=2 @ 5464ms: igt/bcs0[5322]
<7> [756.908878] __intel_gt_set_wedged HWSP:
<7> [756.908883] __intel_gt_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.908926] __intel_gt_set_wedged *
<7> [756.908931] __intel_gt_set_wedged [0040] 00000014 600001a0 00000018 600001c0 00000001 60000000 00000014 600001e0
<7> [756.908951] __intel_gt_set_wedged [0060] 00008002 60000200 00008002 60000200 00000014 60000220 00000018 60000240
<7> [756.908972] __intel_gt_set_wedged [0080] 00000001 60000000 00000014 60000160 00000018 60000180 00000001 60000000
<7> [756.908991] __intel_gt_set_wedged [00a0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000005
<7> [756.908996] __intel_gt_set_wedged [00c0] 00000000 00000000 00000001 00000000 00000000 00000000 00000000 00000000
<7> [756.909029] __intel_gt_set_wedged [00e0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.909033] __intel_gt_set_wedged *
<7> [756.909069] __intel_gt_set_wedged Idle? no
<7> [756.909073] __intel_gt_set_wedged vcs0
<7> [756.909075] __intel_gt_set_wedged 	Awake? 16
<7> [756.909094] __intel_gt_set_wedged 	Hangcheck: 5775 ms ago
<7> [756.909097] __intel_gt_set_wedged 	Reset count: 8087 (global 43)
<7> [756.909156] __intel_gt_set_wedged 	Requests:
<7> [756.909614] __intel_gt_set_wedged 	RING_START: 0x001c1000
<7> [756.909620] __intel_gt_set_wedged 	RING_HEAD:  0x00003db8
<7> [756.909625] __intel_gt_set_wedged 	RING_TAIL:  0x00003db8
<7> [756.909634] __intel_gt_set_wedged 	RING_CTL:   0x00003000
<7> [756.909664] __intel_gt_set_wedged 	RING_MODE:  0x00000000
<7> [756.910810] __intel_gt_set_wedged 	RING_IMR: 00000000
<7> [756.912183] __intel_gt_set_wedged 	ACTHD:  0x00000000_07c03e78
<7> [756.913066] __intel_gt_set_wedged 	BBADDR: 0x00000000_00000000
<7> [756.914768] __intel_gt_set_wedged 	DMA_FADDR: 0x00000000_001bcf78
<7> [756.914790] __intel_gt_set_wedged 	IPEIR: 0x00000000
<7> [756.914814] __intel_gt_set_wedged 	IPEHR: 0x0e40c002
<7> [756.915619] __intel_gt_set_wedged 	Execlist status: 0x00018001 200000a0, entries 12
<7> [756.915623] __intel_gt_set_wedged 	Execlist CSB read 9, write 9, tasklet queued? no (enabled)
<7> [756.915629] __intel_gt_set_wedged 		E  29030:7dfa!  prio=3 @ 1ms: igt/vcs0[5323]
<7> [756.915633] __intel_gt_set_wedged 		E  29031:7dfa!  prio=3 @ 1ms: igt/vcs0[5323]
<7> [756.915639] __intel_gt_set_wedged 		E  29032:7dfa!  prio=3 @ 1ms: igt/vcs0[5323]
<7> [756.915644] __intel_gt_set_wedged 		E  29033:7dfa!  prio=3 @ 1ms: igt/vcs0[5323]
<7> [756.915648] __intel_gt_set_wedged 		E  29034:7dfa!  prio=3 @ 1ms: igt/vcs0[5323]
<7> [756.915652] __intel_gt_set_wedged 		E  2902c:7dfc!  prio=3 @ 1ms: igt/vcs0[5323]
<7> [756.915656] __intel_gt_set_wedged 		E  2902e:7dfc!  prio=3 @ 1ms: igt/vcs0[5323]
<7> [756.915660] __intel_gt_set_wedged 		E  2902f:7dfc!  prio=3 @ 1ms: igt/vcs0[5323]
<7> [756.915681] __intel_gt_set_wedged HWSP:
<7> [756.915686] __intel_gt_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.915704] __intel_gt_set_wedged *
<7> [756.915709] __intel_gt_set_wedged [0040] 00000001 20000000 00000018 20000120 00000001 20000000 00000018 20000140
<7> [756.915729] __intel_gt_set_wedged [0060] 00000001 20000000 00000018 20000040 00000001 20000000 00000018 20000080
<7> [756.915733] __intel_gt_set_wedged [0080] 00000001 20000000 00000018 200000a0 00000001 20000000 00000018 20000100
<7> [756.915738] __intel_gt_set_wedged [00a0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000009
<7> [756.915742] __intel_gt_set_wedged [00c0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.915781] __intel_gt_set_wedged *
<7> [756.918237] __intel_gt_set_wedged Idle? no
<7> [756.918313] __intel_gt_set_wedged vcs2
<7> [756.918316] __intel_gt_set_wedged 	Awake? 16
<7> [756.918319] __intel_gt_set_wedged 	Hangcheck: 5785 ms ago
<7> [756.918322] __intel_gt_set_wedged 	Reset count: 8087 (global 43)
<7> [756.918325] __intel_gt_set_wedged 	Requests:
<7> [756.919082] __intel_gt_set_wedged 	RING_START: 0x001b5000
<7> [756.919099] __intel_gt_set_wedged 	RING_HEAD:  0x00002cf8
<7> [756.919931] __intel_gt_set_wedged 	RING_TAIL:  0x00002cf8
<7> [756.920909] __intel_gt_set_wedged 	RING_CTL:   0x00003000
<7> [756.922458] __intel_gt_set_wedged 	RING_MODE:  0x00000000
<7> [756.923328] __intel_gt_set_wedged 	RING_IMR: 00000000
<7> [756.923345] __intel_gt_set_wedged 	ACTHD:  0x00000000_07a02d78
<7> [756.923370] __intel_gt_set_wedged 	BBADDR: 0x00000000_00000000
<7> [756.924184] __intel_gt_set_wedged 	DMA_FADDR: 0x00000000_0020fdb8
<7> [756.924189] __intel_gt_set_wedged 	IPEIR: 0x00000000
<7> [756.924195] __intel_gt_set_wedged 	IPEHR: 0x0e40c002
<7> [756.924208] __intel_gt_set_wedged 	Execlist status: 0x00001098 20020300, entries 12
<7> [756.924226] __intel_gt_set_wedged 	Execlist CSB read 8, write 8, tasklet queued? no (enabled)
<7> [756.924232] __intel_gt_set_wedged 		E  2902b:7b6e!  prio=3 @ 1ms: igt/vcs2[5324]
<7> [756.924235] __intel_gt_set_wedged 		E  2902d:7b6e!  prio=3 @ 1ms: igt/vcs2[5324]
<7> [756.924239] __intel_gt_set_wedged 		E  2903d:7b6e!  prio=3 @ 1ms: igt/vcs2[5324]
<7> [756.924243] __intel_gt_set_wedged 		E  2903e:7b6e!  prio=3 @ 1ms: igt/vcs2[5324]
<7> [756.924247] __intel_gt_set_wedged 		E  2903f:7b6e!  prio=3 @ 1ms: igt/vcs2[5324]
<7> [756.924251] __intel_gt_set_wedged 		E  29040:7b6e!  prio=3 @ 1ms: igt/vcs2[5324]
<7> [756.924254] __intel_gt_set_wedged 		E  29042:7b6e!  prio=3 @ 1ms: igt/vcs2[5324]
<7> [756.924258] __intel_gt_set_wedged 		E  29044:7b6e!  prio=3 @ 0ms: igt/vcs2[5324]
<7> [756.924277] __intel_gt_set_wedged HWSP:
<7> [756.924280] __intel_gt_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.924281] __intel_gt_set_wedged *
<7> [756.924294] __intel_gt_set_wedged [0040] 00000018 20020280 00000001 20020000 00000018 200202a0 00000001 20020000
<7> [756.924297] __intel_gt_set_wedged [0060] 00000018 200202c0 00000001 20020000 00000018 20020300 00000001 20020000
<7> [756.924299] __intel_gt_set_wedged [0080] 00000018 20020340 00000001 20020000 00000018 20020020 00000001 20020000
<7> [756.924301] __intel_gt_set_wedged [00a0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 0000000a
<7> [756.924304] __intel_gt_set_wedged [00c0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.924305] __intel_gt_set_wedged *
<7> [756.925076] __intel_gt_set_wedged Idle? no
<7> [756.925079] __intel_gt_set_wedged vecs0
<7> [756.925082] __intel_gt_set_wedged 	Awake? 16
<7> [756.925084] __intel_gt_set_wedged 	Hangcheck: 5771 ms ago
<7> [756.925087] __intel_gt_set_wedged 	Reset count: 5770 (global 43)
<7> [756.925090] __intel_gt_set_wedged 	Requests:
<7> [756.925913] __intel_gt_set_wedged 	RING_START: 0x0021d000
<7> [756.925919] __intel_gt_set_wedged 	RING_HEAD:  0x00003b38
<7> [756.926911] __intel_gt_set_wedged 	RING_TAIL:  0x00003bf8
<7> [756.928983] __intel_gt_set_wedged 	RING_CTL:   0x00003001
<7> [756.930276] __intel_gt_set_wedged 	RING_MODE:  0x00000000 [idle]
<7> [756.930432] __intel_gt_set_wedged 	RING_IMR: 00000000
<7> [756.931130] __intel_gt_set_wedged 	ACTHD:  0x00000000_07c03df8
<7> [756.931156] __intel_gt_set_wedged 	BBADDR: 0x00000000_00000000
<7> [756.932032] __intel_gt_set_wedged 	DMA_FADDR: 0x00000000_0020cdf8
<7> [756.933125] __intel_gt_set_wedged 	IPEIR: 0x00000000
<7> [756.933991] __intel_gt_set_wedged 	IPEHR: 0x0e40c002
<7> [756.934539] __intel_gt_set_wedged 	Execlist status: 0x00018001 40000320, entries 12
<7> [756.934558] __intel_gt_set_wedged 	Execlist CSB read 2, write 2, tasklet queued? no (enabled)
<7> [756.934564] __intel_gt_set_wedged 		E  29045:7dfc!  prio=3 @ 1ms: igt/vecs0[5325]
<7> [756.934568] __intel_gt_set_wedged 		E  29046:7dfc!  prio=3 @ 1ms: igt/vecs0[5325]
<7> [756.934572] __intel_gt_set_wedged 		E  29047:7dfc!  prio=3 @ 1ms: igt/vecs0[5325]
<7> [756.934578] __intel_gt_set_wedged 		E  29048:7dfc!  prio=3 @ 1ms: igt/vecs0[5325]
<7> [756.934582] __intel_gt_set_wedged 		E  29049:7dfc!  prio=3 @ 1ms: igt/vecs0[5325]
<7> [756.934586] __intel_gt_set_wedged 		E  2904a:7dfc!  prio=3 @ 1ms: igt/vecs0[5325]
<7> [756.934590] __intel_gt_set_wedged 		E  29041:7dfe!  prio=3 @ 1ms: igt/vecs0[5325]
<7> [756.934594] __intel_gt_set_wedged 		E  29043:7dfe!  prio=3 @ 0ms: igt/vecs0[5325]
<7> [756.934607] __intel_gt_set_wedged HWSP:
<7> [756.934627] __intel_gt_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.934630] __intel_gt_set_wedged *
<7> [756.934668] __intel_gt_set_wedged [0040] 00000018 400002e0 00000001 40000000 00000018 40000320 00000001 40000000
<7> [756.934673] __intel_gt_set_wedged [0060] 00000018 40000360 00000014 400003a0 00000018 400003c0 00000001 40000000
<7> [756.934693] __intel_gt_set_wedged [0080] 00000018 400003e0 00000001 40000000 00000018 40000400 00000001 40000000
<7> [756.934697] __intel_gt_set_wedged [00a0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000006
<7> [756.934702] __intel_gt_set_wedged [00c0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [756.934739] __intel_gt_set_wedged *
<7> [756.935415] __intel_gt_set_wedged Idle? no
<3> [756.937963] kthread for other engine bcs0 failed, err=-5
Comment 1 CI Bug Log 2019-07-16 09:06:30 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine bcs0 failed, err=-5
  - https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_13557/shard-iclb1/igt@i915_selftest@live_hangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6487/fi-icl-u4/igt@i915_selftest@live_hangcheck.html
  - https://intel-gfx-ci.01.org/tree/drm-tip/Trybot_4615/shard-iclb4/igt@i915_selftest@live_hangcheck.html
Comment 2 CI Bug Log 2019-07-16 12:14:34 UTC
A CI Bug Log filter associated to this bug has been updated:

{- ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine bcs0 failed, err=-5 -}
{+ ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0) failed, err=-5 +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6491/fi-icl-u3/igt@i915_selftest@live_hangcheck.html
Comment 3 Chris Wilson 2019-07-20 11:46:12 UTC
(In reply to CI Bug Log from comment #2)
> A CI Bug Log filter associated to this bug has been updated:
> 
> {- ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other
> engine bcs0 failed, err=-5 -}
> {+ ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other
> engine (vecs|bcs0) failed, err=-5 +}
> 
> New failures caught by the filter:
> 
>   *
> https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6491/fi-icl-u3/
> igt@i915_selftest@live_hangcheck.html

This looks like a different bug:

<7> [633.865426] __intel_gt_set_wedged vecs0
<7> [633.865427] __intel_gt_set_wedged 	Awake? 17
<7> [633.865429] __intel_gt_set_wedged 	Hangcheck: 5585 ms ago
<7> [633.865430] __intel_gt_set_wedged 	Reset count: 4715 (global 23)
<7> [633.865432] __intel_gt_set_wedged 	Requests:
<7> [633.867108] __intel_gt_set_wedged 	RING_START: 0x00000000
<7> [633.867942] __intel_gt_set_wedged 	RING_HEAD:  0x00000000
<7> [633.868793] __intel_gt_set_wedged 	RING_TAIL:  0x00000000
<7> [633.868830] __intel_gt_set_wedged 	RING_CTL:   0x00000000
<7> [633.869601] __intel_gt_set_wedged 	RING_MODE:  0x00000200 [idle]
<7> [633.869612] __intel_gt_set_wedged 	RING_IMR: 00000000
<7> [633.869736] __intel_gt_set_wedged 	ACTHD:  0x00000000_00000000
<7> [633.871397] __intel_gt_set_wedged 	BBADDR: 0x00000000_00000000
<7> [633.873097] __intel_gt_set_wedged 	DMA_FADDR: 0x00000000_00000000
<7> [633.873104] __intel_gt_set_wedged 	IPEIR: 0x00000000
<7> [633.873108] __intel_gt_set_wedged 	IPEHR: 0x00000000
<7> [633.874792] __intel_gt_set_wedged 	Execlist status: 0x00000001 00000000, entries 12
<7> [633.874793] __intel_gt_set_wedged 	Execlist CSB read 9, write 9, tasklet queued? no (enabled)
<7> [633.874810] __intel_gt_set_wedged 		Active[0: ring:{start:001b2000, hwsp:fedf9240, seqno:0000006b}, rq:  1ba1f:6c*  prio=1027 @ 5532ms: [i915]
<7> [633.874825] __intel_gt_set_wedged 		Active[1: ring:{start:002d5000, hwsp:fedf9c40, seqno:00000016}, rq:  1ba47:18-  prio=79 @ 5531ms: igt/vecs0[5103]
<7> [633.874829] __intel_gt_set_wedged 		E  1ba1f:6c*  prio=1027 @ 5532ms: [i915]
<7> [633.874831] __intel_gt_set_wedged 		E  1ba47:18-  prio=79 @ 5531ms: igt/vecs0[5103]
<7> [633.874833] __intel_gt_set_wedged 		Queue priority hint: 431
<7> [633.874836] __intel_gt_set_wedged 		Q  1ba42:1a  prio=431 @ 5530ms: igt/vecs0[5103]
<7> [633.874839] __intel_gt_set_wedged 		Q  1ba3c:1a  prio=367 @ 5530ms: igt/vecs0[5103]
<7> [633.874842] __intel_gt_set_wedged 		Q  1ba45:1a  prio=251 @ 5529ms: igt/vecs0[5103]
<7> [633.874844] __intel_gt_set_wedged 		Q  1ba39:1a  prio=235 @ 5530ms: igt/vecs0[5103]
<7> [633.874847] __intel_gt_set_wedged 		Q  1ba46:1a  prio=159 @ 5529ms: igt/vecs0[5103]
<7> [633.874849] __intel_gt_set_wedged 		Q  1ba3f:1a  prio=143 @ 5530ms: igt/vecs0[5103]
<7> [633.874852] __intel_gt_set_wedged 		Q  1ba35:1a  prio=107 @ 5530ms: igt/vecs0[5103]
<7> [633.874854] __intel_gt_set_wedged 		Q  1ba47:1a  prio=39 @ 5529ms: igt/vecs0[5103]
<7> [633.874856] __intel_gt_set_wedged HWSP:
<7> [633.874858] __intel_gt_set_wedged [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [633.874860] __intel_gt_set_wedged *
<7> [633.874862] __intel_gt_set_wedged [0040] 00000001 40000000 00000014 40000020 00000018 400003c0 00000001 40000000
<7> [633.874865] __intel_gt_set_wedged [0060] 00000014 40000480 00000018 400004e0 00000001 40000000 00000018 40000500
<7> [633.874867] __intel_gt_set_wedged [0080] 00000001 40000000 00008002 40000020 00000018 400004e0 00000001 40000000
<7> [633.874869] __intel_gt_set_wedged [00a0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000009
<7> [633.874871] __intel_gt_set_wedged [00c0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [633.874873] __intel_gt_set_wedged *
<7> [633.874877] __intel_gt_set_wedged Idle? no
<7> [633.874879] __intel_gt_set_wedged Signals:
<7> [633.874891] __intel_gt_set_wedged 	[1ba47:18] @ 5531ms

All the registers are zero, same symptoms as a powergating bug from gen9.
Comment 4 Francesco Balestrieri 2019-08-06 05:00:07 UTC
Last occurrence from 4 days ago:

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6590/shard-iclb3/igt@i915_selftest@live_hangcheck.html
Comment 5 CI Bug Log 2019-08-08 08:49:23 UTC
A CI Bug Log filter associated to this bug has been updated:

{- ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0) failed, err=-5 -}
{+ ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0|vcs2) failed, err=-5 +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6645/shard-iclb2/igt@i915_selftest@live_hangcheck.html
Comment 6 CI Bug Log 2019-08-08 08:53:24 UTC
The CI Bug Log issue associated to this bug has been updated.

### Removed filters

* ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0|vcs2) failed, err=-5 (added on 4 minutes ago)

### New filters associated

* ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0) failed, err=-5
  (No new failures associated)
Comment 7 CI Bug Log 2019-08-08 08:55:29 UTC
A CI Bug Log filter associated to this bug has been updated:

{- ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0) failed, err=-5 -}
{+ ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0|vcs0) failed, err=-5 +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6653/fi-icl-dsi/igt@i915_selftest@live_hangcheck.html
Comment 8 CI Bug Log 2019-08-14 08:25:46 UTC
A CI Bug Log filter associated to this bug has been updated:

{- ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0|vcs0) failed, err=-5 -}
{+ ICL: igt@i915_selftest@live_hangcheck - dmesg-fail - kthread for other engine (vecs|bcs0|vcs0|vcs1) failed, err=-5 +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/IGT_5131/fi-icl-guc/igt@i915_selftest@live_hangcheck.html
Comment 9 Francesco Balestrieri 2019-11-11 10:08:22 UTC
Last seen two weeks ago, but was occurring about once a week before that. Lowering to medium for now.
Comment 10 Martin Peres 2019-11-29 19:16:57 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/intel/issues/333.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.