Bug 108681

Summary: [CI][BAT] igt@drv_selftest@live_contexts - dmesg-fail - rcs0: timed out on STOP_RING
Product: DRI
Reporter: Martin Peres <martin.peres>
Component: DRM/Intel
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: CLOSED DUPLICATE
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal
Priority: highest
CC: intel-gfx-bugs
Version: XOrg git
Hardware: Other
OS: All
Whiteboard: ReadyForDev
i915 platform: BSW/CHT
i915 features: GEM/Other

Description Martin Peres 2018-11-06 12:28:09 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5092/fi-bsw-n3050/igt@drv_selftest@live_contexts.html

<6> [451.864966] [IGT] drv_selftest: executing
<6> [451.889347] [IGT] drv_selftest: starting subtest live_contexts
<5> [452.086761] Setting dangerous option live_selftests - tainting kernel
<7> [452.172326] [drm:i915_driver_load [i915]] No PCH found.
<7> [452.172467] [drm:i915_driver_load [i915]] WOPCM size: 1024KiB
<7> [452.172581] [drm:intel_uc_init_early [i915]] enable_guc=0 (submission:no huc:no)
<7> [452.172685] [drm:intel_uc_init_early [i915]] guc_log_level=0 (enabled:no, verbose:no, verbosity:0)
<7> [452.172781] [drm:intel_power_domains_init [i915]] Allowed DC state mask 00
<6> [452.175389] [drm] Display disabled (module parameter)
<7> [452.175712] [drm:i915_ggtt_probe_hw [i915]] GGTT size = 2048M
<7> [452.176594] [drm:i915_ggtt_probe_hw [i915]] GMADR size = 128M
<7> [452.176691] [drm:i915_ggtt_probe_hw [i915]] DSM size = 32M
<6> [452.176731] [drm] Replacing VGA console driver
<7> [452.178854] [drm:i915_gem_init_stolen [i915]] GEN6_STOLEN_RESERVED = 7ed00007
<7> [452.178958] [drm:i915_gem_init_stolen [i915]] Memory reserved for graphics device: 32768K, usable: 31744K
<7> [452.183772] [drm:intel_gvt_init [i915]] GVT-g is disabled by kernel params
<7> [452.184702] [drm:intel_opregion_setup [i915]] graphic opregion physical addr: 0x7b46f000
<7> [452.184906] [drm:intel_opregion_setup [i915]] Public ACPI methods supported
<7> [452.185010] [drm:intel_opregion_setup [i915]] ASLE supported
<7> [452.185114] [drm:intel_opregion_setup [i915]] Found valid VBT in ACPI OpRegion (Mailbox #4)
<7> [452.185218] [drm:intel_bios_init [i915]] Skipping VBT init due to disabled display.
<7> [452.189285] [drm:intel_dsm_detect [i915]] no _DSM method for intel device
<7> [452.189749] [drm:i915_driver_load [i915]] rawclk rate: 200000 kHz
<7> [452.191338] [drm:intel_power_domains_init_hw [i915]] Initial PHY_CONTROL=0x050007ff
<7> [452.191523] [drm:intel_power_well_enable [i915]] enabling always-on
<7> [452.191618] [drm:intel_power_well_enable [i915]] enabling display
<7> [452.191747] [drm:intel_power_well_enable [i915]] enabling dpio-common-bc
<7> [452.197780] [drm:chv_dpio_cmn_power_well_enable [i915]] Enabled DPIO PHY0 (PHY_CONTROL=0x050007ff)
<7> [452.197934] [drm:intel_power_well_enable [i915]] enabling dpio-common-d
<7> [452.201230] [drm:chv_dpio_cmn_power_well_enable [i915]] Enabled DPIO PHY1 (PHY_CONTROL=0x050007ff)
<7> [452.203589] [drm:intel_fbc_init [i915]] Sanitized enable_fbc value: 0
<7> [452.204803] [drm:intel_modeset_init [i915]] 0 display pipe available.
<7> [452.204930] [drm:intel_modeset_init [i915]] CZ clock rate: 320000 kHz
<7> [452.205082] [drm:intel_dump_cdclk_state [i915]] Current CDCLK 200000 kHz, VCO 1600000 kHz, ref 0 kHz, bypass 0 kHz, voltage level 15
<7> [452.208084] [drm:intel_update_max_cdclk [i915]] Max CD clock rate: 320000 kHz
<7> [452.208190] [drm:intel_modeset_init [i915]] Max dotclock rate: 304000 kHz
<7> [452.213811] [drm:vlv_wm_get_hw_state [i915]] Punit not acking DDR DVFS request, assuming DDR DVFS is disabled
<7> [452.213964] [drm:intel_modeset_setup_hw_state [i915]] Initial watermarks: SR plane=127, SR cursor=0 level=1 cxsr=0
<7> [452.221027] [drm:i915_gem_init_ggtt [i915]] clearing unused GTT space: [1000, 80000000]
<7> [452.221131] [drm:intel_ctx_workarounds_init [i915]] Number of context specific w/a: 8
<7> [452.223627] [drm:i915_gem_contexts_init [i915]] logical context support initialized
<7> [452.241517] [drm:intel_init_gt_powersave [i915]] PCBR: 0x7edf8001
<7> [452.241782] [drm:intel_init_gt_powersave [i915]] GPLL reference freq: 40000 kHz
<7> [452.243176] [drm:intel_init_gt_powersave [i915]] DDR speed: 1600 MHz
<7> [452.243286] [drm:intel_init_gt_powersave [i915]] max GPU freq: 600 MHz (60)
<7> [452.243824] [drm:intel_init_gt_powersave [i915]] RPe GPU freq: 320 MHz (32)
<7> [452.243932] [drm:intel_init_gt_powersave [i915]] RP1(Guar) GPU freq: 320 MHz (32)
<7> [452.245228] [drm:intel_init_gt_powersave [i915]] min GPU freq: 200 MHz (20)
<7> [452.248385] [drm:intel_enable_gt_powersave [i915]] GPLL enabled? yes
<7> [452.249248] [drm:intel_enable_gt_powersave [i915]] GPU status: 0x00001410
<6> [452.278822] [drm] Initialized i915 1.6.0 20181102 for 0000:00:02.0 on minor 0
<7> [452.284617] [drm:intel_power_well_disable [i915]] disabling dpio-common-d
<7> [452.284851] [drm:chv_dpio_cmn_power_well_disable [i915]] Disabled DPIO PHY1 (PHY_CONTROL=0x050007fd)
<7> [452.284999] [drm:intel_power_well_disable [i915]] disabling dpio-common-bc
<7> [452.286939] [drm:chv_dpio_cmn_power_well_disable [i915]] Disabled DPIO PHY0 (PHY_CONTROL=0x050007fc)
<7> [452.288179] [drm:intel_power_well_disable [i915]] disabling display
<7> [452.295583] [drm:intel_power_well_disable [i915]] disabling always-on
<7> [452.295676] i915 device info: pciid=0x22b1 rev=0x21 platform=CHERRYVIEW gen=8
<7> [452.295682] i915 device info: is_mobile: no
<7> [452.295687] i915 device info: is_lp: yes
<7> [452.295692] i915 device info: is_alpha_support: no
<7> [452.295697] i915 device info: has_64bit_reloc: yes
<7> [452.295701] i915 device info: has_csr: no
<7> [452.295706] i915 device info: has_ddi: no
<7> [452.295711] i915 device info: has_dp_mst: no
<7> [452.295716] i915 device info: has_reset_engine: yes
<7> [452.295720] i915 device info: has_fbc: no
<7> [452.295725] i915 device info: has_fpga_dbg: no
<7> [452.295730] i915 device info: has_gmch_display: yes
<7> [452.295735] i915 device info: has_guc: no
<7> [452.295739] i915 device info: has_guc_ct: no
<7> [452.295744] i915 device info: has_hotplug: yes
<7> [452.295749] i915 device info: has_l3_dpf: no
<7> [452.295754] i915 device info: has_llc: no
<7> [452.295758] i915 device info: has_logical_ring_contexts: yes
<7> [452.295763] i915 device info: has_logical_ring_elsq: no
<7> [452.295768] i915 device info: has_logical_ring_preemption: no
<7> [452.295773] i915 device info: has_overlay: no
<7> [452.295777] i915 device info: has_pooled_eu: no
<7> [452.295782] i915 device info: has_psr: no
<7> [452.295787] i915 device info: has_rc6: yes
<7> [452.295791] i915 device info: has_rc6p: no
<7> [452.295796] i915 device info: has_runtime_pm: yes
<7> [452.295801] i915 device info: has_snoop: yes
<7> [452.295806] i915 device info: has_coherent_ggtt: no
<7> [452.295810] i915 device info: unfenced_needs_alignment: no
<7> [452.295815] i915 device info: cursor_needs_physical: no
<7> [452.295820] i915 device info: hws_needs_physical: no
<7> [452.295825] i915 device info: overlay_needs_physical: no
<7> [452.295829] i915 device info: supports_tv: no
<7> [452.295834] i915 device info: has_ipc: no
<7> [452.295839] i915 device info: slice total: 1, mask=0001
<7> [452.295844] i915 device info: subslice total: 2
<7> [452.295850] i915 device info: slice0: 2 subslices, mask=0003
<7> [452.295855] i915 device info: EU total: 12
<7> [452.295860] i915 device info: EU per subslice: 6
<7> [452.295864] i915 device info: has slice power gating: no
<7> [452.295869] i915 device info: has subslice power gating: yes
<7> [452.295874] i915 device info: has EU power gating: yes
<7> [452.295879] i915 device info: CS timestamp frequency: 12500 kHz
<6> [452.295883] [drm] DRM_I915_DEBUG enabled
<6> [452.295887] [drm] DRM_I915_DEBUG_GEM enabled
<6> [452.295891] [drm] DRM_I915_DEBUG_RUNTIME_PM enabled
<6> [452.295900] i915: Performing live selftests with st_random_seed=0x9b9e3c5f st_timeout=1000
<7> [452.313027] [drm:intel_power_well_enable [i915]] enabling always-on
<6> [453.224105] Populated 1024 contexts on rcs0 in 804182462ns
<6> [454.299050] Switch latencies on rcs0: 1 = 1315238ns, 400 = 69239ns
<6> [454.818416] Populated 1024 contexts on bcs0 in 519350638ns
<6> [455.892471] Switch latencies on bcs0: 1 = 132787ns, 430 = 64389ns
<6> [456.456468] Populated 1024 contexts on vcs0 in 563981375ns
<6> [457.505103] Switch latencies on vcs0: 1 = 1262125ns, 420 = 65240ns
<6> [458.157070] Populated 1024 contexts on vecs0 in 651951512ns
<6> [459.259360] Switch latencies on vecs0: 1 = 1015462ns, 306 = 120165ns
<6> [460.325814] Submitted 13 contexts (across 4 engines), filling 52 dwords
<7> [464.949771] missed_breadcrumb rcs0 missed breadcrumb at intel_breadcrumbs_hangcheck+0x51/0x70 [i915]
<7> [464.949819] missed_breadcrumb \x09current seqno 3c8d, last 3c8e, hangcheck 3c8d [3010 ms]
<7> [464.949826] missed_breadcrumb \x09Reset count: 0 (global 0)
<7> [464.949832] missed_breadcrumb \x09Requests:
<7> [464.949851] missed_breadcrumb \x09\x09first  3c8e [2b8d:a9] prio=2 @ 3891ms: drv_selftest[4023]/1
<7> [464.949858] missed_breadcrumb \x09\x09last   3c8e [2b8d:a9] prio=2 @ 3891ms: drv_selftest[4023]/1
<7> [464.949880] missed_breadcrumb \x09\x09active 3c8e [2b8d:a9] prio=2 @ 3891ms: drv_selftest[4023]/1
<7> [464.949887] missed_breadcrumb \x09\x09ring->start:  0x00065000
<7> [464.949892] missed_breadcrumb \x09\x09ring->head:   0x00000a10
<7> [464.949897] missed_breadcrumb \x09\x09ring->tail:   0x00000a78
<7> [464.949902] missed_breadcrumb \x09\x09ring->emit:   0x00000a80
<7> [464.949907] missed_breadcrumb \x09\x09ring->space:  0x00003380
<7> [464.949914] missed_breadcrumb [head 0a10, postfix 0a58, tail 0a80, batch 0xffffffff_ffffffff]:
<7> [464.949953] missed_breadcrumb [0000] 7a000004 01144c1c 7ffea080 00000000 00000000 00000000 04000001 18800101
<7> [464.949961] missed_breadcrumb [0020] e9bfe000 00000000 04000000 00000000 7a000004 001010a1 7ffea080 00000000
<7> [464.949968] missed_breadcrumb [0040] 00000000 00000000 7a000004 01104000 000020c0 00000000 00003c8e 00000000
<7> [464.949975] missed_breadcrumb [0060] 01000000 04000001 02800000 00000000
<7> [464.950706] missed_breadcrumb \x09RING_START: 0x00065000
<7> [464.950717] missed_breadcrumb \x09RING_HEAD:  0x00000a38
<7> [464.950725] missed_breadcrumb \x09RING_TAIL:  0x00000a78
<7> [464.950735] missed_breadcrumb \x09RING_CTL:   0x00003001
<7> [464.950746] missed_breadcrumb \x09RING_MODE:  0x00004000
<7> [464.950754] missed_breadcrumb \x09RING_IMR: fffffefe
<7> [464.950767] missed_breadcrumb \x09ACTHD:  0x00000000_e9c09de8
<7> [464.950781] missed_breadcrumb \x09BBADDR: 0x00000000_e9c09de9
<7> [464.950794] missed_breadcrumb \x09DMA_FADDR: 0x00000000_e9c09fc0
<7> [464.950802] missed_breadcrumb \x09IPEIR: 0x00000000
<7> [464.950810] missed_breadcrumb \x09IPEHR: 0x81cf8b52
<7> [464.950820] missed_breadcrumb \x09Execlist status: 0x00044032 00000001
<7> [464.950829] missed_breadcrumb \x09Execlist CSB read 4, write 4 [mmio:4], tasklet queued? no (enabled)
<7> [464.950838] missed_breadcrumb \x09\x09ELSP[0] count=1, ring->start=00065000, rq: 3c8e [2b8d:a9] prio=2 @ 3892ms: drv_selftest[4023]/1
<7> [464.950844] missed_breadcrumb \x09\x09ELSP[1] idle
<7> [464.950849] missed_breadcrumb \x09\x09HW active? 0x5
<7> [464.950911] missed_breadcrumb \x09\x09E 3c8e [2b8d:a9] prio=2 @ 3892ms: drv_selftest[4023]/1
<7> [464.950990] missed_breadcrumb \x09\x09Queue priority: -2147483648
<7> [464.951054] missed_breadcrumb \x09drv_selftest [4023] waiting for 3c8e
<7> [464.951139] missed_breadcrumb IRQ? 0x1 (breadcrumbs? yes)
<7> [464.951151] missed_breadcrumb HWSP:
<7> [464.951164] missed_breadcrumb [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [464.951175] missed_breadcrumb *
<7> [464.951188] missed_breadcrumb [0040] 00000001 00000000 00000018 00000001 00000001 00000000 00000018 00000001
<7> [464.951200] missed_breadcrumb [0060] 00000001 00000000 00000018 00000001 00000000 00000000 00000000 00000004
<7> [464.951231] missed_breadcrumb [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [464.951236] missed_breadcrumb *
<7> [464.951243] missed_breadcrumb [00c0] 00003c8d 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [464.951250] missed_breadcrumb [00e0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [464.951255] missed_breadcrumb *
<7> [464.951261] missed_breadcrumb Idle? no
<7> [467.956350] hangcheck rcs0
<7> [467.956390] hangcheck \x09current seqno 3c8d, last 3c8e, hangcheck 3c8d [6017 ms]
<7> [467.956396] hangcheck \x09Reset count: 0 (global 0)
<7> [467.956403] hangcheck \x09Requests:
<7> [467.956423] hangcheck \x09\x09first  3c8e [2b8d:a9] prio=2 @ 6898ms: drv_selftest[4023]/1
<7> [467.956431] hangcheck \x09\x09last   3c8e [2b8d:a9] prio=2 @ 6898ms: drv_selftest[4023]/1
<7> [467.956452] hangcheck \x09\x09active 3c8e [2b8d:a9] prio=2 @ 6898ms: drv_selftest[4023]/1
<7> [467.956458] hangcheck \x09\x09ring->start:  0x00065000
<7> [467.956464] hangcheck \x09\x09ring->head:   0x00000a10
<7> [467.956470] hangcheck \x09\x09ring->tail:   0x00000a78
<7> [467.956475] hangcheck \x09\x09ring->emit:   0x00000a80
<7> [467.956481] hangcheck \x09\x09ring->space:  0x00003380
<7> [467.956487] hangcheck [head 0a10, postfix 0a58, tail 0a80, batch 0xffffffff_ffffffff]:
<7> [467.956523] hangcheck [0000] 7a000004 01144c1c 7ffea080 00000000 00000000 00000000 04000001 18800101
<7> [467.956531] hangcheck [0020] e9bfe000 00000000 04000000 00000000 7a000004 001010a1 7ffea080 00000000
<7> [467.956539] hangcheck [0040] 00000000 00000000 7a000004 01104000 000020c0 00000000 00003c8e 00000000
<7> [467.956546] hangcheck [0060] 01000000 04000001 02800000 00000000
<7> [467.956572] hangcheck \x09RING_START: 0x00065000
<7> [467.956581] hangcheck \x09RING_HEAD:  0x00000a38
<7> [467.956590] hangcheck \x09RING_TAIL:  0x00000a78
<7> [467.956601] hangcheck \x09RING_CTL:   0x00003001
<7> [467.956612] hangcheck \x09RING_MODE:  0x00004000
<7> [467.956620] hangcheck \x09RING_IMR: fffffefe
<7> [467.956634] hangcheck \x09ACTHD:  0x00000000_e9c09de8
<7> [467.956648] hangcheck \x09BBADDR: 0x00000000_e9c09de9
<7> [467.956661] hangcheck \x09DMA_FADDR: 0x00000000_e9c09fc0
<7> [467.956670] hangcheck \x09IPEIR: 0x00000000
<7> [467.956678] hangcheck \x09IPEHR: 0x81cf8b52
<7> [467.956689] hangcheck \x09Execlist status: 0x00044032 00000001
<7> [467.956698] hangcheck \x09Execlist CSB read 4, write 4 [mmio:4], tasklet queued? no (enabled)
<7> [467.956707] hangcheck \x09\x09ELSP[0] count=1, ring->start=00065000, rq: 3c8e [2b8d:a9] prio=2 @ 6898ms: drv_selftest[4023]/1
<7> [467.956712] hangcheck \x09\x09ELSP[1] idle
<7> [467.956718] hangcheck \x09\x09HW active? 0x5
<7> [467.956795] hangcheck \x09\x09E 3c8e [2b8d:a9] prio=2 @ 6898ms: drv_selftest[4023]/1
<7> [467.956864] hangcheck \x09\x09Queue priority: -2147483648
<7> [467.956909] hangcheck \x09drv_selftest [4023] waiting for 3c8e
<7> [467.956979] hangcheck IRQ? 0x1 (breadcrumbs? yes)
<7> [467.956984] hangcheck HWSP:
<7> [467.956992] hangcheck [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [467.956997] hangcheck *
<7> [467.957004] hangcheck [0040] 00000001 00000000 00000018 00000001 00000001 00000000 00000018 00000001
<7> [467.957011] hangcheck [0060] 00000001 00000000 00000018 00000001 00000000 00000000 00000000 00000004
<7> [467.957019] hangcheck [0080] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [467.957024] hangcheck *
<7> [467.957031] hangcheck [00c0] 00003c8d 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [467.957038] hangcheck [00e0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7> [467.957043] hangcheck *
<7> [467.957049] hangcheck Idle? no
<6> [468.002149] [drm] GPU HANG: ecode 8:0:0x7e1074ad, in drv_selftest [4023], reason: hang on rcs0, action: reset
<6> [468.002520] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
<6> [468.002528] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
<6> [468.002533] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
<6> [468.002537] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
<6> [468.002543] [drm] GPU crash dump saved to /sys/class/drm/card0/error
<5> [468.003662] i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
<7> [468.004805] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING
<7> [468.004918] [drm:i915_gem_reset_engine [i915]] client drv_selftest[4023]/1: gained 1 ban score, now 1
<6> [468.007317] Submitted 676 dwords (across 4 engines)
<3> [468.084063] igt_ctx_readonly(): engine 'rcs0' was reset 1 times!
<3> [468.085640] i915/i915_gem_context_live_selftests: igt_ctx_readonly failed with error -5
<7> [468.085786] [drm:intel_power_well_enable [i915]] enabling display
<7> [468.086758] [drm:i915_redisable_vga_power_on [i915]] Something enabled VGA plane, disabling it
<7> [468.088300] [drm:intel_power_well_enable [i915]] enabling dpio-common-bc
<7> [468.094847] [drm:chv_dpio_cmn_power_well_enable [i915]] Enabled DPIO PHY0 (PHY_CONTROL=0x050007fd)
<7> [468.098728] [drm:intel_power_well_enable [i915]] enabling dpio-common-d
<7> [468.099057] [drm:chv_dpio_cmn_power_well_enable [i915]] Enabled DPIO PHY1 (PHY_CONTROL=0x050007ff)
<4> [470.038258] i915: probe of 0000:00:02.0 failed with error -5
<6> [470.202331] [IGT] drv_selftest: exiting, ret=99
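
For triage, the error-level lines in a log like the one above can be pulled out with a small filter. This is only a sketch: it assumes every line has the `<level> [timestamp] message` shape seen in this log, and the helper names `parse_line` and `errors` are hypothetical, not part of IGT or the CI tooling.

```python
import re

# Matches the "<level> [timestamp] message" shape of the lines quoted above.
LINE_RE = re.compile(r"^<(\d)>\s*\[\s*(\d+\.\d+)\]\s?(.*)$")


def parse_line(line):
    """Return (level, timestamp, message), or None if the line does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    return int(m.group(1)), float(m.group(2)), m.group(3)


def errors(lines, max_level=3):
    """Keep lines at or below max_level (lower number = more severe)."""
    out = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None and parsed[0] <= max_level:
            out.append(parsed)
    return out


# Three lines copied from the log above.
sample = [
    "<7> [468.004805] [drm:intel_gpu_reset [i915]] rcs0: timed out on STOP_RING",
    "<3> [468.084063] igt_ctx_readonly(): engine 'rcs0' was reset 1 times!",
    "<3> [468.085640] i915/i915_gem_context_live_selftests: igt_ctx_readonly failed with error -5",
]

for level, ts, msg in errors(sample):
    print(level, ts, msg)
```

With the default threshold this keeps the two `<3>` lines from `igt_ctx_readonly` and drops the `<7>` debug line. The STOP_RING timeout itself is logged at debug level, so a wider `max_level` would be needed to catch it.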
Comment 1 Chris Wilson 2018-11-06 12:33:13 UTC

*** This bug has been marked as a duplicate of bug 108656 ***
