Bug 107499 - [CI][BAT] igt@drv_selftest@live_hugepages - dmesg-fail - igt_ppgtt_internal_huge failed with error -22
Summary: [CI][BAT] igt@drv_selftest@live_hugepages - dmesg-fail - igt_ppgtt_internal_h...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: Other All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-08-06 14:20 UTC by Martin Peres
Modified: 2018-08-27 13:08 UTC (History)
1 user (show)

See Also:
i915 platform: BYT
i915 features: GEM/PPGTT


Attachments

Description Martin Peres 2018-08-06 14:20:01 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4584/fi-byt-squawks/igt@drv_selftest@live_hugepages.html

(drv_selftest:8275) igt_kmod-WARNING: Performing live selftests with st_random_seed=0xa0825c46 st_timeout=1000
(drv_selftest:8275) igt_kmod-WARNING: 0 (global 0)
(drv_selftest:8275) igt_kmod-WARNING: ] @ 1788ms: bcs0
(drv_selftest:8275) igt_kmod-WARNING: ] @ 1788ms: bcs0
(drv_selftest:8275) igt_kmod-WARNING: ] @ 1790ms: bcs0
(drv_selftest:8275) igt_kmod-WARNING:  0x00022000
(drv_selftest:8275) igt_kmod-WARNING:   0x000000e0
(drv_selftest:8275) igt_kmod-WARNING:   0x00000158
(drv_selftest:8275) igt_kmod-WARNING:   0x00000158
(drv_selftest:8275) igt_kmod-WARNING:  0x0001fe68
(drv_selftest:8275) igt_kmod-WARNING: 0x00022000
(drv_selftest:8275) igt_kmod-WARNING:  0x00000120
(drv_selftest:8275) igt_kmod-WARNING:  0x00000158
(drv_selftest:8275) igt_kmod-WARNING:   0x0001f001
(drv_selftest:8275) igt_kmod-WARNING:  0x00000000
(drv_selftest:8275) igt_kmod-WARNING: ffbfffff
(drv_selftest:8275) igt_kmod-WARNING: 0x00000002
(drv_selftest:8275) igt_kmod-WARNING: 0x00000002
(drv_selftest:8275) igt_kmod-WARNING:  0x00000000_34514548
(drv_selftest:8275) igt_kmod-WARNING: 0x00000000_34515541
(drv_selftest:8275) igt_kmod-WARNING: 0x00000000_34516380
(drv_selftest:8275) igt_kmod-WARNING: 0x00000008
(drv_selftest:8275) igt_kmod-WARNING: 0x00000000
(drv_selftest:8275) igt_kmod-WARNING: 0x00070000
(drv_selftest:8275) igt_kmod-WARNING: 0x00000000
(drv_selftest:8275) igt_kmod-WARNING: 0xffffffff
(drv_selftest:8275) igt_kmod-WARNING: ] @ 1790ms: bcs0
(drv_selftest:8275) igt_kmod-WARNING: -2147483648
(drv_selftest:8275) igt_kmod-WARNING: ecode 7:1:0xfffffff7, in drv_selftest [8275], reason: no progress on bcs0, action: reset
(drv_selftest:8275) igt_kmod-WARNING: 915_reset_device [i915]] resetting chip
(drv_selftest:8275) igt_kmod-WARNING: 0:02.0: Resetting chip for no progress on bcs0
(drv_selftest:8275) igt_kmod-WARNING: 000:00:02.0
(drv_selftest:8275) igt_kmod-WARNING: 915_gem_reset_engine [i915]] client drv_selftest[8275]/1: gained 1 ban score, now 1
(drv_selftest:8275) igt_kmod-WARNING: igt_ppgtt_internal_huge failed with error -22
(drv_selftest:8275) igt_kmod-WARNING: probe of 0000:00:02.0 failed with error -22
(drv_selftest:8275) igt_kmod-CRITICAL: Test assertion failure function igt_kselftest_execute, file ../lib/igt_kmod.c:520:
(drv_selftest:8275) igt_kmod-CRITICAL: Failed assertion: err == 0
(drv_selftest:8275) igt_kmod-CRITICAL: kselftest "i915 igt__21__live_hugepages=1 live_selftests=-1 disable_display=1" failed: Invalid argument [22]
(drv_selftest:8275) igt_core-INFO: Stack trace:
(drv_selftest:8275) igt_core-INFO:   #0 [__igt_fail_assert+0x180]
(drv_selftest:8275) igt_core-INFO:   #1 [igt_kselftest_execute+0x1d9]
(drv_selftest:8275) igt_core-INFO:   #2 [igt_kselftests+0x18c]
(drv_selftest:8275) igt_core-INFO:   #3 [__real_main29+0x44]
(drv_selftest:8275) igt_core-INFO:   #4 [main+0x44]
(drv_selftest:8275) igt_core-INFO:   #5 [__libc_start_main+0xe7]
(drv_selftest:8275) igt_core-INFO:   #6 [_start+0x2a]
****  END  ****
Comment 1 Chris Wilson 2018-08-06 14:28:58 UTC
<7>[  710.756031] missed_breadcrumb bcs0 missed breadcrumb at intel_breadcrumbs_hangcheck+0x51/0x70 [i915]
<7>[  710.756104] missed_breadcrumb 	current seqno 2, last 3, hangcheck 0 [1816 ms]
<7>[  710.756111] missed_breadcrumb 	Reset count: 0 (global 0)
<7>[  710.756119] missed_breadcrumb 	Requests:
<7>[  710.756138] missed_breadcrumb 		first  3 [b8:3] @ 1788ms: bcs0
<7>[  710.756144] missed_breadcrumb 		last   3 [b8:3] @ 1788ms: bcs0
<7>[  710.756173] missed_breadcrumb 		active 3 [b8:3] @ 1790ms: bcs0
<7>[  710.756180] missed_breadcrumb 		ring->start:  0x00022000
<7>[  710.756185] missed_breadcrumb 		ring->head:   0x000000e0
<7>[  710.756191] missed_breadcrumb 		ring->tail:   0x00000158
<7>[  710.756197] missed_breadcrumb 		ring->emit:   0x00000158
<7>[  710.756204] missed_breadcrumb 		ring->space:  0x0001fe68
<7>[  710.756211] missed_breadcrumb [head 00e0, postfix 0130, tail 0158, batch 0xffffffff_ffffffff]:
<7>[  710.756247] missed_breadcrumb [0000] 13244001 00000104 00000000 00000000 11000001 00022220 ffffffff 11000001
<7>[  710.756256] missed_breadcrumb [0020] 00022228 00070000 12400001 00022228 7fffc000 00000000 18800100 00010000
<7>[  710.756270] missed_breadcrumb [0040] 13204001 00000104 00000000 00000000 11000001 00002044 00000003 11000001
<7>[  710.756277] missed_breadcrumb [0060] 00012040 00000003 10800001 000000c0 00000003 01000000
<7>[  710.756952] missed_breadcrumb 	RING_START: 0x00022000
<7>[  710.756961] missed_breadcrumb 	RING_HEAD:  0x00000120
<7>[  710.756967] missed_breadcrumb 	RING_TAIL:  0x00000158
<7>[  710.756974] missed_breadcrumb 	RING_CTL:   0x0001f001
<7>[  710.756981] missed_breadcrumb 	RING_MODE:  0x00000000
<7>[  710.756987] missed_breadcrumb 	RING_IMR: ffbfffff
<7>[  710.756993] missed_breadcrumb 	SYNC_0: 0x00000002
<7>[  710.756999] missed_breadcrumb 	SYNC_1: 0x00000002
<7>[  710.757005] missed_breadcrumb 	ACTHD:  0x00000000_34514548
<7>[  710.757014] missed_breadcrumb 	BBADDR: 0x00000000_34515541
<7>[  710.757020] missed_breadcrumb 	DMA_FADDR: 0x00000000_34516380
<7>[  710.757026] missed_breadcrumb 	IPEIR: 0x00000008
<7>[  710.757032] missed_breadcrumb 	IPEHR: 0x00000000
<7>[  710.757037] missed_breadcrumb 	PP_DIR_BASE: 0x00070000
<7>[  710.757043] missed_breadcrumb 	PP_DIR_BASE_READ: 0x00000000
<7>[  710.757049] missed_breadcrumb 	PP_DIR_DCLV: 0xffffffff
<7>[  710.757105] missed_breadcrumb 		E 3 [b8:3] @ 1790ms: bcs0
<7>[  710.757185] missed_breadcrumb 		Queue priority: -2147483648
<7>[  710.757247] missed_breadcrumb 	drv_selftest [8275] waiting for 3
<7>[  710.757326] missed_breadcrumb IRQ? 0x0 (breadcrumbs? no)
<7>[  710.757333] missed_breadcrumb HWSP:
<7>[  710.757341] missed_breadcrumb [0000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7>[  710.757345] missed_breadcrumb *
<7>[  710.757352] missed_breadcrumb [00c0] 00000002 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7>[  710.757359] missed_breadcrumb [00e0] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
<7>[  710.757363] missed_breadcrumb *
<7>[  710.757369] missed_breadcrumb Idle? no

Is saying that we didn't end up in our batch.
Comment 2 Chris Wilson 2018-08-06 19:43:21 UTC
commit e6a59382924e2d007b554a2aebcd4445ebb01fef (HEAD -> drm-intel-next-queued, drm-intel/drm-intel-next-queued)
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Mon Aug 6 15:46:04 2018 +0100

    drm/i915/selftests: Unconditionally do a chipset flush before emit_bb_start
    
    Experience teaches us over and over again that coherency on Baytrail
    requires the odd heavy hammer, and in particular clflush alone is not
    enough to guarrantee that writes from the CPU are picked up by the CS.
    Do as we do elsewhere and ensure we have an unconditional
    i915_gem_chipset_flush() after writing to memory and submitting a batch
    to HW.
    
    Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107499
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Reviewed-by: Matthew Auld <matthew.auld@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20180806144604.8346-1-chris@chris-wilson.co.uk
Comment 3 James Ausmus 2018-08-21 00:30:37 UTC
Tests showing all green, closing
Comment 4 Lakshmi 2018-08-24 05:33:05 UTC
Last seen 3 weeks before, Closing the bug.
Comment 5 Lakshmi 2018-08-27 13:08:50 UTC
This issue occurred only once 117 rounds ago. No issues noticed recently. Closing this bug.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.