Bug 103453

Summary: [IGT] drv_selftests/live_uncore
Product: DRI Reporter: Elizabeth <elizabethx.de.la.torre.mena>
Component: DRM/IntelAssignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: CLOSED NOTOURBUG QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: critical    
Priority: high CC: intel-gfx-bugs
Version: unspecified   
Hardware: Other   
OS: All   
Whiteboard: ReadyForDev
i915 platform: SKL i915 features:
Attachments:
Description Flags
kern_log_live_uncore none

Description Elizabeth 2017-10-25 15:05:07 UTC
Created attachment 135039 [details]
kern_log_live_uncore

with test live_uncore, platform "dies", no display output nor ssh, after command:

$sudo -E ./drv_selftests --r live_uncore

until power reset. 

======================================
        Graphic drivers
======================================
grep: /opt/X11R7/var/log/Xorg.0.log: No such file or directory
libdrm                      : 2.4.85
cairo                       : 1.15.9
intel-gpu-tools (tag)       : intel-gpu-tools-1.19-464-ge7742ee
intel-gpu-tools (commit)    : e7742ee

======================================
             Hardware
======================================
platform                   : Skylake Canyon
motherboard id             : NUC6i7KYB
form factor                : Desktop
cpu family                 : Core i7
cpu family id              : 6
cpu information            : Intel(R) Core(TM) i7-6770HQ CPU @ 2.60GHz
gpu card                   : Intel Corporation Iris Pro Graphics 580 (rev 09) (prog-if 00 [VGA controller])
memory ram                 : 31.31 GB
max memory ram             : 32 GB
cpu thread                 : 8
cpu core                   : 4
cpu model                  : 94
cpu stepping               : 3
socket                     : Other
signature                  : Type 0, Family 6, Model 94, Stepping 3
current cd clock frequency : 337500 kHz
maximum cd clock frequency : 675000 kHz
displays connected         : DP-1

======================================
             Firmware
======================================
dmc fw loaded             : yes
dmc version               : 1.26
guc fw loaded             : fetch SUCCESS, load SUCCESS
guc version wanted        : wanted 6.1, found 6.1
guc version found         : wanted 6.1, found 6.1

======================================
             kernel parameters
======================================
quiet drm.debug=0x1e pci=pcie_bus_safe intel_iommu=igfx_off auto panic=1 nmi_watchdog=panic i915.alpha_support=1 i915.enable_guc_loading=2 i915.enable_guc_submission=2 resume=/dev/sda3 fastboot
Comment 1 Chris Wilson 2017-10-25 15:15:51 UTC
Kind of expected... We depend upon the hw validation reporting an error for bad registers -- the point of the test is to make sure that the hw does report such errors if we make the mistake at runtime.
Comment 2 Elizabeth 2017-10-26 17:15:49 UTC
Rising priority since it got our test cycles stuck for hours until we manually power platforms off.
Comment 3 Chris Wilson 2017-11-06 13:21:01 UTC
Fwiw, this test is now disabled (by default):

commit 69ea47a5a98b198c9d36fe157a3986748a9e2554
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Wed Oct 25 16:32:07 2017 +0100

    drm/i915/selftests: Hide dangerous tests
    
    Some tests are designed to exercise the limits of the HW and may trigger
    unintended side-effects making the machine unusable. This should not be
    executed by default, but are still useful for early platform validation.
    
    References: https://bugs.freedesktop.org/show_bug.cgi?id=103453
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20171025153207.9589-1-chris@chris-wilson.co.uk
    Reviewed-by: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
Comment 4 Armando Antonio 2018-01-05 17:59:31 UTC
this is the execution output on skl

IGT-Version: 1.20-g6db2441 (x86_64) (Linux: 4.15.0-rc6-drm-intel-qa-ww1-commit-914d61a+ x86_64)
(drv_selftest:13871) igt-kmod-DEBUG: Test requirement passed: err == 0 || err == -ENOENT
(drv_selftest:13871) igt-kmod-DEBUG: Test requirement passed: igt_kselftest_begin(&tst) == 0
(drv_selftest:13871) igt-kmod-DEBUG: Test requirement passed: !igt_list_empty(&tests)
(drv_selftest:13871) igt-kmod-DEBUG: Could not remove module drm_kms_helper (No such file or directory)
(drv_selftest:13871) igt-kmod-DEBUG: Could not remove module drm (No such file or directory)
(drv_selftest:13871) igt-kmod-DEBUG: Test requirement passed: err == 0 || err == -ENOENT
(drv_selftest:13871) igt-kmod-DEBUG: Test requirement passed: igt_kselftest_begin(&tst) == 0
(drv_selftest:13871) igt-core-DEBUG: Starting subtest: live_uncore
Subtest live_uncore: SUCCESS (0.362s)
(drv_selftest:13871) igt-kmod-DEBUG: Test requirement passed: !igt_list_empty(&tests)
(drv_selftest:13871) igt-core-DEBUG: Exiting with status code 0

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.