Bug 105279 - [IGT] gem_ctx_isolation some subtest failed assertion: num_errors == 0
Summary: [IGT] gem_ctx_isolation some subtest failed assertion: num_errors == 0
Status: CLOSED INVALID
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: x86-64 (AMD64) All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-02-27 20:59 UTC by Octavio
Modified: 2018-03-28 16:25 UTC (History)
1 user (show)

See Also:
i915 platform:
i915 features:


Attachments
dmesg log (180.31 KB, text/plain)
2018-02-27 20:59 UTC, Octavio
no flags Details
kernel log (486.81 KB, text/plain)
2018-02-27 21:00 UTC, Octavio
no flags Details

Description Octavio 2018-02-27 20:59:10 UTC
This test fail on CFL QA 

igt@gem_ctx_isolation@vcs0-reset


======================================
        Graphic stack
======================================

======================================
             Software
======================================
kernel version              : 4.16.0-rc2-drm-intel-qa-ww9-commit-01a067a+
hostname                    : CFL-1
architecture                : x86_64
os version                  : Ubuntu 17.10
os codename                 : artful
kernel driver               : i915
bios revision               : 118.7
bios release date           : 01/04/2018
ksc                         : 1.5
hardware acceleration       : disabled
swap partition              : enabled on (/dev/nvme0n1p2)

======================================
        Graphic drivers
======================================
grep: /opt/X11R7/var/log/Xorg.0.log: No such file or directory
libdrm                      : 2.4.90
intel-gpu-tools (tag)       : intel-gpu-tools-1.21-155-ga2664f86
intel-gpu-tools (commit)    : a2664f86

======================================
             Hardware
======================================
motherboard model          : CoffeeLakeClientPlatform
motherboard id             : CoffeeLakeSUDIMMRVP
form factor                : Desktop
manufacturer               : IntelCorporation
cpu family                 : Other
cpu family id              : 6
cpu information            : Genuine Intel(R) CPU 0000 @ 3.60GHz
gpu card                   : Intel Corporation Device 3e92 (prog-if 00 [VGA controller])
memory ram                 : 15.57 GB
max memory ram             : 32 GB
cpu thread                 : 12
cpu core                   : 6
cpu model                  : 158
cpu stepping               : 10
socket                     : Other
current cd clock frequency : 337500 kHz
maximum cd clock frequency : 675000 kHz
displays connected         : eDP-1 DP-1

======================================
             Firmware
======================================
dmc fw loaded             : yes
dmc version               : 1.4
guc fw loaded             : fetch SUCCESS, load SUCCESS
guc version wanted        : wanted 9.39, found 9.39
guc version found         : wanted 9.39, found 9.39

======================================
             kernel parameters
======================================
quiet drm.debug=0x1e intel_iommu=igfx_off auto panic=1 nmi_watchdog=panic fsck.repair=yes i915.error_capture=yes log_buf_len=4M i915.alpha_support=1 i915.enable_guc=-1 resume=/dev/sda3 fastboot

======================================
Output
======================================

(gem_ctx_isolation:4353) WARNING: Register 0x22200 (BCS_SWCTRL): A=00003333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22600 (BCS_GPR[0]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22604 (BCS_GPR[1]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22608 (BCS_GPR[2]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2260c (BCS_GPR[3]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22610 (BCS_GPR[4]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22614 (BCS_GPR[5]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22618 (BCS_GPR[6]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2261c (BCS_GPR[7]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22620 (BCS_GPR[8]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22624 (BCS_GPR[9]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22628 (BCS_GPR[10]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2262c (BCS_GPR[11]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22630 (BCS_GPR[12]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22634 (BCS_GPR[13]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22638 (BCS_GPR[14]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2263c (BCS_GPR[15]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22640 (BCS_GPR[16]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22644 (BCS_GPR[17]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22648 (BCS_GPR[18]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2264c (BCS_GPR[19]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22650 (BCS_GPR[20]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22654 (BCS_GPR[21]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22658 (BCS_GPR[22]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2265c (BCS_GPR[23]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22660 (BCS_GPR[24]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22664 (BCS_GPR[25]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22668 (BCS_GPR[26]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2266c (BCS_GPR[27]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22670 (BCS_GPR[28]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22674 (BCS_GPR[29]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22678 (BCS_GPR[30]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2267c (BCS_GPR[31]): A=33333333 B=00000000
(gem_ctx_isolation:4353) CRITICAL: Test assertion failure function compare_regs, file gem_ctx_isolation.c:441:
(gem_ctx_isolation:4353) CRITICAL: Failed assertion: num_errors == 0
(gem_ctx_isolation:4353) CRITICAL: 33 registers mistached between dirty 33333333 context
.
Subtest bcs0-reset failed.
**** DEBUG ****
(gem_ctx_isolation:4353) i915/gem-context-DEBUG: Test requirement passed: has_ban_period || has_bannable
(gem_ctx_isolation:4353) igt-gt-DEBUG: Test requirement passed: has_gpu_reset(fd)
(gem_ctx_isolation:4353) igt-debugfs-DEBUG: Opening debugfs directory '/sys/kernel/debug/dri/0'
(gem_ctx_isolation:4353) drmtest-DEBUG: Test requirement passed: is_i915_device(fd) && has_known_intel_chipset(fd)
(gem_ctx_isolation:4353) igt-debugfs-DEBUG: Opening debugfs directory '/sys/kernel/debug/dri/0'
(gem_ctx_isolation:4353) ioctl-wrappers-DEBUG: Test requirement passed: dir >= 0
(gem_ctx_isolation:4353) ioctl-wrappers-DEBUG: Test requirement passed: err == 0
(gem_ctx_isolation:4353) ioctl-wrappers-DEBUG: Test requirement passed: gem_has_ring(fd, ring)
(gem_ctx_isolation:4353) igt-dummyload-DEBUG: Test requirement passed: nengine
(gem_ctx_isolation:4353) igt-gt-DEBUG: Triggering GPU reset
(gem_ctx_isolation:4353) igt-debugfs-DEBUG: Opening debugfs directory '/sys/kernel/debug/dri/0'
(gem_ctx_isolation:4353) drmtest-DEBUG: Test requirement passed: is_i915_device(fd) && has_known_intel_chipset(fd)
(gem_ctx_isolation:4353) igt-debugfs-DEBUG: Opening debugfs directory '/sys/kernel/debug/dri/0'
(gem_ctx_isolation:4353) ioctl-wrappers-DEBUG: Test requirement passed: dir >= 0
(gem_ctx_isolation:4353) ioctl-wrappers-DEBUG: Test requirement passed: err == 0
(gem_ctx_isolation:4353) ioctl-wrappers-DEBUG: Test requirement passed: gem_has_ring(fd, ring)
(gem_ctx_isolation:4353) igt-dummyload-DEBUG: Test requirement passed: nengine
(gem_ctx_isolation:4353) WARNING: Register 0x22200 (BCS_SWCTRL): A=00003333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22600 (BCS_GPR[0]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22604 (BCS_GPR[1]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22608 (BCS_GPR[2]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2260c (BCS_GPR[3]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22610 (BCS_GPR[4]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22614 (BCS_GPR[5]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22618 (BCS_GPR[6]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2261c (BCS_GPR[7]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22620 (BCS_GPR[8]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22624 (BCS_GPR[9]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22628 (BCS_GPR[10]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2262c (BCS_GPR[11]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22630 (BCS_GPR[12]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22634 (BCS_GPR[13]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22638 (BCS_GPR[14]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2263c (BCS_GPR[15]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22640 (BCS_GPR[16]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22644 (BCS_GPR[17]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22648 (BCS_GPR[18]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2264c (BCS_GPR[19]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22650 (BCS_GPR[20]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22654 (BCS_GPR[21]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22658 (BCS_GPR[22]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2265c (BCS_GPR[23]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22660 (BCS_GPR[24]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22664 (BCS_GPR[25]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22668 (BCS_GPR[26]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2266c (BCS_GPR[27]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22670 (BCS_GPR[28]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22674 (BCS_GPR[29]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x22678 (BCS_GPR[30]): A=33333333 B=00000000
(gem_ctx_isolation:4353) WARNING: Register 0x2267c (BCS_GPR[31]): A=33333333 B=00000000
(gem_ctx_isolation:4353) CRITICAL: Test assertion failure function compare_regs, file gem_ctx_isolation.c:441:
(gem_ctx_isolation:4353) CRITICAL: Failed assertion: num_errors == 0
(gem_ctx_isolation:4353) CRITICAL: 33 registers mistached between dirty 33333333 context
.
(gem_ctx_isolation:4353) igt-core-INFO: Stack trace:
(gem_ctx_isolation:4353) igt-core-INFO:   #0 [__igt_fail_assert+0x101]
(gem_ctx_isolation:4353) igt-core-INFO:   #1 [compare_regs+0x1c8]
(gem_ctx_isolation:4353) igt-core-INFO:   #2 [<unknown>+0x1c8]
****  END  ****

Note: These test are failing randomly 

In dmesg log it displays a GPU hang: code 9:0:0xfffffffe. is this expected?
Comment 1 Octavio 2018-02-27 20:59:47 UTC
Created attachment 137666 [details]
dmesg log
Comment 2 Octavio 2018-02-27 21:00:07 UTC
Created attachment 137667 [details]
kernel log
Comment 3 Chris Wilson 2018-02-27 22:35:37 UTC
It's hard to tell as you've misconfigured your kernel to drop /dev/kmsg messages and you appear to force loading the guc.
Comment 4 Elizabeth 2018-03-08 18:54:13 UTC
(In reply to Chris Wilson from comment #3)
> It's hard to tell as you've misconfigured your kernel to drop /dev/kmsg
> messages and you appear to force loading the guc.
How can I enable de /dev/kmsg messages? Thank you.
Comment 5 Chris Wilson 2018-03-09 00:28:38 UTC
Well I thought it was part of CONFIG_SECURITY_DMESG_RESTRICT but reading through, no. igt needs write permission to /dev/kmsg, so check the permission bits there and keep hunting until running "hostname > /dev/kmsg", the output appears in dmesg. We expect to see start/stop markers as igt runs through the subtests, which helps us when reading through the kernel logs to find the relevant chunks.
Comment 6 Jani Saarinen 2018-03-28 16:05:08 UTC
In CI passing:
  0.20 igt@gem_ctx_isolation@vcs0-reset pass
Comment 7 Jani Saarinen 2018-03-28 16:05:32 UTC
I propose to close, agree?
Comment 8 Elizabeth 2018-03-28 16:25:28 UTC
Last time we saw this was like five weeks ago, so I also agree.

IGT-Version: 1.21-g1fb30f1 (x86_64) (Linux: 4.16.0-rc2-drm-intel-qa-ww8-commit-562dc33+ x86_64)
Stack trace:
  #0 [__igt_fail_assert+0x101]
  #1 [compare_regs+0x1c8]
  #2 [<unknown>+0x1c8]
Subtest vcs0-reset: FAIL (0.026s)
Test requirement not met in function gem_require_engine, file ./../lib/igt_gt.h:111:
Test requirement: gem_has_engine(gem_fd, class, instance)


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.