Bug 101701

Summary: [IGT] igt@gem_mmap@swap-bo causing GPU hang
Product: DRI
Reporter: Ricardo Madrigal <ricardox.madrigal.maldonado>
Component: DRM/Intel
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: CLOSED FIXED
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal
Priority: medium
CC: intel-gfx-bugs
Version: DRI git
Hardware: x86-64 (AMD64)
OS: Linux (All)
Whiteboard: ReadyForDev
i915 platform: ALL
i915 features: GPU hang
Attachments:
Description     Flags
dmesg           none
i915_gpu_info   none

Description Ricardo Madrigal 2017-07-05 19:21:59 UTC
Created attachment 132463 [details]
dmesg

After the test starts, the platform works for only a few minutes. After that it stops responding and the screen shows only the last command entered.
No further information could be gathered about this issue.

The following test case causes a GPU hang:

====================================
           Steps to reproduce
====================================
1.  Start the system in text mode
2.  ./gem_mmap --r swap-bo --d


Configuration:

======================================
             Software
======================================
architecture                : x86_64
os version                  : Ubuntu 16.10
os codename                 : yakkety
kernel driver               : i915
swap partition              : enabled on (/dev/sda3)


======================================
                 Kernel
======================================
commit df0182c2c95385492772c6e4ace76b463298b8ca
Author:     Jani Nikula <jani.nikula@intel.com>
AuthorDate: Mon Jul 3 16:20:53 2017 +0300
Commit:     Jani Nikula <jani.nikula@intel.com>
CommitDate: Mon Jul 3 16:20:53 2017 +0300

drm-tip: 2017y-07m-03d-13h-20m-24s UTC integration manifest


======================================
        Graphic drivers
======================================
Component: drm
    tag: libdrm-2.4.81-24-g3095cc8
    commit: 3095cc8eaba1aa87ad38c04ae2b1eabe30f7e16c

Component: cairo
    tag: 1.15.6-2-g57b4050
    commit: 57b40507dda3f58dfc8635548d606b86dc7bcf51

Component: intel-gpu-tools
    tag: intel-gpu-tools-1.19-57-g6fcc8e8
    commit: 6fcc8e8b247661c7950b998e0b95141ffbd6b833

Component: piglit
    tag: piglit-v1
    commit: c8f4fd9eeb298a2ef0855927f22634f794ef3eff


======================================
             Attachments
======================================
dmesg.log
i915_gpu_info
Comment 1 Ricardo Madrigal 2017-07-05 19:23:19 UTC
Created attachment 132464 [details]
i915_gpu_info
Comment 2 Elizabeth 2017-07-19 18:34:42 UTC
Adding tag into "Whiteboard" field - ReadyForDev
*Status is correct
*Platform is included
*Feature is included
*Priority and Severity correctly set
*Logs included
Comment 3 Chris Wilson 2017-07-19 18:44:47 UTC
It's designed to trigger a pagefault-of-doom with the current whole-object-at-a-time model. There's no GPU hang (at least not that you have shown), the system should just be very, very slow.
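
As a rough illustration of the point above, the toy model below compares how many pages must be resident when a single access populates the whole object versus only the touched range. It is a sketch under stated assumptions: the 4096-byte page size, the 8 GiB object size, and both function names are hypothetical and are not taken from the i915 driver or from IGT.

```python
PAGE_SIZE = 4096  # assumed page size in bytes


def pages_for_whole_object(object_size: int) -> int:
    """Pages that must be resident if one access populates the entire object."""
    return (object_size + PAGE_SIZE - 1) // PAGE_SIZE


def pages_for_touched_range(offset: int, length: int) -> int:
    """Pages that would be needed if only the touched byte range were populated."""
    if length <= 0:
        return 0
    first = offset // PAGE_SIZE
    last = (offset + length - 1) // PAGE_SIZE
    return last - first + 1


if __name__ == "__main__":
    size = 8 * 1024**3  # hypothetical 8 GiB buffer object
    print("whole object:", pages_for_whole_object(size), "pages")
    print("one byte touched:", pages_for_touched_range(0, 1), "page(s)")
```

With an object close to the size of RAM plus swap, the first figure is what would push the machine into heavy swapping, which would match a system that looks frozen without any GPU hang being recorded.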
Comment 4 Elizabeth 2017-08-23 16:59:58 UTC
Still reproducible on GLK with commit ad6ab9f96437f0cb1f4d8a7840fd9eeb894eb12e
No new behavior, error state doesn't show hang.
Comment 5 Chris Wilson 2017-08-23 17:07:46 UTC
If you don't have a GPU hang, then the bug report about swap-bo causing a GPU hang continues to mystify.
Comment 6 Hector Velazquez 2017-09-27 20:53:53 UTC
This test is still failing on GLK QA

Tests List:

igt@gem_mmap@swap-bo


This is my configuration:

======================================
        Graphic stack
======================================
Component: drm
    tag: libdrm-2.4.81-56-g7c71188
    commit: 7c71188610b4ceba0339c2bc884320bcb749adee

Component: cairo
    tag: 1.15.6-42-gdccbed7
    commit: dccbed7d78d32bd3b912e8810379451dd94e6a1f

Component: intel-gpu-tools
    tag: intel-gpu-tools-1.19-332-g0a91a5e
    commit: 0a91a5e9624d41d23b79e2540eda111cb56d42d9

Component: piglit
    tag: piglit-v1
    commit: 95e2f51a28b6cf7ff77d84e1234121c98f10ef64
	
======================================
             Software
======================================
kernel version              : 4.14.0-rc2-drm-tip-ww39-commit-d76cbbc+
hostname                    : GLK-2-GLKRVP1DDR405
architecture                : x86_64
os version                  : Ubuntu 16.10
os codename                 : yakkety
kernel driver               : i915
bios revision               : 62.30
bios release date           : 08/22/2017
ksc                         : 1.41
hardware acceleration       : disabled
swap partition              : enabled on (/dev/sda3)

======================================
        Graphic drivers
======================================
grep: /opt/X11R7/var/log/Xorg.0.log: No such file or directory
libdrm                      : 2.4.83
cairo                       : 1.15.9
intel-gpu-tools (tag)       : intel-gpu-tools-1.19-332-g0a91a5e
intel-gpu-tools (commit)    : 0a91a5e

======================================
             Hardware
======================================
. . .

======================================
             Firmware
======================================
dmc fw loaded             : yes
dmc version               : 1.4
guc fw loaded             : SUCCESS
guc version wanted        : 10.56
guc version found         : 10.56
huc fw loaded             : yes

======================================
             kernel parameters
======================================
quiet drm.debug=0xe pci=pcie_bus_safe i915.alpha_support=1 i915.enable_guc_loading=2 i915.enable_guc_submission=2 intel_iommu=igfx_off auto panic=1 nmi_watchdog=panic resume=/dev/sda3 fastboot
Comment 7 Hector Velazquez 2017-10-02 16:05:26 UTC
This test is failing on CFL-S-1 QA

Tests List:

igt@gem_mmap@swap-bo


This is my configuration:

======================================
        Graphic stack
======================================
Component: drm
    tag: libdrm-2.4.81-57-g1dd84e0
    commit: 1dd84e01a972b1759839a7326009be24ab3e6de2

Component: cairo
    tag: 1.15.6-42-gdccbed7
    commit: dccbed7d78d32bd3b912e8810379451dd94e6a1f

Component: intel-gpu-tools
    tag: intel-gpu-tools-1.19-357-g1e99f8b
    commit: 1e99f8b8d2563d7f5c4e82932bab15abc5eacaef

Component: piglit
    tag: piglit-v1
    commit: 5aa6eea37f44f818632a3dad4c1a7478085bd56d

	
======================================
             Software
======================================
kernel version              : 4.14.0-rc3-drm-tip-ww40-commit-2f14e31+
hostname                    : CFL-S-1
architecture                : x86_64
os version                  : Ubuntu 16.10
os codename                 : yakkety
kernel driver               : i915
bios revision               : 104.3
bios release date           : 09/14/2017
ksc                         : 1.5
hardware acceleration       : disabled
swap partition              : enabled on (/dev/nvme0n1p3 /dev/sda3)

======================================
        Graphic drivers
======================================
grep: /opt/X11R7/var/log/Xorg.0.log: No such file or directory
libdrm                      : 2.4.83
cairo                       : 1.15.9
intel-gpu-tools (tag)       : intel-gpu-tools-1.19-357-g1e99f8b
intel-gpu-tools (commit)    : 1e99f8b

======================================
             Hardware
======================================
motherboard model          : CoffeeLakeClientPlatform
motherboard id             : CoffeeLakeSUDIMMRVP
form factor                : Desktop
manufacturer               : IntelCorporation
cpu family                 : Other
cpu family id              : 6
cpu information            : Genuine Intel(R) CPU 0000 @ 3.60GHz
gpu card                   : Intel Corporation Device 3e92 (prog-if 00 [VGA controller])
memory ram                 : 15.59 GB
max memory ram             : 32 GB
cpu thread                 : 12
cpu core                   : 6
cpu model                  : 158
cpu stepping               : 10
socket                     : Other
hard drive                 : 111GiB (120GB)
current cd clock frequency : 337500 kHz
maximum cd clock frequency : 675000 kHz
displays connected         : eDP-1 DP-1

======================================
             Firmware
======================================
dmc fw loaded             : yes
dmc version               : 1.1
guc fw loaded             : SUCCESS
guc version wanted        : 9.14
guc version found         : 9.14
huc fw loaded             : yes

======================================
             kernel parameters
======================================
quiet drm.debug=0x1e i915.enable_guc_loading=2 i915.enable_guc_submission=2 i915.alpha_support=1 auto panic=1 nmi_watchdog=panic intel_iommu=igfx_off resume=/dev/sda3 fastboot
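
For readers comparing the kernel parameters between reports, the sketch below splits a command line into key/value pairs and decodes the drm.debug bitmask into category names. The bit-to-name table is written from memory of the DRM core's debug categories and should be treated as an assumption to verify against the kernel tree in use; the function names are hypothetical.

```python
# Assumed mapping of drm.debug bits to category names; verify against
# the drm_print.h of the kernel actually being tested.
DRM_DEBUG_BITS = {
    0x01: "CORE",
    0x02: "DRIVER",
    0x04: "KMS",
    0x08: "PRIME",
    0x10: "ATOMIC",
    0x20: "VBL",
}


def parse_cmdline(cmdline: str) -> dict:
    """Split a kernel command line into {key: value}; bare flags map to None."""
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params[key] = value if sep else None
    return params


def decode_drm_debug(value: str) -> list:
    """Return the category names enabled by a drm.debug value such as '0x1e'."""
    mask = int(value, 0)  # base 0 accepts both hex ('0x1e') and decimal
    return [name for bit, name in sorted(DRM_DEBUG_BITS.items()) if mask & bit]


if __name__ == "__main__":
    params = parse_cmdline("quiet drm.debug=0x1e i915.alpha_support=1 panic=1")
    print(params["drm.debug"], decode_drm_debug(params["drm.debug"]))
```

Under that assumed table, 0xe enables DRIVER, KMS and PRIME, and 0x1e adds ATOMIC.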
Comment 8 Octavio 2018-02-09 23:37:36 UTC
This issue is still present on CFL 

Tests List:

igt@gem_mmap@swap-bo

Using IGT-Version: 1.21-g94bd67c (x86_64) (Linux: 4.15.0-drm-intel-qa-ww6-commit-6c10ba2+ x86_64)

no error state
Comment 9 Hector Velazquez 2018-02-12 14:11:22 UTC
This test is still failing on GLK QA

Tests List:

igt@gem_mmap@swap-bo

IGT-Version: 1.21-g94bd67c (x86_64) (Linux: 4.15.0-drm-tip-ww6-commit-078873d+ x86_64)

No new behavior, error state doesn't show hang.
Comment 10 Hector Velazquez 2018-03-02 18:16:51 UTC
This test has an incomplete/fail result on SNB/IVB/HSW/BDW/BSW/SKL/BXT/KBL/GLK/CFL/CNL QA
Tests List:

igt@gem_mmap@swap-bo

output sample(SKL):
. . .
**** DEBUG ****
(gem_mmap:952) intel-os-DEBUG: Checking 1 surfaces of size 8231321600 bytes (total 8231325696) against RAM + swap
(gem_mmap:952) drmtest-DEBUG: Test requirement passed: !(fd<0)
(gem_mmap:952) igt-debugfs-DEBUG: Opening debugfs directory '/sys/kernel/debug/dri/0'
(gem_mmap:952) intel-os-DEBUG: Test requirement passed: __intel_check_memory(count, size, mode, &required, &total)
(gem_mmap:952) igt-core-DEBUG: Test requirement passed: !igt_run_in_simulation()
(gem_mmap:952) DEBUG: Test requirement passed: ptr_cpu
(gem_mmap:952) ioctl-wrappers-CRITICAL: Test assertion failure function gem_set_domain, file ioctl_wrappers.c:422:
(gem_mmap:952) ioctl-wrappers-CRITICAL: Failed assertion: __gem_set_domain(fd, handle, read, write) == 0
(gem_mmap:952) ioctl-wrappers-CRITICAL: Last errno: 12, Cannot allocate memory
(gem_mmap:952) ioctl-wrappers-CRITICAL: error: -12 != 0
(gem_mmap:952) igt-core-INFO: Stack trace:
(gem_mmap:952) igt-core-INFO:   #0 [__igt_fail_assert+0x101]
(gem_mmap:952) igt-core-INFO:   #1 [gem_set_domain+0x44]
(gem_mmap:952) igt-core-INFO:   #2 [test_huge_bo+0xef]
(gem_mmap:952) igt-core-INFO:   #3 [__real_main119+0x264]
(gem_mmap:952) igt-core-INFO:   #4 [main+0x27]
(gem_mmap:952) igt-core-INFO:   #5 [__libc_start_main+0xf1]
(gem_mmap:952) igt-core-INFO:   #6 [_start+0x2a]
(gem_mmap:952) igt-core-INFO:   #7 [<unknown>+0x2a]
****  END  ****
. . .

software:
IGT-Version: 1.21-g2b64cac (x86_64) (Linux: 4.16.0-rc1-drm-intel-qa-ww7-commit-8a1ed24+ x86_64)
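
To make output like the sample above easier to triage, the following sketch extracts the requested surface size and the failing errno from an IGT debug log. It is only an illustration: the function and regular expressions are hypothetical helpers, not part of IGT, and they assume the exact line wording seen in the sample.

```python
import errno
import re

# Patterns assume the wording of the sample log lines above.
CHECK_RE = re.compile(r"Checking (\d+) surfaces of size (\d+) bytes \(total (\d+)\)")
ERRNO_RE = re.compile(r"Last errno: (\d+), (.+)")


def summarize(log_text: str) -> dict:
    """Collect the memory requirement and the last errno from an IGT debug log."""
    summary = {}
    m = CHECK_RE.search(log_text)
    if m:
        count, size, total = map(int, m.groups())
        summary["surfaces"] = count
        summary["size"] = size
        # Bytes required beyond the surfaces themselves.
        summary["overhead"] = total - count * size
    m = ERRNO_RE.search(log_text)
    if m:
        code = int(m.group(1))
        summary["errno"] = code
        summary["errno_name"] = errno.errorcode.get(code, "unknown")
        summary["errno_text"] = m.group(2).strip()
    return summary


SAMPLE = """\
(gem_mmap:952) intel-os-DEBUG: Checking 1 surfaces of size 8231321600 bytes (total 8231325696) against RAM + swap
(gem_mmap:952) ioctl-wrappers-CRITICAL: Last errno: 12, Cannot allocate memory
"""

if __name__ == "__main__":
    print(summarize(SAMPLE))
```

On the sample, the total exceeds the surface size by 4096 bytes, and errno 12 maps to ENOMEM on Linux, so the failure is an allocation failure in gem_set_domain rather than a GPU hang.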
Comment 11 Elizabeth 2018-03-15 17:17:23 UTC
(In reply to Hector Velazquez from comment #10)
This failure doesn't apply to this bug; it is being worked on in bug 105402.
Comment 12 Jani Saarinen 2018-03-29 07:11:42 UTC
First of all, sorry about the spam.
This is a mass update for our bugs.

Sorry if you find this annoying, but we are trying to understand whether this bug is still valid.
If the bug investigation is still in progress, please ignore this, and I apologize!

If you think this bug is no longer valid, please comment on the bug so that it can be closed.
If you haven't tested with our latest pre-upstream tree (drm-tip), can you also do that to see whether the issue is still present there? If you can no longer see the issue there, please comment on the bug.
Comment 13 Elizabeth 2018-04-09 17:24:32 UTC
This test takes around two and a half days to finish, and once it appears to have stopped, the computer restarts, so the pass/fail/incomplete result is not available. Is it worth trying to get logs from this test, or can the bug be considered invalid because there is not enough information?
Comment 14 Lakshmi 2018-09-25 07:14:21 UTC
I assume this issue has been fixed.
Closing now. Feel free to reopen if you still have the issue.
