Summary: | [BSW/BXT] [GPU Hang] igt / gem_softpin / noreloc-s4 fails | |
---|---|---|---
Product: | DRI | Reporter: | cprigent <christophe.prigent>
Component: | DRM/Intel | Assignee: | Chris Wilson <chris>
Status: | CLOSED DUPLICATE | QA Contact: | Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: | critical | |
Priority: | highest | CC: | intel-gfx-bugs
Version: | unspecified | |
Hardware: | x86-64 (AMD64) | |
OS: | Linux (All) | |
Whiteboard: | | |
i915 platform: | BSW/CHT, BXT | i915 features: | GEM/Other, GPU hang
Attachments: | | |

Description
cprigent
2016-03-25 15:25:40 UTC
Unreported GPU hang.

Is the hang now reported?

This bug was also reproduced on BXT under the following configuration:

Hardware configuration
=======================
Platform BXT - P (APL)
Motherboard model Apollo Lake
Motherboard type NOTEBOOK Hand Held
Motherboard manufacturer Intel Corp.
CPU family Other
CPU information 06/5c
GPU Card Intel Corporation Device 5a84 (rev 03) (prog-if 00 [VGA controller])
Memory ram 8 GB

Software configuration
=======================
--> Component : drm url : http://cgit.freedesktop.org/mesa/drm tag : libdrm-2.4.67-11-gea78c17 commit : ea78c17 author : Emil Velikov <emil.l.velikov@gmail.com> age : 20 hours ago
--> Component : mesa url : http://cgit.freedesktop.org/mesa/mesa tag : mesa-11.1.2 commit : 7bcd827 author : Emil Velikov <emil.velikov@collabora.com> age : 7 weeks ago
--> Component : xf86-video-intel url : http://cgit.freedesktop.org/xorg/driver/xf86-video-intel tag : 2.99.917-590-g094924f commit : 094924f author : Chris Wilson <chris@chris-wilson.co.uk> age : 4 days ago
--> Component : libva url : http://cgit.freedesktop.org/libva/ tag : libva-1.7.0-1-g2339d10 commit : 2339d10 author : Xiang Haihao <haihao.xiang@intel.com> age : 13 days ago
--> Component : vaapi (intel-driver) url : http://cgit.freedesktop.org/vaapi/intel-driver tag : 1.7.0-5-g759e44d commit : 759e44d author : peng.chen <peng.c.chen@intel.com> age : 13 days ago
--> Component : cairo url : http://cgit.freedesktop.org/cairo tag : 1.15.2 commit : db8a7f1 author : Bryce Harrington <bryce@osg.samsung.com> age : 4 months ago
--> Component : xserver url : http://cgit.freedesktop.org/xorg/xserver tag : xorg-server-1.18.0-254-g44e1c97 commit : 44e1c97 author : Olivier Fourdan <ofourdan@redhat.com> age : 8 days ago
--> Component : intel-gpu-tools url : http://cgit.freedesktop.org/xorg/app/intel-gpu-tools tag : intel-gpu-tools-1.14-129-g41a26b5 commit : 41a26b5 author : Chris Wilson <chris@chris-wilson.co.uk> age : 26 hours ago

kernel configuration
=====================
Branch : drm-intel-nightly
commit f5d413cccefa1f93d64c34f357151d42add63a84
Author: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
Date: Thu Mar 24 14:35:16 2016 +0000
drm-intel-nightly: 2016y-03m-24d-14h-34m-29s UTC integration manifest
Kernel version : 4.5.0
Architecture : source amd64 all

The following test cases fail on BXT-P :

tests cases
===========================
igt@gem_softpin@noreloc-s3
igt@gem_softpin@noreloc-s4

Software information
===========================
Linux distribution : Ubuntu 15.10
Architecture : 64-bit
Bios revision : 129.10
KSC revision : 1.12

Hardware information
=============================
Platform : BXT-P
Motherboard model : Broxton P
Motherboard type : NOTEBOOK Hand Held
Motherboard manufacturer : Intel Corp.
CPU family : Other
CPU information : 06/5c
GPU Card : Intel Corporation Device 5a84 (rev 03) (prog-if 00 [VGA controller])
CPU's number : 4

Graphic stack
=========================
--> Component : drm url : http://cgit.freedesktop.org/mesa/drm tag : libdrm-2.4.68-4-g7aab852 commit : 7aab852 author : Sonny Jiang <sonny.jiang@amd.com> age : 5 days ago
--> Component : mesa url : http://cgit.freedesktop.org/mesa/mesa tag : mesa-11.1.2 commit : 7bcd827 author : Emil Velikov <emil.velikov@collabora.com> age : 3 months ago
--> Component : xf86-video-intel url : http://cgit.freedesktop.org/xorg/driver/xf86-video-intel tag : 2.99.917-645-g88733a7 commit : 88733a7 author : Chris Wilson <chris@chris-wilson.co.uk> age : 10 days ago
--> Component : libva url : http://cgit.freedesktop.org/libva/ tag : libva-1.7.0-10-gcfed369 commit : cfed369 author : peng.chen <peng.c.chen@intel.com> age : 6 days ago
--> Component : vaapi (intel-driver) url : http://cgit.freedesktop.org/vaapi/intel-driver tag : 1.7.0-23-g0be0b21 commit : 0be0b21 author : Xiang Haihao <haihao.xiang@intel.com> age : 7 days ago
--> Component : cairo url : http://cgit.freedesktop.org/cairo tag : 1.15.2 commit : db8a7f1 author : Bryce Harrington
<bryce@osg.samsung.com> age : 5 months ago
--> Component : xserver url : http://cgit.freedesktop.org/xorg/xserver tag : xorg-server-1.18.0-324-g984be78 commit : 984be78 author : Olivier Fourdan <ofourdan@redhat.com> age : 6 days ago
--> Component : intel-gpu-tools url : http://cgit.freedesktop.org/xorg/app/intel-gpu-tools tag : intel-gpu-tools-1.14-319-gb88bce4 commit : b88bce4 author : Mika Kuoppala <mika.kuoppala@intel.com> age : 4 days ago

kernel
======================
commit 5528ede25a356196417ee02aa153b17fe90c1be5
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date: Sat May 14 09:24:52 2016 +0100
drm-intel-nightly: 2016y-05m-14d-08h-24m-24s UTC integration manifest
Kernel version : 4.6.0-rc7
Architecture : source amd64 all
Homepage : http://www.kernel.org/

The following test cases fail with the following configuration :

Tests cases
===============================================
igt@gem_softpin@evict-active
igt@gem_softpin@noreloc-s4
igt@gem_softpin@evict-active-interruptible

Gfx stack information
===============================================
--> Component : drm tag : libdrm-2.4.68-4-g7aab852 commit : 7aab852
--> Component : mesa tag : mesa-11.1.2 commit : 7bcd827
--> Component : cairo tag : 1.15.2 commit : db8a7f1
--> Component : intel-gpu-tools tag : intel-gpu-tools-1.14-346-gcce2ff0 commit : cce2ff0

Software information
===============================================
Kernel version : 4.6.0-rc7-drm-intel-nightly-ww20-commit-5528ede+
Linux distribution : Ubuntu 15.10
Architecture : 64-bit
Bios revision : 138.22
KSC revision : 1.12
DMC revision : 1.07
GUC revision : 8.7

Hardware information
===============================================
Platform : BXT-P
Motherboard model : Broxton P
Motherboard type : NOTEBOOK Hand Held
Motherboard manufacturer : Intel Corp.
CPU family : Other
CPU information : 06/5c
GPU Card : Intel Corporation Device 5a84 (rev 0a) (prog-if 00 [VGA controller])

kernel
===============================================
commit 2ec823981d62c56d1511bda42b8295e31ece800f
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date: Sun May 22 18:23:13 2016 +0200
drm-intel-nightly: 2016y-05m-22d-16h-22m-45s UTC integration manifest

gem_softpin@noreloc-s4 fails randomly on BSW: sometimes it passes, sometimes it crashes. It reproduces in roughly 50% of runs.

(Graphic Stack) Intel® Graphics for Linux* | 01.org

Software information
============================================
Kernel version : 4.6.0-drm-intel-nightly-ww23-commit-fb023a2+
Linux distribution : Ubuntu 16.04 LTS
Architecture : 64-bit
Mesa version : Not found << Please see the message at the bottom >>
xf86-video-intel version : 2.99.917
Xorg-Xserver version : 1.18.3
DRM version : 2.4.68
VAAPI version : Not found
Cairo version : 1.15.2
Intel GPU Tools version : Tag [intel-gpu-tools-1.14-348-g303b380] / Commit [303b380]
Kernel driver in use : i915
Hardware acceleration :
Bios revision : 0.33
KSC revision : 0.16

Hardware information
============================================
Platform :
Motherboard model : 10G9000NUS
Motherboard type : BRASWELL Desktop
Motherboard manufacturer : LENOVO
CPU family : Pentium
CPU information : Intel(R) Pentium(R) CPU N3700 @ 1.60GHz
GPU Card : Intel Corporation Device 22b1 (rev 21) (prog-if 00 [VGA controller])
Memory ram : 8 GB
Maximum memory ram allowed : 8 GB
Display resolution :
CPU's number : 4
Hard drive Capacity : 120 GB

The following test cases fail with the following configuration :

Tests cases
===============================================
igt@gem_softpin@noreloc-s4

Gfx stack information
===============================================
--> Component : drm tag : libdrm-2.4.68 commit : fc09c5a
--> Component : cairo tag : 1.15.2 commit : db8a7f1
--> Component : intel-gpu-tools tag : intel-gpu-tools-1.14-348-g303b380 commit : 303b380

Software information
===============================================
Kernel version : 4.6.0-drm-intel-nightly-ww23-commit-fb023a2+
Linux distribution : Ubuntu 16.04
Architecture : 64-bit
Bios revision : 138.25
KSC revision : 1.12
DMC revision : 1.07

Hardware information
===============================================
Platform : BXT-P
Motherboard model : Broxton P
Motherboard type : NOTEBOOK Hand Held
Motherboard manufacturer : Intel Corp.
CPU family : B1
CPU information : 06/5c
GPU Card : Intel Corporation Device 5a84 (rev 0a) (prog-if 00 [VGA controller])

kernel
===============================================
commit fb023a2062df06c9e097e1f8f2bcf252194b9413
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date: Mon May 30 10:46:14 2016 +0200
drm-intel-nightly: 2016y-05m-30d-08h-45m-53s UTC integration manifest

The issue is present with the following configuration :

Tests cases
===============================================
igt@gem_softpin@evict-active-interruptible
igt@gem_softpin@evict-active

Gfx stack information
===============================================
--> Component : drm tag : libdrm-2.4.68 commit : fc09c5a
--> Component : cairo tag : 1.15.2 commit : db8a7f1
--> Component : intel-gpu-tools tag : intel-gpu-tools-1.14-348-g303b380 commit : 303b380

Software information
===============================================
Kernel version : 4.6.0-drm-intel-nightly-ww23-commit-fb023a2+
Linux distribution : Ubuntu 16.04
Architecture : 64-bit
Bios revision : 138.25
KSC revision : 1.12
DMC revision : 1.07

Hardware information
===============================================
Platform : BXT-P
Motherboard model : Broxton P
Motherboard type : NOTEBOOK Hand Held
Motherboard manufacturer : Intel Corp.
CPU family : B1
CPU information : 06/5c
GPU Card : Intel Corporation Device 5a84 (rev 0a) (prog-if 00 [VGA controller])

kernel
===============================================
commit fb023a2062df06c9e097e1f8f2bcf252194b9413
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date: Mon May 30 10:46:14 2016 +0200
drm-intel-nightly: 2016y-05m-30d-08h-45m-53s UTC integration manifest

Created attachment 124932 [details]
apl-gem_softpin-kern.log

The test fails on APL with a GPU hang. Attached kern.log and /sys/class/drm/card0/error.

# ./gem_softpin --r noreloc-S4
IGT-Version: 1.15-g88c1f7c (x86_64) (Linux: 4.7.0-rc5-nightly+ x86_64)
rtcwake: wakeup from "disk" using /dev/rtc0 at Wed Jul 6 14:34:06 2016
(gem_softpin:1615) CRITICAL: Test assertion failure function test_noreloc, file gem_softpin.c:466:
(gem_softpin:1615) CRITICAL: Failed assertion: val == (object[i].offset - offset)/size
(gem_softpin:1615) CRITICAL: error: 217 != 21
Stack trace:
#0 [__igt_fail_assert+0xf1]
#1 [test_noreloc+0x365]
#2 [__real_main474+0x7e1]
#3 [main+0x29]
#4 [__libc_start_main+0xf0]
#5 [_start+0x29]
#6 [<unknown>+0x29]
Subtest noreloc-S4 failed.
**** DEBUG ****
(gem_softpin:1615) DEBUG: Made a 257x4 KiB hole: ffefd000
(gem_softpin:1615) igt-core-DEBUG: Test requirement passed: !igt_run_in_simulation()
(gem_softpin:1615) igt-aux-DEBUG: Test requirement passed: system("rtcwake -n -s 30 -m disk" SQUELCH) == 0
(gem_softpin:1615) CRITICAL: Test assertion failure function test_noreloc, file gem_softpin.c:466:
(gem_softpin:1615) CRITICAL: Failed assertion: val == (object[i].offset - offset)/size
(gem_softpin:1615) CRITICAL: error: 217 != 21
**** END ****
Subtest noreloc-S4: FAIL (15.105s)

Platform: APL system
CPU Name : Intel(R) Genuine Processor @ 1.1 GHz (family: 6, model: 12, stepping: 9) 4 cores
QDF : Q6HE
SoC : B1
CRB : Apollo Lake DDR3L RVP1A FAB2
Reworks : R19, R20

Software
Bios: 144_B10 APLK_B0_IFWI_X64_R_2016_06_27_0956_SPI_RVP1.bin from \\gar\ec\proj\ba\CCG\APL BIOS\External\BIOS_Release\Daily\v144_10_2016_WW27.1\IFWI\IFWI_RVP1_Release\IFWI
KSC: 1.15
Linux distribution: Ubuntu 16.04 64 bits
Kernel: drm-intel-nightly 4.7.0-rc5 2fe5da8 from http://cgit.freedesktop.org/drm-intel/
commit 2fe5da8adcbe5a34830b98c63878eea756e977a2
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date: Sat Jul 2 19:32:50 2016 +0100
drm-intel-nightly: 2016y-07m-02d-18h-31m-39s UTC integration manifest
drm: libdrm-2.4.68-11 7996a87 from git://anongit.freedesktop.org/mesa/drm
mesa: mesa-11.2.2 3a9f628 from git://anongit.freedesktop.org/mesa/mesa
cairo: 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo
server: xorg-server-1.18.0-443 3762edd from git://git.freedesktop.org/git/xorg/xserver
xf86-video-intel: 2.99.917-674 9154dff from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel
libva: libva-1.7.0-26 c36971c from git://git.freedesktop.org/git/vaapi/libva
vaapi-intel-driver: 1.7.0-53 bcde10d from git://git.freedesktop.org/git/vaapi/intel-driver
GuC 8.7
DMC 1.07 from https://01.org/linuxgraphics/downloads/broxton-dmc-1.07
Intel-Gpu-Tools 1.15-87 376b813 from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git

Created attachment 124933 [details]
/sys/class/drm/card0/error
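
When comparing many runs like the one above, it can help to pull the failing function, the source location, and the two compared values out of the IGT output automatically. The following is a minimal Python sketch, not part of IGT: `parse_igt_failure` and `IgtFailure` are hypothetical names, and it assumes each CRITICAL message sits on its own line, as in the original terminal output. The log does not say which number is the observed value and which is the expected one, so the sketch simply calls them left and right.

```python
import re
from typing import NamedTuple, Optional


class IgtFailure(NamedTuple):
    """Hypothetical container for one IGT assertion failure."""
    function: str
    location: str
    assertion: str
    left: int
    right: int


# Patterns modelled only on the three CRITICAL lines quoted in this report.
FUNC_RE = re.compile(r"Test assertion failure function (\w+), file ([\w./-]+:\d+):")
EXPR_RE = re.compile(r"Failed assertion: (.+)$")
VALS_RE = re.compile(r"error: (-?\d+) != (-?\d+)")


def parse_igt_failure(log: str) -> Optional[IgtFailure]:
    """Return the first complete assertion failure in the log, or None."""
    function = location = assertion = None
    for line in log.splitlines():
        if function is None:
            m = FUNC_RE.search(line)
            if m:
                function, location = m.groups()
        elif assertion is None:
            m = EXPR_RE.search(line)
            if m:
                assertion = m.group(1).strip()
        else:
            m = VALS_RE.search(line)
            if m:
                return IgtFailure(function, location, assertion,
                                  int(m.group(1)), int(m.group(2)))
    return None


# Lines copied from the failure output in this report.
SAMPLE = """\
(gem_softpin:1615) CRITICAL: Test assertion failure function test_noreloc, file gem_softpin.c:466:
(gem_softpin:1615) CRITICAL: Failed assertion: val == (object[i].offset - offset)/size
(gem_softpin:1615) CRITICAL: error: 217 != 21
"""

if __name__ == "__main__":
    print(parse_igt_failure(SAMPLE))
```

Run over a directory of saved logs, a helper like this makes it easy to see whether the mismatching values stay the same from run to run or vary.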
Both this and gem_exec_suspend show a similar symptom: a missed context-switch following resume (on different rings, though). It does not happen on bdw/skl, which suggests the problem is not in the common execlists code but in the irq handler (and teardown/setup)?

Thanks Chris for the investigation. What is the next step? For info, it is reproduced with the following setup:

Platform: APL system
CPU Name : Intel(R) Genuine Processor @ 1.1 GHz (family: 6, model: 12, stepping: 9) 4 cores
QDF : Q6HE
SoC : B1
CRB : Apollo Lake DDR3L RVP1A FAB2
Reworks : R19, R20

Software
Bios: 144_B10 APLK_B0_IFWI_X64_R_2016_06_27_0956_SPI_RVP1.bin from \\gar\ec\proj\ba\CCG\APL BIOS\External\BIOS_Release\Daily\v144_10_2016_WW27.1\IFWI\IFWI_RVP1_Release\IFWI
KSC: 1.15
Linux distribution: Ubuntu 16.04 64 bits
Kernel: tag drm-intel-testing-2016-07-11 4.7.0-rc6 0230e3c from http://cgit.freedesktop.org/drm-intel/
commit 0230e3c4eb76cf8f57cf40db0e908b96b84e3911
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date: Sun Jul 10 13:24:46 2016 +0100
drm-intel-nightly: 2016y-07m-10d-12h-23m-38s UTC integration manifest
drm: libdrm-2.4.68-14 8c8d5dd from git://anongit.freedesktop.org/mesa/drm
mesa: mesa-11.2.2 3a9f628 from git://anongit.freedesktop.org/mesa/mesa
cairo: 1.15.2 db8a7f1 from git://anongit.freedesktop.org/cairo
xserver: xorg-server-1.18.0-454 033888e from git://git.freedesktop.org/git/xorg/xserver
xf86-video-intel: 2.99.917-676 26f8ab5 from git://git.freedesktop.org/git/xorg/driver/xf86-video-intel
libva: libva-1.7.0-26 c36971c from git://git.freedesktop.org/git/vaapi/libva
vaapi-intel-driver: 1.7.0-53 bcde10d from git://git.freedesktop.org/git/vaapi/intel-driver
GuC 8.7
DMC 1.07 from https://01.org/linuxgraphics/downloads/broxton-dmc-1.07
Intel-Gpu-Tools 1.15 2038b24 from http://anongit.freedesktop.org/git/xorg/app/intel-gpu-tools.git

For the BXT case, could you please retest with GuC disabled?

Deleting guc or loading a previous initramfs without guc:
rm /lib/firmware/i915/*guc*
Synchronizing system and bios clock:
hwclock -w
Checking guc existence:
sudo cat /sys/kernel/debug/dri/0/i915_guc_load_status | grep "load" | awk -F": " '{print $2}'
NONE
Executing:
./gem_softpin --r noreloc-S4
Status: SUCCESS
Terminal Output:
./gem_softpin --r noreloc-S4
IGT-Version: 1.15-g3ce58b6 (x86_64) (Linux: 4.7.0-rc6-drm-intel-nightly-ww29-commit-ee33eb7+ x86_64)
rtcwake: wakeup from "disk" using /dev/rtc0 at Thu Jul 14 15:54:20 2016
rtcwake: wakeup from "disk" using /dev/rtc0 at Thu Jul 14 15:55:37 2016
rtcwake: wakeup from "disk" using /dev/rtc0 at Thu Jul 14 15:56:53 2016
rtcwake: wakeup from "disk" using /dev/rtc0 at Thu Jul 14 15:58:07 2016
rtcwake: wakeup from "disk" using /dev/rtc0 at Thu Jul 14 15:59:20 2016
rtcwake: wakeup from "disk" using /dev/rtc0 at Thu Jul 14 16:02:20 2016
rtcwake: wakeup from "disk" using /dev/rtc0 at Thu Jul 14 16:04:28 2016
rtcwake: wakeup from "disk" using /dev/rtc0 at Thu Jul 14 16:05:46 2016
Subtest noreloc-S4: SUCCESS (145.368s)

Configuration:
Kernel:4.7.0-rc6-drm-intel-nightly commit-ee33eb7
GFX Configuration:
--> Component : drm url : http://cgit.freedesktop.org/mesa/drm tag : libdrm-2.4.68 commit : fc09c5a author : Kenneth Graunke <kenneth@whitecape.org> age : 6 weeks ago
--> Component : mesa url : http://cgit.freedesktop.org/mesa/mesa tag : mesa-11.2.2 commit : 3a9f628 author : Emil Velikov <emil.velikov@collabora.com> age : 4 weeks ago
--> Component : xf86-video-intel url : http://cgit.freedesktop.org/xorg/driver/xf86-video-intel tag : 2.99.917 commit : baec802 author : Chris Wilson <chris@chris-wilson.co.uk> age : 1 year 6 months ago
--> Component : libva url : http://cgit.freedesktop.org/libva/ tag : libva-1.7.1.pre1 commit : 453876f author : Xiang Haihao <haihao.xiang@intel.com> age : 4 days ago
--> Component : vaapi (intel-driver) url : http://cgit.freedesktop.org/vaapi/intel-driver tag : 1.7.1.pre1 commit : 2975480 author :
Xiang Haihao <haihao.xiang@intel.com> age : 4 days ago
--> Component : cairo url : http://cgit.freedesktop.org/cairo tag : 1.15.2 commit : db8a7f1 author : Bryce Harrington <bryce@osg.samsung.com> age : 6 months ago
--> Component : xserver url : http://cgit.freedesktop.org/xorg/xserver tag : xorg-server-1.18.3 commit : 9454cd5 author : Adam Jackson <ajax@redhat.com> age : 9 weeks ago
--> Component : intel-gpu-tools url : http://cgit.freedesktop.org/xorg/app/intel-gpu-tools tag : intel-gpu-tools-1.15 commit : 3ce58b6 author : Marius Vlad <marius.c.vlad@intel.com> age : 6 days ago

*** This bug has been marked as a duplicate of bug 96526 ***

With the latest configuration plus kernel patches, an action fails just after returning from S4 using "rtcwake":

[91291.107493] [drm:guc_ucode_xfer_dma [i915]] returning 0
[91291.107539] [drm:intel_guc_setup [i915]] GuC fw status: fetch SUCCESS, load SUCCESS
[91291.127091] [drm] INTEL_GUC_SEND: Action 0x502 failed; ret=-110 status=0x00000502 response=0x40000000

Configuration:
(Graphic Stack) Intel® Graphics for Linux* | 01.org
============================================
Software information
============================================
Kernel version : 4.9.0-rc8latestfirmware+
Linux distribution : Ubuntu 16.04.1 LTS
Architecture : 64-bit
Gfx stack code : 1771486656
Mesa version : Not found << Please see the message at the bottom >>
xf86-video-intel version :
Xorg-Xserver version : 1.18.3
DRM version : 2.4.67
VAAPI version : Not found
Cairo version : 1.14.6
Intel GPU Tools version : Tag [intel-gpu-tools-1.17-41-ga56a1f6] / Commit [a56a1f6]
Kernel driver in use : i915
Hardware acceleration :
Bios revision : 144.10
Bios release date : 06/27/2016
KSC revision : 1.15
============================================
Firmwares information
============================================
DMC fw loaded : yes
DMC version : 1.7
GUC fw loaded : SUCCESS
GUC version wanted : 8.7
GUC version found : 8.7

Kernel parameters
============================================
quiet drm.debug=0xe i915.enable_guc_loading=2 i915.enable_guc_submission=0 resume=/dev/sda3 fastboot

Created attachment 128494 [details]
Dmesg
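
The `INTEL_GUC_SEND` failure quoted in this comment packs several fields into one dmesg line. As an illustration only, here is a small Python sketch that splits such a line into its parts; `parse_guc_send_failure` and `LINUX_ERRNO_NAMES` are hypothetical names, and the regular expression is modelled solely on the single line quoted here, so other kernel versions may format the message differently. The only interpretation added is that errno 110 is `ETIMEDOUT` on Linux, so `ret=-110` indicates the action timed out.

```python
import re
from typing import Dict, Optional

# Pattern modelled on the one dmesg line quoted in this report (an assumption,
# not a documented kernel format).
GUC_SEND_RE = re.compile(
    r"INTEL_GUC_SEND: Action (?P<action>0x[0-9a-fA-F]+) failed; "
    r"ret=(?P<ret>-?\d+) status=(?P<status>0x[0-9a-fA-F]+) "
    r"response=(?P<response>0x[0-9a-fA-F]+)"
)

# Hard-coded so the sketch behaves the same on any host OS;
# 110 is ETIMEDOUT in the generic Linux errno table.
LINUX_ERRNO_NAMES = {110: "ETIMEDOUT"}


def parse_guc_send_failure(line: str) -> Optional[Dict[str, object]]:
    """Split an INTEL_GUC_SEND failure line into numeric fields, or return None."""
    m = GUC_SEND_RE.search(line)
    if m is None:
        return None
    ret = int(m.group("ret"))
    return {
        "action": int(m.group("action"), 16),
        "ret": ret,
        "errno_name": LINUX_ERRNO_NAMES.get(-ret, "unknown"),
        "status": int(m.group("status"), 16),
        "response": int(m.group("response"), 16),
    }


# Line copied from the dmesg excerpt in this report.
DMESG_LINE = (
    "[91291.127091] [drm] INTEL_GUC_SEND: Action 0x502 failed; "
    "ret=-110 status=0x00000502 response=0x40000000"
)

if __name__ == "__main__":
    print(parse_guc_send_failure(DMESG_LINE))
```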
When the symptoms dramatically change, please do file a new bug. That is not the same bug as the original.