Bug 102888 - [BAT][ILK] igt@gem_ringfill@basic-default-hang - dmesg-warn *ERROR* Failed to reset chip: -110
Summary: [BAT][ILK] igt@gem_ringfill@basic-default-hang - dmesg-warn *ERROR* Failed to...
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: DRI git
Hardware: Other All
Importance: high major
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2017-09-20 06:58 UTC by Marta Löfstedt
Modified: 2018-01-19 10:06 UTC

See Also:
i915 platform: ILK
i915 features: GEM/Other



Description Marta Löfstedt 2017-09-20 06:58:43 UTC
CI_DRM_3111 igt@gem_ringfill@basic-default-hang failed on fi-ilk-m540.

(gem_ringfill:3479) ioctl-wrappers-CRITICAL: Test assertion failure function gem_execbuf, file ioctl_wrappers.c:622:
(gem_ringfill:3479) ioctl-wrappers-CRITICAL: Failed assertion: __gem_execbuf(fd, execbuf) == 0
(gem_ringfill:3479) ioctl-wrappers-CRITICAL: error: -5 != 0
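
For anyone decoding the numbers: the -5 in the assertion output above and the -110 in the kernel log are negated Linux errno values. A minimal sketch of the lookup, assuming the generic Linux errno numbering. The table is hard-coded so the answer does not depend on the host platform, and `decode_errno` is a hypothetical helper, not part of IGT or the kernel:

```python
# Generic Linux errno values (asm-generic numbering). Hard-coded on purpose:
# Python's errno module reports the host's numbering, which differs on
# non-Linux systems.
LINUX_ERRNO = {
    5: ("EIO", "Input/output error"),
    110: ("ETIMEDOUT", "Connection timed out"),
}


def decode_errno(code: int) -> str:
    """Map a (possibly negated) kernel return code to its symbolic name."""
    name, text = LINUX_ERRNO.get(abs(code), ("UNKNOWN", "not in this table"))
    return f"{code}: {name} ({text})"


if __name__ == "__main__":
    print(decode_errno(-5))    # execbuf failure reported by the test
    print(decode_errno(-110))  # reset failure reported in dmesg
```

So the execbuf ioctl returned EIO, and the chip reset itself gave up with a timeout.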

Dmesg:
[  250.616185] i915 0000:00:02.0: Resetting chip after gpu hang
[  252.125903] [drm:i915_reset [i915]] *ERROR* Failed to reset chip: -110
[  262.508070] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  272.742816] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  282.977620] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:27:primary A] flip_done timed out
[  293.212572] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  303.447474] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  313.682336] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:27:primary A] flip_done timed out
[  323.917227] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:36:pipe A] flip_done timed out

Full data:
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3111/fi-ilk-m540/igt@gem_ringfill@basic-default-hang.html
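
The timing in the dmesg excerpt above can be pulled out mechanically. A rough sketch, assuming the standard `[ seconds.micros]` dmesg timestamp prefix; the function names (`parse`, `reset_duration`, `flip_done_gaps`) are made up for illustration and the log text is copied from this report:

```python
import re

# Excerpt copied from the dmesg in this report (CI_DRM_3111, fi-ilk-m540).
DMESG = """\
[  250.616185] i915 0000:00:02.0: Resetting chip after gpu hang
[  252.125903] [drm:i915_reset [i915]] *ERROR* Failed to reset chip: -110
[  262.508070] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  272.742816] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  282.977620] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:27:primary A] flip_done timed out
[  293.212572] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:36:pipe A] flip_done timed out
"""

# "[  250.616185] message" -> (timestamp, message)
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(.*)$")


def parse(log: str) -> list[tuple[float, str]]:
    """Return (timestamp_seconds, message) for every timestamped line."""
    events = []
    for line in log.splitlines():
        match = LINE_RE.match(line)
        if match:
            events.append((float(match.group(1)), match.group(2)))
    return events


def reset_duration(events: list[tuple[float, str]]) -> float:
    """Seconds between the reset attempt and the reported reset failure."""
    start = next(t for t, msg in events if "Resetting chip" in msg)
    failed = next(t for t, msg in events if "Failed to reset chip" in msg)
    return failed - start


def flip_done_gaps(events: list[tuple[float, str]]) -> list[float]:
    """Seconds between consecutive 'flip_done timed out' messages."""
    times = [t for t, msg in events if "flip_done timed out" in msg]
    return [later - earlier for earlier, later in zip(times, times[1:])]


if __name__ == "__main__":
    events = parse(DMESG)
    print(f"reset gave up after {reset_duration(events):.3f} s")
    print("flip_done gaps:", [f"{gap:.3f}" for gap in flip_done_gaps(events)])
```

On this excerpt the reset failure is reported about 1.5 s after the reset starts, and the flip_done timeouts then repeat roughly every 10.2 s.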
Comment 1 Marta Löfstedt 2017-09-20 07:05:22 UTC
Note: this issue is possibly causing Bug 102890.
Comment 2 Chris Wilson 2017-10-06 16:33:08 UTC
Hmm, I think this may have been implicitly fixed by

Author: Mika Kuoppala <mika.kuoppala@linux.intel.com>
Date:   Tue Sep 19 17:41:28 2017 +0300

    drm/i915: Stop engines before reset
    
    On kbl evidence indicates that even if the hardware happily
    tells us to proceed with reset, it really isn't ready.
    Resetting a freely running batchbuffer after we have ack for readiness,
    still can cause a system hang.
    
    We also have similar experiences on older gens. So now
    attempt to stop engines before proceeding for reset, on all
    gens where we have a gpu reset. This has shown to improve reset
    reliability and reduce the risk of losing the machine.
    
    v2: Add fixme for wa (Joonas)
    
    Testcase: igt/prime_busy/hang-* # kbl
    Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
    Cc: Chris Wilson <chris@chris-wilson.co.uk>
    Cc: Ville Syrjälä <ville.syrjala@linux.intel.com>
    Signed-off-by: Mika Kuoppala <mika.kuoppala@intel.com>
    Acked-by: Chris Wilson <chris@chris-wilson.co.uk>
    Acked-by: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20170919144128.25506-1-mika.kuoppala@intel.com

Time will tell.
Comment 3 Marta Löfstedt 2017-10-12 05:45:13 UTC
This pattern is repeating itself:

[  246.872766] i915 0000:00:02.0: Resetting chip after gpu hang
[  249.892728] [drm:i915_reset [i915]] *ERROR* Failed to reset chip: -110
[  260.182782] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  270.427915] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  280.673000] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:27:primary A] flip_done timed out
[  290.918123] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  301.163196] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:36:pipe A] flip_done timed out
[  311.408387] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:27:primary A] flip_done timed out
[  321.653507] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:36:pipe A] flip_done timed out

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3218/fi-ilk-m540/igt@gem_ringfill@basic-default-hang.html

The run then went incomplete on https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3218/fi-ilk-m540/igt@gem_tiled_fence_blits@basic.html

The incomplete is tracked in Bug 102890.
Comment 4 Marta Löfstedt 2018-01-19 10:06:04 UTC
Last seen: IGT_4069 (2017-12-16), 186 runs ago.

