Bug 96902 - [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... render ring idle
Summary: [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... render...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: DRI git
Hardware: Other All
Importance: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Duplicates: 95361 96887
Depends on:
Blocks:
 
Reported: 2016-07-12 15:55 UTC by mikhail.v.gavrilov
Modified: 2017-07-24 22:41 UTC

See Also:
i915 platform: HSW
i915 features: GPU hang


Attachments
dmesg (98.17 KB, text/plain)
2016-07-12 15:55 UTC, mikhail.v.gavrilov
GPU crash dump (2.64 MB, text/plain)
2016-07-12 15:56 UTC, mikhail.v.gavrilov
dmesg (110.04 KB, text/plain)
2016-08-05 19:47 UTC, mikhail.v.gavrilov
GPU crash dump (2.64 MB, text/plain)
2016-08-05 19:48 UTC, mikhail.v.gavrilov

Description mikhail.v.gavrilov 2016-07-12 15:55:11 UTC
Created attachment 125031
dmesg
Comment 1 mikhail.v.gavrilov 2016-07-12 15:56:43 UTC
Created attachment 125032
GPU crash dump
Comment 2 yann 2016-07-15 08:53:03 UTC
Mikhail, it looks like Chrome first hit a segfault caused by xfs (not on the graphics / i915 side), even though a GPU hang was reported afterwards (still linked to Chrome).
- Can you confirm whether, after this Chrome crash, you launched Chrome again or Chrome respawned by itself?
- If Chrome does not hit this segfault with xfs, I suspect you do not get the GPU hang either. Can you confirm?
Comment 3 mikhail.v.gavrilov 2016-08-05 19:47:22 UTC
The segfault in Chrome is not related to xfs; it occurred at a different time.
'*ERROR* Hangcheck timer elapsed' occurs when I try to open too many tabs in Chrome, and Chrome seems to segfault for the same reason. The mouse cursor freezes for a while, and after that the error entry appears in the dmesg log.
Comment 4 mikhail.v.gavrilov 2016-08-05 19:47:48 UTC
Created attachment 125563
dmesg
Comment 5 mikhail.v.gavrilov 2016-08-05 19:48:17 UTC
Created attachment 125564
GPU crash dump
Comment 6 Chris Wilson 2016-08-10 10:15:39 UTC
commit 83348ba84ee0d5d4d982e5382bfbc8b2a2d05e75
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Tue Aug 9 17:47:51 2016 +0100

    drm/i915: Move missed interrupt detection from hangcheck to breadcrumbs
    
    In commit 2529d57050af ("drm/i915: Drop racy markup of missed-irqs from
    idle-worker") the racy detection of missed interrupts was removed when
    we went idle. This however opened up the issue that the stuck waiters
    were not being reported, causing a test case failure. If we move the
    stuck waiter detection out of hangcheck and into the breadcrumb
    mechanism (i.e. the waiter) itself, we can avoid this issue entirely.
    This leaves hangcheck looking for a stuck GPU (inspecting for request
    advancement and HEAD motion), and breadcrumbs looking for a stuck
    waiter - hopefully making both easier to understand by their segregation.
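
The split described in the commit message can be pictured with a small conceptual model. This is only a sketch under stated assumptions, not the i915 implementation: the names `EngineSample`, `gpu_looks_stuck` and `waiter_looks_stuck` are hypothetical, and sequence-number wraparound is ignored. One check looks for a stuck GPU (no request advancement and no HEAD motion while work is pending); the other looks for a stuck waiter (the request has completed but the waiter was never woken).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EngineSample:
    """Snapshot of one engine taken on each periodic check (hypothetical model)."""
    seqno: int  # sequence number of the last completed request
    head: int   # ring HEAD position


def gpu_looks_stuck(prev: EngineSample, cur: EngineSample, work_pending: bool) -> bool:
    # Hangcheck side: work is queued, yet neither the completed request
    # number nor the ring HEAD moved between two samples.
    return work_pending and cur.seqno == prev.seqno and cur.head == prev.head


def waiter_looks_stuck(completed_seqno: int, wanted_seqno: int, woken: bool) -> bool:
    # Breadcrumb side: the awaited request has already completed, but the
    # waiter was never woken, which points at a missed interrupt rather
    # than a hung GPU.
    return completed_seqno >= wanted_seqno and not woken
```

Keeping the two predicates separate mirrors the stated intent: a stalled engine and a missed wakeup are different failures, so each check only needs the state relevant to it.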
Comment 7 Chris Wilson 2016-08-10 10:16:36 UTC
*** Bug 96887 has been marked as a duplicate of this bug. ***
Comment 8 Chris Wilson 2016-08-10 10:16:47 UTC
*** Bug 95361 has been marked as a duplicate of this bug. ***

