Bug 105373 - [drm] GPU HANG: ecode 9:0:0xfedffffa, in Xorg [1345], reason: Hang on rcs0, action: rese
Summary: [drm] GPU HANG: ecode 9:0:0xfedffffa, in Xorg [1345], reason: Hang on rcs0, a...
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
Whiteboard: ReadyForDev
Depends on:
Reported: 2018-03-06 22:44 UTC by Vasil Kolev
Modified: 2018-03-13 13:53 UTC (History)
2 users (show)

See Also:
i915 platform: KBL
i915 features: GPU hang

/sys/class/drm/card0/error (16.83 KB, application/x-bzip)
2018-03-06 22:44 UTC, Vasil Kolev
no flags Details
dmesg (57.19 KB, text/plain)
2018-03-06 22:45 UTC, Vasil Kolev
no flags Details
/sys/class/drm/card0/error with 1.04 (4.16 KB, application/x-bzip)
2018-03-07 16:20 UTC, Vasil Kolev
no flags Details
dmesg with 1.04 (56.72 KB, text/plain)
2018-03-07 16:20 UTC, Vasil Kolev
no flags Details
dmesg with drim.debug (97.48 KB, text/plain)
2018-03-07 18:15 UTC, Vasil Kolev
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Vasil Kolev 2018-03-06 22:44:46 UTC
Created attachment 137844 [details]

issue: GPU doesn't even start working

On every boot, as soon as I login and compiz tries to start, the above message shows up in dmesg.
Comment 1 Vasil Kolev 2018-03-06 22:45:39 UTC
Created attachment 137845 [details]
Comment 2 Chris Wilson 2018-03-07 08:45:31 UTC
You need to update the dmc firmware:

commit 4f0aa1fa3e3849caee450ee5d14fcc289cf16703
Author: Anusha Srivatsa <anusha.srivatsa@intel.com>
Date:   Thu Nov 9 10:51:43 2017 -0800

    drm/i915/dmc: DMC 1.04 for Kabylake
    There is a new version of DMC available for KBL.
    The release notes mentions:
    1. Fix for the issue where DC_STATE was getting enabled even
    when disabled by driver causing data corruption.
    v2: Remove pull request from commit message (Rodrigo).
    Cc: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Anusha Srivatsa <anusha.srivatsa@intel.com>
    Reviewed-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Jani Nikula <jani.nikula@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/1510253503-12634-1-git-send-email-anusha.srivatsa@intel.com
Comment 3 Vasil Kolev 2018-03-07 16:19:22 UTC
Same happens with 1.04. Attaching dmesg, /sys/class/drm/card0/error.

Also, echo 1 > /sys/kernel/debug/dri/0/i915_wedged doesn't seem to have any effect, it's still unable to reset the GPU.
Comment 4 Vasil Kolev 2018-03-07 16:20:09 UTC
Created attachment 137865 [details]
/sys/class/drm/card0/error with 1.04
Comment 5 Vasil Kolev 2018-03-07 16:20:35 UTC
Created attachment 137866 [details]
dmesg with 1.04
Comment 6 Elizabeth 2018-03-07 17:33:49 UTC
Hi, could you attach dmesg with debug info, drm.debug=0xe parameter in grub. Thanks.
Comment 7 Vasil Kolev 2018-03-07 18:15:06 UTC
Created attachment 137869 [details]
dmesg with drim.debug

Here's the dmesg with debug.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.