Created attachment 137844
issue: GPU doesn't even start working
On every boot, as soon as I log in and compiz tries to start, the above message shows up in dmesg.
Created attachment 137845
You need to update the dmc firmware:
Author: Anusha Srivatsa <firstname.lastname@example.org>
Date: Thu Nov 9 10:51:43 2017 -0800
drm/i915/dmc: DMC 1.04 for Kabylake
There is a new version of DMC available for KBL.
The release notes mentions:
1. Fix for the issue where DC_STATE was getting enabled even
when disabled by driver causing data corruption.
v2: Remove pull request from commit message (Rodrigo).
Cc: Rodrigo Vivi <email@example.com>
Signed-off-by: Anusha Srivatsa <firstname.lastname@example.org>
Reviewed-by: Rodrigo Vivi <email@example.com>
Signed-off-by: Jani Nikula <firstname.lastname@example.org>
The same happens with DMC 1.04. Attaching dmesg and /sys/class/drm/card0/error.
Also, echo 1 > /sys/kernel/debug/dri/0/i915_wedged doesn't seem to have any effect; the driver is still unable to reset the GPU.
Created attachment 137865
/sys/class/drm/card0/error with 1.04
Created attachment 137866
dmesg with 1.04
Hi, could you attach a dmesg with debug info, captured with the drm.debug=0xe kernel parameter set in grub? Thanks.
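For reference, a minimal sketch of checking that the debug parameter took effect before capturing the log. The helper name has_drm_debug and the output file name are hypothetical. How the parameter gets into the grub configuration (typically the GRUB_CMDLINE_LINUX_DEFAULT line in /etc/default/grub, followed by regenerating the config) varies by distribution and is assumed here, not shown.

```shell
# Hypothetical helper: succeeds if the given kernel command line
# contains the drm.debug=0xe parameter as a separate word.
has_drm_debug() {
    case " $1 " in
        *" drm.debug=0xe "*) return 0 ;;
        *) return 1 ;;
    esac
}

# Usage on the affected machine after rebooting (not run here):
#   has_drm_debug "$(cat /proc/cmdline)" && dmesg > dmesg-drm-debug.txt
```

If the check fails after a reboot, the parameter most likely never reached the kernel command line, and the resulting dmesg would lack the requested debug output.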
Created attachment 137869
dmesg with drm.debug
Here's the dmesg with debug.
First of all, sorry about the spam.
This is a mass update for our bugs.
Sorry if you find this annoying, but with this we are trying to understand whether the bug is still valid or not.
If the bug investigation is still in progress, please ignore this, and I apologize!
If you think this is no longer valid, please comment on the bug so that it can be closed.
If you haven't tested with our latest pre-upstream tree (drm-tip), can you also do that to see whether the issue is still valid there? If you cannot see the issue there, please comment on the bug.
Jani, yes, this is still valid - there doesn't seem to have been a new release of anything related to this, and the bug persists (my GPU is hung and I still use some very slow driver/access to my video card).
I'm currently running 4.15.0 with the patch for the 1.04 firmware. Is there any new work in drm-tip, and how do I fetch it?
You can get drm-tip from: https://cgit.freedesktop.org/drm-tip.
Created attachment 138427
/sys/class/drm/card0/error with drm-tip
Created attachment 138428
dmesg with drm-tip
(In reply to Jani Saarinen from comment #10)
> You can get drm-tip from: https://cgit.freedesktop.org/drm-tip.
Attached are the dmesg (with debug enabled) and the error from the card with drm-tip. The issue persists.
Mika, Chris, any advice here?
I looked at this and discussed it with Chris on IRC; here are the findings:
The HW RING START is not from the request that was queued to hardware, and the
GPU is dormant on a previous request's tail.
Please retest with an up-to-date drm-tip fetched from https://cgit.freedesktop.org/drm-tip, and prevent the driver from loading the DMC firmware by moving the DMC firmware binaries out of /lib/firmware/i915.
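A minimal sketch of moving the DMC binaries aside, in case it helps. The function name stash_dmc_firmware and the backup directory are hypothetical, and the *dmc* glob assumes the usual file naming (for example kbl_dmc_ver1_04.bin). If i915 is loaded from an initramfs that carries its own copy of the firmware, that copy may also need regenerating; that depends on the setup and is not handled here.

```shell
# Hypothetical helper: move every file whose name contains "dmc"
# from the firmware directory ($1) into a backup directory ($2).
stash_dmc_firmware() {
    src=$1
    dst=$2
    mkdir -p "$dst" || return 1
    for f in "$src"/*dmc*; do
        [ -e "$f" ] || continue      # skip when the glob matched nothing
        mv -- "$f" "$dst"/ || return 1
    done
}

# Usage as root on the affected machine (not run here):
#   stash_dmc_firmware /lib/firmware/i915 /root/i915-dmc-backup
# To restore afterwards, move the files back from the backup directory.
```

After rebooting, the dmesg should no longer show the driver loading a DMC firmware image; other i915 firmware files in the directory are left untouched.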
Created attachment 139062
dmesg with drm-tip 4.17.0-rc2 (d04fd4f6d93cea918521059db8358ff9e7a4a03b)
Created attachment 139063
/sys/class/drm/card0/error with drm-tip 4.17.0-rc2 (d04fd4f6d93cea918521059db8358ff9e7a4a03b)
Retested with the latest drm-tip; the issue looks the same.
Is there anything else besides the dmesg and /sys/class/drm/card0/error that I can help with? I can arrange access to the laptop in question.