Created attachment 142409 [details]
Crash dump from the reported hang
My X session froze several times and eventually recovered when resuming from hibernate. Here is the message displayed (crash dump attached):
Nov 08 08:52:25 eric-macbookpro kernel: [drm] GPU HANG: ecode 8:0:0x37974124, in spotify , reason: hang on rcs0, action: reset
Nov 08 08:52:25 eric-macbookpro kernel: [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
Nov 08 08:52:25 eric-macbookpro kernel: [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
Nov 08 08:52:25 eric-macbookpro kernel: [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
Nov 08 08:52:25 eric-macbookpro kernel: [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
Nov 08 08:52:25 eric-macbookpro kernel: [drm] GPU crash dump saved to /sys/class/drm/card0/error
Nov 08 08:52:25 eric-macbookpro kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
Nov 08 08:52:35 eric-macbookpro kernel: i915 0000:00:02.0: Resetting rcs0 for hang on rcs0
Linux Distribution: Arch Linux
System Architecture: x86_64
Kernel Version: 4.18.16-arch1-1-ARCH
Display Connector: 2x eDP
ELSP: pid 10995, ban score 0, seqno 18:0034da3d, prio 0, emitted 1431653982ms, start 010d2000, head 00000000, tail 00000070
ELSP: pid 2718, ban score 0, seqno 1:0034da3e, prio 1024, emitted 1431653982ms, start 02eea000, head 00000bf8, tail 00000c60
HEAD: 0x00400bf0 [0x00000000]
head = 0x00000bf0, wraps = 2
TAIL: 0x00000bf0 [0x00000048, 0x00000070]
=> GPU switched contexts before completing the first, but failed to start the second context -- it didn't even see the TAIL update.
Eric, how often you can reproduce this issue? Any particular pattern causing this?
Have you tried to verify this issue with latest drm-tip?(https://cgit.freedesktop.org/drm-tip)
I can reproduce this issue fairly often, maybe once in every four attempts or so when I'm using Chromium as my web browser. With Firefox it never seems to happen. I'm not sure if that is because Chromium is using hardware acceleration or other features that Firefox does not, but it definitely happens with more frequency with Chromium.
The issue occurs when I hibernate and resume from hibernate. It's a longstanding issue that keeps coming up for me. See bug 102658 for example, which was closed without a fix.
Currently we are a bit limited in the ability to reproduce this issue. It would help if you could run the latest drm-tip on your system and report the logs.
Sure, I will try to reproduce the problem with drm-tip.
Since I was hitting this problem so often, I tried a few workarounds. Disabling hardware acceleration in Chromium has made the problem go away, although obviously I would prefer using hardware acceleration.
I'll build and run drm-tip, turn on Chromium hardware acceleration again and report back. Thanks.
Any news with recent kernels, Chromium rendering and hibernation?
I have not been able to attempt to reproduce the problem with drm-tip yet. The ZFS kernel module I require could not build on Linux 5.0+ until the March 4th release of version 0.7.13 so my build was unsuccessful.
In addition, I find the power consumption on my laptop is lower if I use Chromium in software rendering mode vs. using hardware acceleration so I've been tempted to leave hardware acceleration disabled.
I'll try the drm-tip build again this week now that the ZFS build issues are fixed and see how it goes.
I retried with drm-tip at d33bf3f6a140 now that I was able to get ZFS to build. With Chromium hardware acceleration enabled, I was able to do a bunch of hibernate / resume sequences without hitting any hangs. Closing this one out. Thanks.
Thanks for testing!