Created attachment 136460 [details]
Appropriate part of dmesg
I get this 1-2/day after a brand new archlinux install. When the hang happens, the Xorg display is frozen until I'm kicked out of my session after ~30 seconds.
I don't know yet how to reproduce it. The hang usually happens after several hours of usage with several X windows open.
Some info about my system:
:: Kernel: 4.14.8-1-ARCH
:: Distro: archlinux
:: Model: LENOVO ThinkPad T570 20H90052MC
:: BIOS: N1VET37W (1.27)
The crash report and relevant dmesg output is attached.
Created attachment 136461 [details]
Contents of /sys/class/drm/card0/error
Hello Petr, could you please share your Mesa version? Thanks.
Hi Elizabeth, my Mesa version is 17.3.1.
Created attachment 136581 [details]
Contents of /sys/class/drm/card0/error for the 2nd hang
Created attachment 136582 [details]
Appropriate part of dmesg for the 2nd hang
Just had another hang. This time, I have executed a MATLAB command, which produced high volume of ASCII letters in my shell, just before the screen froze. The contents of dmesg and crash dump have been attached.
Created attachment 136650 [details]
Contents of /sys/class/drm/card0/error for the 3rd hang
Created attachment 136651 [details]
Appropriate part of dmesg for the 3rd hang
I have added info about yet another hang. Happened when approximately 64K lines were printed in xterm by a faulty Python script.
Created attachment 136816 [details]
Contents of /sys/class/drm/card0/error for the 4th hang
Created attachment 136817 [details]
Appropriate part of dmesg for the 4th hang
Another hang happened. This time with no particular reason whatsoever. The crash dump and dmesg is attached. P.
Created attachment 136822 [details]
Contents of /sys/class/drm/card0/error for the 5th hang
Created attachment 136823 [details]
Appropriate part of dmesg for the 5th hang
One more hang. This time during memory-expensive calculations.
I have the very same problem.
Instead of filing an own bug report I append my stuff here as I also have the identifier "GPU HANG: ecode 9:0:0x85dffffb" in my dmesg.
The hag occurs randomly 2-5 times a day. For me usually when I'm using libreoffice in KDE/Plasma on my up to date archlinux.
Description otherwise the same as in Petr's case.
Machine: Lenovo Thinkpad X1 Carbon 2016 generation.
I did *not* install the package xf86-video-intel. Should I?
I do not use any specific xorg configuration files.
I already reported this bug 1 year ago here: https://bugs.freedesktop.org/show_bug.cgi?id=99325
This bug showed the same behavior but identified differently (maybe the logging has changed and is now more explicit).
I also attach the output of dmesg and te /sys/class/drm/card0/error file.
Created attachment 136861 [details]
Appropriate part of dmesg for Rainbert's 1st hang
Created attachment 136862 [details]
Contents of /sys/class/drm/card0/error for Rainbert's 1st hang
Thanks for keeping track. Could you investigate if the random hangs can be reduced to some specific case, like the MATLAB command that Petr mentioned or any specific process related? It will be really useful to identify a reliable way to reproduce this. Also, any of you have tried with another desktop besides KDE? It will be helpful to know if this is actually desktop dependent. And as you mention about bug 99325, could you attach xorg log from a hanged session as requested by Mark. Also, dmesg messages of hang are no more needed since we have the error states attached.
Thanks for the update.
In my case it seems that the hangs never happen immediately after booting. Instead, most of them happen few hours after. That suggests to me that the problem might somehow be related to memory usage (acessing invalid buffers? not enough memory to continue?). In two of the four documented cases (so far), the hang occurred as excessive amount of output characters was printed into xterm (infinite loops, faulty scripts). That could either be linked to the memory problem or to another similar root cause.
Also, please note that I use i3 WM instead of KDE in my setup. Sometimes I use two displays (via HDMI) in extended mode (not mirror) However, in all instances so far, the hang occurred in a single display mode.
Hello again everyone, new mesa 17.3.6 release includes important fixes for gpu hangs reported on games and DEs, could any of you try it and report back? If the issue still is happening with that version, a way to reliably reproduce this still will be the best approach.
(In reply to Elizabeth from comment #21)
> Hello again everyone, new mesa 17.3.6 release includes important fixes for
> gpu hangs reported on games and DEs, could any of you try it and report
> back? If the issue still is happening with that version, a way to reliably
> reproduce this still will be the best approach.
It has been almost a month since your last message. I'm currently running mesa 18.0.0 and no GPU hang has occured since the update to mesa 17.3.6.
I think it's safe to assume that the bug has been fixed in my case.
Thank you and cheers!