Created attachment 136422 [details]
Starting from kernel 4.7 (I tried 4.7.4, 4.9.40, 4.11.12, 4.13.0-rc4, 4.14.0-rc6), the whole system freeze, at least 1 time per day (usually it freeze during the night when the system is idle). Only 1 time I successfully got the "GPU HANG" on dmesg.
The last known good kernel is 4.6.7.
The hang does not occur if I disable i915 kernel module.
Xorg version is 1.18.3.
Created attachment 136423 [details]
Hello Luca, could you bisect the issue?
Created attachment 136567 [details] [review]
Test case - disabling hpd polling
the hang/bug appears from 4.7.3, when hpd polling was introduced.
I disabled the hpd polling in 4.7.3 and 4.14.11: since now no more hangs.
On my platform (HP 250 G3 Notebook) it seems no problem are raised by disabling it, even pm-suspend works fine (but I did not try with external monitor).
Is there any other test I could perform to better address the problem?
First of all. Sorry about spam.
This is mass update for our bugs.
Sorry if you feel this annoying but with this trying to understand if bug still valid or not.
If bug investigation still in progress, please ignore this and I apologize!
If you think this is not anymore valid, please comment to the bug that can be closed.
If you haven't tested with our latest pre-upstream tree(drm-tip), can you do that also to see if issue is valid there still and if you cannot see issue there, please comment to the bug.
Closing, please re-open is issue still exists.
Bug still present in 5.0.4.
Latest attached patch for 4.14.11 (disabling HPD polling) works without any update.
(In reply to Luca Bonissi from comment #6)
> Bug still present in 5.0.4.
> Latest attached patch for 4.14.11 (disabling HPD polling) works without any
Can you please attach the latest dmesg (kernel 5.0.4) from boot with kernel parameters drm.debug=0x1e log_buf_len=4M?
Also, Can you attach the GPU crash dump file?
Created attachment 143885 [details]
dmesg with drm debug
(In reply to Lakshmi from comment #7)
> Can you please attach the latest dmesg (kernel 5.0.4) from boot with kernel
> parameters drm.debug=0x1e log_buf_len=4M?
Done, one just after boot, and the other one after some time (in the while DPMS was active)
> Also, Can you attach the GPU crash dump file?
Sorry, but I got GPU crash dump only one time: the other times the system was totally frozen...
(In reply to Luca Bonissi from comment #8)
> Created attachment 143885 [details]
> dmesg with drm debug
From the attached logs, I don't see GPU hang messages. Attached crash dump is from kernel 4.13.
So, the system hangs/frozen with latest kernel?
(In reply to Lakshmi from comment #10)
> From the attached logs, I don't see GPU hang messages. Attached crash dump
> is from kernel 4.13.
> So, the system hangs/frozen with latest kernel?
From kernel 4.7.3 the system hangs/frozen nearly one a day (it usually hangs during the night, when DPMS switched off the monitor), so it hangs also with the latest kernel 5.0.4.
Unfortunately, only 1 time (with kernel 4.13) the system did not freeze and I could get the GPU crash dump.
Also another system, with Intel N2840 CPU and the same integrated GPU, suffers the same problem (I lost some audio recording data due to the bug....). If I disabled DPMS, no problems occurred.
Anyway, the problem is identified somewhere in the "HPD polling" routines, introduced in kernel 4.7.3: if I remove these routines, the system works fine without any hangs (neither GPU nor totally).
It seemed something with mutex (try to lock when already locked...), but I am not a mutex/GPU expert....
Can you verify with Kernel 5.1 and attach logs if the hang occurs again?
Can you please try to reproduce the issue with drm-tip (https://cgit.freedesktop.org/drm-tip)
If persists on drmtip, please upload the dmesg and crash dump file.
No feedback from more than a month, closing as resolved works for me.
Please re-open this issue if persists with latest drm-tip https://cgit.freedesktop.org/drm-tip and send dmesg from boot with kernel parameters drm.debug=0x1e log_buf_len=4M?
Also attach the GPU crash dump file.