Created attachment 141196 [details]
"wait timed out!" in dmesg
We have an ASUS X570ZD laptop equipped with AMD Ryzen 7 2700U with Radeon Vega Mobile Gfx and NVIDIA GeForce GTX 1050 Mobile.
We tried with Linux kernel 4.18.3.
After system resumes from suspend, there is a prompted picture for a short time, then system goes no display for a long time.
After the long time, system turns off the display's backlight. User must presses any key to have the login screen.
The resume process is not smooth and hits the error:
[drm:hwss_edp_wait_for_hpd_ready [amdgpu]] *ERROR* hwss_edp_wait_for_hpd_ready: wait timed out!
We have the dmesg with amdgpu.dc_log=1 and drm.debug=6
Created attachment 141197 [details]
The resume process video
It is nouveau with parameters runpm=0 and noaccel=1 as the driver for NVIDIA GeForce GTX 1050 Mobile card.
Created attachment 141207 [details]
dmesg of boot into multi-user.target
If I boot system into "multi-user.target", then "systemctl suspend" and press key to resume. This issue is not reproduced.
Have you tried with nouveau disabled?
This issue was detected on 4.18.3 when using a somewhat reduced developer config. When using our usual full distro kernel config (from Ubuntu), the issue convincingly goes away.
Both Jian-Hong and I looked at the differences in the config and couldn't spot anything obvious that would explain different behaviour. Trying to bisect good and bad configs, for a while it looked like it might be CONFIG_NUMA_BALANCING=y and CONFIG_NUMA_BALANCING_DEFAULT_ENABLED=y but we then disproved that in further experiments. We didn't find which config option is responsible for the different behaviour.
There's probably a bug here somewhere, but as it's working on our shipped config I'll close this issue, as we need to focus our time on other issues on this platform.