Bug 111980 - Rebbot and shutdown doesn't work on specific hardware
Summary: Rebbot and shutdown doesn't work on specific hardware
Status: RESOLVED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Radeon (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: not set not set
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2019-10-11 16:05 UTC by lei.pero
Modified: 2019-10-18 16:42 UTC (History)
1 user (show)

See Also:
i915 platform:
i915 features:


Attachments

Description lei.pero 2019-10-11 16:05:12 UTC
When system shutdown/reboot is initiated, everything works as it should, journal doesn't show any error and it reaches target reboot (for example), display shuts down properly, but PC stays powered on (requiring hard reset) indefinitely.

This happened after kernel upgrade, after bisectiing, commit that was to blame was bellow, after removing just that specific commit, it would work as expected:

894c414129a8d9ef1b2de443015e4dde6085f64f is the first bad commit
commit 894c414129a8d9ef1b2de443015e4dde6085f64f
Author: KyleMahlkuch <kmahlkuc@linux.vnet.ibm.com>
Date:   Wed Jul 31 17:10:14 2019 -0500

    drm/radeon: Fix EEH during kexec
    
    [ Upstream commit 6f7fe9a93e6c09bf988c5059403f5f88e17e21e6 ]
    
    During kexec some adapters hit an EEH since they are not properly
    shut down in the radeon_pci_shutdown() function. Adding
    radeon_suspend_kms() fixes this issue.
    
    Signed-off-by: KyleMahlkuch <kmahlkuc@linux.vnet.ibm.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

 drivers/gpu/drm/radeon/radeon_drv.c | 8 ++++++++
 1 file changed, 8 insertions(+)

Issue is described here, seems like others have similar issue that might be connected with this as well:
https://bbs.archlinux.org/viewtopic.php?id=249787
Comment 1 Tim Cuthbertson 2019-10-11 18:05:28 UTC
My system does this, too, since kernel 5.3.5-arch1-1-ARCH on Arch Linux.
Comment 2 lei.pero 2019-10-11 18:54:57 UTC
(In reply to Tim Cuthbertson from comment #1)
> My system does this, too, since kernel 5.3.5-arch1-1-ARCH on Arch Linux.

Can you test it without this commit? Same for me 5.3.5-arch1-1-ARCH introduced the bug.

I've used
git revert 894c414129a8d9ef1b2de443015e4dde6085f64f
to exclude that commit in stable branch (after bisecting) for 5.4-rc2 and it worked fine without it.
Comment 3 Tim Cuthbertson 2019-10-15 14:11:47 UTC
This morning, my kernel was upgraded to version 5.3.6-arch1-1-ARCH dated Oct 11. The problem persists.
Comment 4 Tim Cuthbertson 2019-10-18 13:20:15 UTC
This morning, my kernel was upgraded to version 5.3.7-arch1-1-ARCH dated Oct 18. The problem persists.
Comment 6 lei.pero 2019-10-18 16:21:23 UTC
(In reply to Alex Deucher from comment #5)
> Reverted:
> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/
> ?id=8d13c187c42e110625d60094668a8f778c092879

Great, I can close this bug report now?
Comment 7 Alex Deucher 2019-10-18 16:42:47 UTC
Yes.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.