Bug 107913 - amdgpu_drv.so segfaults on boot; black screen
Status: RESOLVED FIXED
Alias: None
Product: xorg
Classification: Unclassified
Component: Driver/AMDgpu
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
Importance: medium normal
Assignee: xf86-video-ati maintainers
QA Contact: Xorg Project Team
 
Reported: 2018-09-12 22:42 UTC by freedesktop
Modified: 2018-09-14 07:33 UTC

Attachments
Xorg.0.log including the segfault (25.45 KB, text/plain)
2018-09-12 22:42 UTC, freedesktop

Description freedesktop 2018-09-12 22:42:01 UTC
Created attachment 141539
Xorg.0.log including the segfault

History (maybe only partly related to the problem):
I followed this guide to update my drivers: https://github.com/ValveSoftware/Proton/blob/proton_3.7/PREREQS.md

sudo add-apt-repository ppa:paulo-miguel-dias/mesa
sudo apt dist-upgrade
sudo apt install mesa-vulkan-drivers mesa-vulkan-drivers:i386

This worked fine for a few days. Then I updated my system as usual (via apt), and after a reboot the machine came up with a black screen.

I found a segfault in my Xorg.0.log (attached). Blacklisting the module "amdgpu" lets me boot into the desktop using only the integrated Intel GPU.

Some information that is hopefully helpful:

$ lspci -k
00:02.0 VGA compatible controller: Intel Corporation HD Graphics 620 (rev 02)
	Subsystem: Dell HD Graphics 620
	Kernel driver in use: i915
	Kernel modules: i915
01:00.0 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] Topaz XT [Radeon R7 M260/M265 / M340/M360 / M440/M445] (rev c3)
	Subsystem: Dell Radeon R7 M445
	Kernel modules: amdgpu

$ uname -a
Linux notebook-kai 4.15.0-34-generic #37-Ubuntu SMP Mon Aug 27 15:21:48 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

My PC is a notebook with an integrated Intel GPU and a dedicated AMD GPU.
OS is Linux Mint 19 Cinnamon
CPU is Intel© Core™ i7-7500U CPU @ 2.70GHz × 2

What else is needed?
Can I somehow re-install or downgrade the amdgpu-drivers?
Comment 1 Michel Dänzer 2018-09-13 08:58:27 UTC
Please also attach the corresponding dmesg output.

Please install the xserver-xorg-video-amdgpu-dbg package, attach a new Xorg log file from running with that installed, and either get a backtrace with gdb (per https://www.x.org/wiki/Development/Documentation/ServerDebugging/), or provide the output of the following commands:

 addr2line -e /usr/lib/xorg/modules/drivers/amdgpu_drv.so 0x7f9e326d3000+0x17305
 addr2line -e /usr/lib/xorg/modules/drivers/amdgpu_drv.so 0x7f9e326d3000+0xedfd
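
Both arguments above are written as a base address plus an offset. As a minimal sketch (abs_addr is a made-up helper name, and it assumes a shell with 64-bit arithmetic), the absolute address each pair adds up to can be computed like this:

```shell
#!/bin/sh
# abs_addr is a hypothetical helper, not part of any tool mentioned here.
# It prints base + offset as a hexadecimal address.
abs_addr() {
    printf '0x%x\n' $(($1 + $2))
}

# Values taken from the two addr2line commands above.
base=0x7f9e326d3000
for offset in 0x17305 0xedfd; do
    printf '%s+%s = %s\n' "$base" "$offset" "$(abs_addr "$base" "$offset")"
done
```

This only does the arithmetic; it does not replace running addr2line with the debug package installed.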

For the time being, assuming there are no display connectors on the AMD GPU (that you need to use), you can avoid the problem by putting this in /etc/X11/xorg.conf:

Section "ServerFlags"
       Option  "AutoAddGPU" "off"
EndSection
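
If the file does not exist yet, one way to create it is sketched below. This is only an illustration: the OUT variable and the scratch file name are made up here, and the snippet writes to the current directory so the result can be reviewed before it is copied into place as root.

```shell
#!/bin/sh
# OUT is a hypothetical variable; by default the section is written to a
# scratch file in the current directory, not to /etc/X11/xorg.conf.
OUT="${OUT:-./xorg.conf.autoaddgpu}"

# Write the ServerFlags section exactly as given above.
cat > "$OUT" <<'EOF'
Section "ServerFlags"
       Option  "AutoAddGPU" "off"
EndSection
EOF

# Print the result for review.
cat "$OUT"

# To apply it (needs root), assuming no /etc/X11/xorg.conf exists yet:
#   sudo cp "$OUT" /etc/X11/xorg.conf
```

If /etc/X11/xorg.conf already exists, add the section to that file by hand instead of overwriting it.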
Comment 2 Michel Dänzer 2018-09-13 09:52:17 UTC
I was able to reproduce the crash (it only happens when the kernel driver doesn't expose any display outputs); https://patchwork.freedesktop.org/patch/249254/ fixes it.
Comment 3 Michel Dänzer 2018-09-13 14:35:29 UTC
Thanks for the report, fixed in Git master:

commit 6572be49b713a26eca14f16e1854cabf28101288
Author: Michel Dänzer <michel.daenzer@amd.com>
Date:   Thu Sep 13 11:44:21 2018 +0200

    Bail from drmmode_cm_init if there's no CRTC
Comment 4 freedesktop 2018-09-13 19:35:00 UTC
Thanks a lot for the fix, that was pretty quick!

Is any of the requested information still relevant? Maybe for documentation purpose?
Comment 5 Michel Dänzer 2018-09-14 07:33:17 UTC
(In reply to freedesktop from comment #4)
> Is any of the requested information still relevant? Maybe for documentation
> purpose?

Thanks for asking, but it's not needed anymore.

