Bug 112134

Summary: Failed to load firmware on Raven Ridge
Product: DRI
Component: DRM/AMDgpu
Reporter: Samuele Decarli <samuele.decarli>
Assignee: Default DRI bug account <dri-devel>
Status: RESOLVED INVALID
Severity: critical
Priority: not set
CC: samuele.decarli
Version: DRI git
Hardware: x86-64 (AMD64)
OS: Linux (All)

Description Samuele Decarli 2019-10-25 11:03:22 UTC
Using an HP EliteBook 745 G5 with an AMD Ryzen 7 PRO 2700U, running Arch Linux with kernel version 5.3.7, linux-firmware version 20190923.417a9c6-1, and mesa version 19.2.2.

The laptop boots, but fails to initialise the GPU, making it unusable. Booting with an older kernel version (4.19.80) works, and reveals the following messages in the log of the failed 5.3.7 boot:

Oct 25 12:43:06 archer kernel: amdgpu 0000:04:00.0: Direct firmware load for amdgpu/raven_kicker_rlc.bin failed with >
Oct 25 12:43:06 archer kernel: amdgpu 0000:04:00.0: gfx9: Failed to load firmware "amdgpu/raven_kicker_rlc.bin"
Oct 25 12:43:06 archer kernel: [drm:gfx_v9_0_sw_init.cold [amdgpu]] *ERROR* Failed to load gfx firmware!
Oct 25 12:43:06 archer kernel: [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* sw_init of IP block <gfx_v9_0> failed -2
Oct 25 12:43:06 archer kernel: amdgpu 0000:04:00.0: amdgpu_device_ip_init failed
Oct 25 12:43:06 archer kernel: amdgpu 0000:04:00.0: Fatal error during GPU init
Oct 25 12:43:06 archer kernel: [drm] amdgpu: finishing device.
Oct 25 12:43:06 archer kernel: [TTM] Finalizing pool allocator
Oct 25 12:43:06 archer kernel: [TTM] Finalizing DMA pool allocator
Oct 25 12:43:06 archer kernel: [TTM] Zone  kernel: Used memory at exit: 0 KiB
Oct 25 12:43:06 archer kernel: [TTM] Zone   dma32: Used memory at exit: 0 KiB
Oct 25 12:43:06 archer kernel: [drm] amdgpu: ttm finalized
Oct 25 12:43:06 archer kernel: ------------[ cut here ]------------
Oct 25 12:43:06 archer kernel: sysfs group 'fw_version' not found for kobject '0000:04:00.0'
Oct 25 12:43:06 archer kernel: WARNING: CPU: 5 PID: 355 at fs/sysfs/group.c:278 sysfs_remove_group+0x74/0x80
Oct 25 12:43:06 archer kernel: Modules linked in: amd64_edac_mod(-) amdgpu(+) edac_mce_amd kvm_amd msr ccp kvm joydev>
Oct 25 12:43:06 archer kernel:  libps2 ahci libahci libata crc32c_intel xhci_pci scsi_mod xhci_hcd ehci_pci ehci_hcd >
Oct 25 12:43:06 archer kernel: CPU: 5 PID: 355 Comm: systemd-udevd Not tainted 5.3.7-arch1-1-ARCH #1
Oct 25 12:43:06 archer kernel: Hardware name: HP HP EliteBook 745 G5/83D5, BIOS Q81 Ver. 01.08.00 07/16/2019
Oct 25 12:43:06 archer kernel: RIP: 0010:sysfs_remove_group+0x74/0x80
Oct 25 12:43:06 archer kernel: Code: ff 5b 48 89 ef 5d 41 5c e9 19 bb ff ff 48 89 ef e8 f1 b7 ff ff eb cc 49 8b 14 24>
Oct 25 12:43:06 archer kernel: RSP: 0018:ffffb4c6006039e0 EFLAGS: 00010282
Oct 25 12:43:06 archer kernel: RAX: 0000000000000000 RBX: ffffffffc1173ba0 RCX: 0000000000000000
Oct 25 12:43:06 archer kernel: RDX: 0000000000000001 RSI: 0000000000000092 RDI: 00000000ffffffff
Oct 25 12:43:06 archer kernel: RBP: 0000000000000000 R08: 0000000000000408 R09: 0000000000000004
Oct 25 12:43:06 archer kernel: R10: 0000000000000000 R11: 0000000000000001 R12: ffff9f02dd0250b0
Oct 25 12:43:06 archer kernel: R13: ffff9f02c7350018 R14: ffff9f02d18d65e0 R15: 0000000000000000
Oct 25 12:43:06 archer kernel: FS:  00007ffbcde2c840(0000) GS:ffff9f02dfd40000(0000) knlGS:0000000000000000
Oct 25 12:43:06 archer kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 25 12:43:06 archer kernel: CR2: 00007f0e64a20cb0 CR3: 000000041a66a000 CR4: 00000000003406e0
Oct 25 12:43:06 archer kernel: Call Trace:
Oct 25 12:43:06 archer kernel:  amdgpu_device_fini+0x462/0x496 [amdgpu]
Oct 25 12:43:06 archer kernel:  amdgpu_driver_unload_kms+0x4a/0x90 [amdgpu]
Oct 25 12:43:06 archer kernel:  amdgpu_driver_load_kms.cold+0xaa/0xaf [amdgpu]
Oct 25 12:43:06 archer kernel:  drm_dev_register+0x111/0x150 [drm]
Oct 25 12:43:06 archer kernel:  amdgpu_pci_probe+0xbd/0x120 [amdgpu]
Oct 25 12:43:06 archer kernel:  ? __pm_runtime_resume+0x49/0x60
Oct 25 12:43:06 archer kernel:  local_pci_probe+0x42/0x80
Oct 25 12:43:06 archer kernel:  ? pci_match_device+0xc5/0x100
Oct 25 12:43:06 archer kernel:  pci_device_probe+0x104/0x1a0
Oct 25 12:43:06 archer kernel:  really_probe+0xf0/0x380
Oct 25 12:43:06 archer kernel:  driver_probe_device+0xb6/0x100
Oct 25 12:43:06 archer kernel:  device_driver_attach+0x53/0x60
Oct 25 12:43:06 archer kernel:  __driver_attach+0x8a/0x150
Oct 25 12:43:06 archer kernel:  ? device_driver_attach+0x60/0x60
Oct 25 12:43:06 archer kernel:  ? device_driver_attach+0x60/0x60
Oct 25 12:43:06 archer kernel:  bus_for_each_dev+0x89/0xd0
Oct 25 12:43:06 archer kernel:  bus_add_driver+0x14a/0x1e0
Oct 25 12:43:06 archer kernel:  driver_register+0x6c/0xb0
Oct 25 12:43:06 archer kernel:  ? 0xffffffffc0c12000
Oct 25 12:43:06 archer kernel:  do_one_initcall+0x59/0x234
Oct 25 12:43:06 archer kernel:  do_init_module+0x5c/0x230
Oct 25 12:43:06 archer kernel:  load_module+0x218f/0x2400
Oct 25 12:43:06 archer kernel:  ? __se_sys_init_module+0x114/0x180
Oct 25 12:43:06 archer kernel:  __se_sys_init_module+0x114/0x180
Oct 25 12:43:06 archer kernel:  do_syscall_64+0x5f/0x1c0
Oct 25 12:43:06 archer kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
Oct 25 12:43:06 archer kernel: RIP: 0033:0x7ffbcf64ad2e
Oct 25 12:43:06 archer kernel: Code: 48 8b 0d 55 01 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00>
Oct 25 12:43:06 archer kernel: RSP: 002b:00007ffe696d7fe8 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
Oct 25 12:43:06 archer kernel: RAX: ffffffffffffffda RBX: 000055ed1ee6f680 RCX: 00007ffbcf64ad2e
Oct 25 12:43:06 archer kernel: RDX: 00007ffbcf29a84d RSI: 000000000084ea31 RDI: 000055ed1fe9f230
Oct 25 12:43:06 archer kernel: RBP: 00007ffbcf29a84d R08: 000055ed1ecb001a R09: 0000000000000002
Oct 25 12:43:06 archer kernel: R10: 000055ed1ecb0010 R11: 0000000000000246 R12: 000055ed1fe9f230
Oct 25 12:43:06 archer kernel: R13: 000055ed1ee89a90 R14: 0000000000020000 R15: 000055ed1ee6f680
Oct 25 12:43:06 archer kernel: ---[ end trace 8e90a248b2fec769 ]---
Oct 25 12:43:06 archer kernel: amdgpu: probe of 0000:04:00.0 failed with error -2
Oct 25 12:43:06 archer systemd-udevd[359]: renderD128: Failed to process device, ignoring: No such file or directory
Oct 25 12:43:06 archer systemd-udevd[373]: card0: Failed to process device, ignoring: No such file or directory
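
For anyone triaging a similar log, the missing firmware names can be pulled out mechanically. The following is a minimal Python sketch, assuming the kernel's "Direct firmware load for ... failed" wording seen in the log above; the function name `failed_firmware` and the `sample` string are illustrative only.

```python
import re

# Matches the kernel message "Direct firmware load for <name> failed ...".
# The wording is taken from the log above; other kernels may phrase it differently.
_FW_FAIL = re.compile(r"Direct firmware load for (\S+) failed")


def failed_firmware(log_text):
    """Return the firmware names that failed to load, in first-seen order."""
    seen = []
    for match in _FW_FAIL.finditer(log_text):
        name = match.group(1)
        if name not in seen:
            seen.append(name)
    return seen


# Two lines copied from the log above (the first is truncated in the original).
sample = (
    "Oct 25 12:43:06 archer kernel: amdgpu 0000:04:00.0: Direct firmware load "
    "for amdgpu/raven_kicker_rlc.bin failed with >\n"
    "Oct 25 12:43:06 archer kernel: amdgpu 0000:04:00.0: gfx9: Failed to load "
    'firmware "amdgpu/raven_kicker_rlc.bin"\n'
)

print(failed_firmware(sample))  # ['amdgpu/raven_kicker_rlc.bin']
```

In practice the input would be the saved kernel log of the failed boot rather than a hard-coded string.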

Comment 1 Alex Deucher 2019-10-25 19:43:15 UTC
You can grab the missing firmware from the linux-firmware tree:
https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/amdgpu
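
To confirm whether a given firmware file is actually installed before or after updating, a quick check along these lines may help. This is a rough sketch, not an official tool: it assumes firmware lives under `/lib/firmware` (the usual default, though the kernel can search other paths) and that files may carry a `.xz` or `.zst` suffix when the distribution ships compressed firmware. The function name `firmware_installed` is hypothetical.

```python
from pathlib import Path


def firmware_installed(name, root="/lib/firmware", suffixes=("", ".xz", ".zst")):
    """Return True if `name` exists under `root`, plain or compressed.

    `root` and `suffixes` are assumptions about a typical installation;
    adjust them to match the distribution and kernel configuration.
    """
    base = Path(root) / name
    return any(Path(str(base) + suffix).is_file() for suffix in suffixes)


# The file named in the error messages of this report.
print(firmware_installed("amdgpu/raven_kicker_rlc.bin"))
```

If this prints False on the affected machine, the file is missing from the installed linux-firmware package and can be copied into the `amdgpu` directory from the tree linked above.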

Comment 2 Samuele Decarli 2019-10-25 20:49:59 UTC
Arch just updated their linux-firmware package: it now includes the missing firmware, fixing the issue.

Thank you for your help though.
