Created attachment 145593 [details]
Occasionally, usually while watching videos in Firefox, my GPU will hang and the screen will freeze -- sound and keyboard input still work in the background, and I need to use REISUB hotkeys to reboot. This is separate, in addition to the sdma and ring gfx_0.0.0 hangs.
Upon rebooting, journalctl shows the attached "divide error". I've included logs from 3 instances of it happening. I'm currently using the Jul 14 firmware from Fedora's linux-firmware package as the hang appears to occur more often on the newer firmware from https://people.freedesktop.org/~agd5f/radeon_ucode/navi10/ however this may just be placebo.
It occurs with or without the "0-sized IBs" kernel patch from https://bugs.freedesktop.org/show_bug.cgi?id=111481#c33 and on both PCIe 3.0 and 4.0. I'm not using a PCIe riser and the card works without issue on Windows 10 dual boot.
GPU: Sapphire 5700XT (reference)
Motherboard: Gigabyte X570-I (BIOS F4)
Mesa: mesa-git 1:19.3.0_devel.115682.3c966fd688c-1
LLVM: llvm-git 10.0.0_r327425.63f6066b53d-1
Please let me know if any more information would be helpful, or if there's anything I can do to troubleshoot. Thanks.
Created attachment 145594 [details]
Created attachment 145595 [details]
I also get this error frequently with amd-staging-drm-next, but not with 5.4-rcX (at least I can't remember getting one with the latter).
Not sure if that suggests there is a regression, or something to do with the 5.3 kernel specifically (I don't remember having the error when amd-staging-drm-next was using 5.2 kernel).
I just want to add that I do still get this bug with 5.4-rcX, unfortunately. It's the only remaining non-Mesa hang that I haven't been able to workaround.
-- GitLab Migration Automatic Message --
This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.
You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/926.