Summary: | gfx_v8_0_priv_reg_irq [amdgpu]] *ERROR* Illegal register access in command stream | |
---|---|---|---
Product: | DRI | Reporter: | dwagner <jb5sgc1n.nya>
Component: | DRM/AMDgpu | Assignee: | Default DRI bug account <dri-devel>
Status: | RESOLVED MOVED | QA Contact: |
Severity: | major | |
Priority: | medium | |
Version: | DRI git | |
Hardware: | x86-64 (AMD64) | |
OS: | Linux (All) | |
Description
dwagner
2018-08-09 21:13:16 UTC
Created attachment 141028 [details]
dmesg, ending at crash
Created attachment 141029 [details]
X11 log
Is this reproducible or was it a one time event?

So far it has been a one-time event. It was probably unrelated to the "echo manual >power_dpm_force_performance_level" setting I mentioned above: I still need that setting to keep the kernel from crashing every few minutes (this is the subject of https://bugs.freedesktop.org/show_bug.cgi?id=102322 ).

I can reproduce this in a very specific way (discovered while reproducing bug 102322): with the amdgpu driver and the RADV Vulkan implementation, with DXVK 1.2.1, running "House Flipper" from Steam (wine-staging 4.8) on a 2560x1440 144Hz display (DisplayPort). It crashes with the AMDVLK implementation as well, but with a different message. It usually happens within 2 minutes of firing up the game. It's notable that this *does not* occur if I render the game in 1080p and scale it up for the screen.

* 5.1.3-arch2-1-ARCH
* LLVM 8.0.0
* vulkan-radeon/mesa 19.0.4

The register whose access is rejected flips between TC1 and TC2, seemingly nondeterministically.

I'm sorry for the poor information, but I'm not used to developing/debugging software at the kernel level. Let me know what information I can provide to be helpful, and I'd be happy to fish it out for you. Thanks in advance for your work and the help.

I also tried to reproduce with amdgpu.vm_update_mode=3, but I can't get Xorg to launch with that setting (the kernel, not the GPU, fails on a page request with that setting on, but that might be due to a lower amount of RAM and the fact that I'm running an RX 590 with 8GB of GDDR5, so it might just be trying to allocate too much memory?).

The failures do NOT occur if I disable dynamic power management with amdgpu.dpm=0, but obviously performance sucks at those low clock speeds: the game gets about 14fps. Manual power management fared no better, but some quick debugging showed that it might be getting overridden by DXVK's DXGI implementation.
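For anyone comparing configurations, here is a minimal sketch (not part of the original report) for checking which of the settings mentioned above are actually in effect. It assumes the GPU is `card0` and that the amdgpu parameters were passed on the kernel command line rather than through modprobe configuration; the helper names are hypothetical.

```python
from pathlib import Path


def read_text_or_none(path):
    """Return the stripped contents of a file, or None if it cannot be read."""
    try:
        return Path(path).read_text().strip()
    except OSError:
        return None


def parse_amdgpu_params(cmdline):
    """Collect amdgpu.<name>=<value> tokens from a kernel command line string."""
    params = {}
    for token in cmdline.split():
        if token.startswith("amdgpu.") and "=" in token:
            key, value = token[len("amdgpu."):].split("=", 1)
            params[key] = value
    return params


def read_performance_level(card="card0"):
    """Read power_dpm_force_performance_level for the given DRM card.

    Assumption: the card index is card0; it may differ on multi-GPU systems.
    """
    return read_text_or_none(
        f"/sys/class/drm/{card}/device/power_dpm_force_performance_level"
    )


if __name__ == "__main__":
    cmdline = read_text_or_none("/proc/cmdline") or ""
    print("amdgpu parameters:", parse_amdgpu_params(cmdline))
    print("performance level:", read_performance_level())
```

Attaching this output alongside dmesg makes it easier to tell whether a given crash happened with amdgpu.dpm=0, with the manual performance level, or with the defaults.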
I also logged `sensors` output, which showed that the failures often occur quickly after the card reaches its maximum power draw at a little over 190W. I thought about increasing that, but I didn't want to fry my hardware since I don't have much experience mucking around with overclocking/overvolting GPUs.

-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/475.