Bug 109200 - VMC page fault and Coherent Slave Error: Address violation after upgrading to 4.20
Summary: VMC page fault and Coherent Slave Error: Address violation after upgrading to...
Status: RESOLVED DUPLICATE of bug 108992
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/AMDgpu (show other bugs)
Version: DRI git
Hardware: x86-64 (AMD64) Linux (All)
: medium major
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-12-31 20:16 UTC by Zheng Luo
Modified: 2018-12-31 21:05 UTC (History)
2 users (show)

See Also:
i915 platform:
i915 features:


Attachments
kernel log on 4.20.arch1-1 (2.49 MB, text/x-log)
2018-12-31 20:16 UTC, Zheng Luo
no flags Details

Description Zheng Luo 2018-12-31 20:16:26 UTC
Created attachment 142927 [details]
kernel log on 4.20.arch1-1

I'm currently using AMD 2500U on Thinkpad E585 with Archlinux kernel 4.20.arch1-1. Everything works fine with 4.19.12. However after upgrading to 4.20.arch1-1, system crashes after gnome-shell starts. The error log reports:

Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:158 vmid:1 pasid:32768, for process gnome-shell pid 1008 thread gnome-shel:cs0 pid 1022)
Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100020000 from 18
Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x0010013C
Dec 31 11:57:09 lzThinkpad kernel: mce: [Hardware Error]: Machine check events logged
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Deferred error, no action required.
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: CPU:0 (17:11:0) MC20_STATUS[-|-|MiscV|-|AddrV
Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:158 vmid:1 pasid:32768, for process gnome-shell pid 1008 thread gnome-shel:cs0 pid 1022)
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: IPID: 0x0000002e00000000, Syndrome: 0x000000005b240205
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Coherent Slave Error: Address violation.
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Deferred error, no action required.
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: cache level: L3/GEN, mem/io: IO, mem-tx: IRD, part-proc: SRC (no timeout)
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Coherent Slave Extended Error Code: 1
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
(more Error Addr lines omitted)
Comment 1 Zheng Luo 2018-12-31 20:24:40 UTC
w/ mesa 18.3.1-1, gnome-shell & mutter & gnome-desktop 3.30.2-1, wayland 1.16.0-1, libva 2.3.0-1, linux-firmware 20181218.0f22c85-1
Comment 2 Zheng Luo 2018-12-31 21:05:21 UTC
As mentioned in https://bugs.freedesktop.org/show_bug.cgi?id=108992, iommu=soft stills works. This looks like a duplicate of that issue.

*** This bug has been marked as a duplicate of bug 108992 ***


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.