Summary: |
VMC page fault and Coherent Slave Error: Address violation after upgrading to 4.20 |
Product: |
DRI
|
Reporter: |
Zheng Luo <vicluo96> |
Component: |
DRM/AMDgpu | Assignee: |
Default DRI bug account <dri-devel> |
Status: |
RESOLVED
DUPLICATE
|
QA Contact: |
|
Severity: |
major
|
|
|
Priority: |
medium
|
CC: |
jv356, vicluo96
|
Version: |
DRI git | |
|
Hardware: |
x86-64 (AMD64) | |
|
OS: |
Linux (All) | |
|
See Also: |
https://bugs.freedesktop.org/show_bug.cgi?id=108992
|
Whiteboard: |
|
i915 platform:
|
|
i915 features:
|
|
Attachments: |
|
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.
Created attachment 142927 [details] kernel log on 4.20.arch1-1 I'm currently using AMD 2500U on Thinkpad E585 with Archlinux kernel 4.20.arch1-1. Everything works fine with 4.19.12. However after upgrading to 4.20.arch1-1, system crashes after gnome-shell starts. The error log reports: Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:158 vmid:1 pasid:32768, for process gnome-shell pid 1008 thread gnome-shel:cs0 pid 1022) Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: in page starting at address 0x0000800100020000 from 18 Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x0010013C Dec 31 11:57:09 lzThinkpad kernel: mce: [Hardware Error]: Machine check events logged Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Deferred error, no action required. Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: CPU:0 (17:11:0) MC20_STATUS[-|-|MiscV|-|AddrV Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:158 vmid:1 pasid:32768, for process gnome-shell pid 1008 thread gnome-shel:cs0 pid 1022) Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: IPID: 0x0000002e00000000, Syndrome: 0x000000005b240205 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Coherent Slave Error: Address violation. Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Deferred error, no action required. Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: cache level: L3/GEN, mem/io: IO, mem-tx: IRD, part-proc: SRC (no timeout) Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Coherent Slave Extended Error Code: 1 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 (more Error Addr lines omitted)