Bug 108685

Summary: Raven Ridge: VMC page fault on resume from suspend
Product: DRI Reporter: Samantha McVey <samantham>
Component: DRM/AMDgpuAssignee: Default DRI bug account <dri-devel>
Status: RESOLVED NOTOURBUG QA Contact:
Severity: normal    
Priority: medium    
Version: XOrg git   
Hardware: Other   
OS: All   
Whiteboard:
i915 platform: i915 features:
Attachments:
Description Flags
kernel log with lockup on resume none

Description Samantha McVey 2018-11-07 05:22:49 UTC
Created attachment 142395 [details]
kernel log with lockup on resume

This is using 4.20.0-rc1-2.g0587eb8. I believe this is the issue I have had in the past where the system does not resume from suspend, or at least the graphics are locked up when I do.

I am able to fix these resume from suspend gpu lockups by using a script to disable C6 states on the CPU before suspend and then enabling C6 states on resume (this uses zenstates.py to set MSR through a systemd sleep script).

Nov 06 20:26:29 kernel: gmc_v9_0_process_interrupt: 35 callbacks suppressed
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200135
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200135
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200135
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200134
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200134
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200134
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200134
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200134
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200134
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:154 vmid:2 pasid:32771, for process kwin_x11 pid 2792 thread amdgpu_cs:0 pid 2906)
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0:   in page starting at address 0x0000800100aaa000 from 18
Nov 06 20:26:29 kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00200134
Nov 06 20:26:29 kernel: PM: suspend exit
Comment 1 Samantha McVey 2018-11-30 03:22:55 UTC
This seems to have been fixed with Lenovo A485 BIOS 1.11. I haven't encountered any issues even after disabling my script which disabled C6 states before suspend and enabled them again on resume.

Going to close this now.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.