Summary: | [radeonsi] page fault, umr dump | ||
---|---|---|---|
Product: | Mesa | Reporter: | Domen <domen.stangar> |
Component: | Drivers/Gallium/radeonsi | Assignee: | Default DRI bug account <dri-devel> |
Status: | RESOLVED FIXED | QA Contact: | Default DRI bug account <dri-devel> |
Severity: | normal | ||
Priority: | medium | ||
Version: | 18.3 | ||
Hardware: | x86-64 (AMD64) | ||
OS: | Linux (All) | ||
See Also: | https://bugs.freedesktop.org/show_bug.cgi?id=108261 | ||
Whiteboard: | |||
i915 platform: | i915 features: | ||
Attachments: |
umr dump
gallium dump t1 gallium dump t0 trace events amdgpu another gallium dump |
Description
Domen
2018-11-20 21:39:10 UTC
Created attachment 142535 [details]
umr dump
Created attachment 142536 [details]
gallium dump t1
Created attachment 142537 [details]
gallium dump t0
Created attachment 142538 [details]
trace events amdgpu
Attached logs [ 332.004841] amdgpu 0000:0e:00.0: GPU fault detected: 147 0x0f800802 for process roles pid 1043 thread roles:cs0 pid 1044 [ 332.004844] amdgpu 0000:0e:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x000EA1F0 [ 332.004845] amdgpu 0000:0e:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x04008002 [ 332.004848] amdgpu 0000:0e:00.0: VM fault (0x02, vmid 2, pasid 32769) at page 958960, read from 'TC2' (0x54433200) (8) Created attachment 142598 [details]
another gallium dump
another dump, tried with propriery nvidia drivers. it works fine there.
Looks like sctx->bindless_descriptors->gpu_address is not accessable by gpu. 2e00000 is not in buffer list. c0017600 SET_SH_REG: 0000014d 02e00000 SPI_SHADER_USER_DATA_COMMON_1 <- 0x02e00000 [ 174.469016] amdgpu 0000:38:00.0: [gfxhub] VMC page fault (src_id:0 ring:32 vmid:2 pasid:32769, for process roles pid 398 thread roles:cs0 pid 399) [ 174.469021] amdgpu 0000:38:00.0: in page starting at address 0x0000800002e04000 from 27 [ 174.469023] amdgpu 0000:38:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00240C40 [ 184.763074] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=583, emitted seq=585 Well this is bug when using bindless textures and framebuffer which is also resident in bindless textures. There is no more fault if i comment out si_upload_bindless_descriptor function. radeon_emit(cs, PKT3(PKT3_WRITE_DATA, 2 + num_dwords, 0)); radeon_emit(cs, S_370_DST_SEL(V_370_TC_L2) | S_370_WR_CONFIRM(1) | S_370_ENGINE_SEL(V_370_ME)); radeon_emit(cs, va); radeon_emit(cs, va >> 32); |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.