Bug 108729

Summary: [Kaveri CIK 7400K] [regression, works on radeon] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!
Product: DRI Reporter: Vedran Miletić <vedran>
Component: DRM/AMDgpuAssignee: Default DRI bug account <dri-devel>
Status: RESOLVED FIXED QA Contact:
Severity: major    
Priority: medium Keywords: regression
Version: DRI git   
Hardware: All   
OS: All   
Whiteboard:
i915 platform: i915 features:
Attachments:
Description Flags
Full dmesg none

Description Vedran Miletić 2018-11-13 14:30:02 UTC
Same Kaveri 7400K as in bug 99353, so this is a regression.

[    0.000000] Command line: BOOT_IMAGE=/vmlinuz-4.18.18-300.fc29.x86_64 root=UUID=6721d330-05e1-4b6d-a862-ccc514fd41ff ro rhgb quiet radeon.cik_support=0 amdgpu.cik_support=1 LANG=hr_HR.UTF-8
[    0.000000] Kernel command line: BOOT_IMAGE=/vmlinuz-4.18.18-300.fc29.x86_64 root=UUID=6721d330-05e1-4b6d-a862-ccc514fd41ff ro rhgb quiet radeon.cik_support=0 amdgpu.cik_support=1
LANG=hr_HR.UTF-8
[    1.958265] [drm] radeon kernel modesetting enabled.
[    1.958322] fb: switching to radeondrmfb from EFI VGA
[    1.959264] radeon 0000:00:01.0: CIK support disabled by module param
[    2.149551] [drm] amdgpu kernel modesetting enabled.
[    2.184243] amdgpu 0000:00:01.0: VRAM: 512M 0x000000F400000000 - 0x000000F41FFFFFFF (512M used)                                                                                    
[    2.184245] amdgpu 0000:00:01.0: GTT: 1024M 0x0000000000000000 - 0x000000003FFFFFFF
[    2.184371] [drm] amdgpu: 512M of VRAM memory ready
[    2.184372] [drm] amdgpu: 2546M of GTT memory ready.
[    2.184494] [drm] amdgpu: dpm initialized
[    2.544367] [drm] Initialized amdgpu 3.26.0 20150101 for 0000:00:01.0 on minor 0
[    5.693549] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[    6.713708] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[    7.733862] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[    8.753952] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[    9.774041] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[   10.794129] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[   11.814217] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[   12.834303] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[   13.854390] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[   14.874478] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, trying to reset the ECPU!!!                                                                                  
[   14.894517] [drm:vce_v2_0_start [amdgpu]] *ERROR* VCE not responding, giving up!!!
[   14.894553] [drm:amdgpu_device_ip_set_powergating_state [amdgpu]] *ERROR* set_powergating_state of IP block <vce_v2_0> failed -110                                                 
[   15.904117] [drm:amdgpu_vce_ring_test_ib [amdgpu]] *ERROR* amdgpu: IB test timed out.
[   15.904157] [drm:amdgpu_ib_ring_tests [amdgpu]] *ERROR* amdgpu: failed testing IB on ring 12 (-110).                                                                               
[   15.904192] [drm:amdgpu_device_ip_late_init_func_handler [amdgpu]] *ERROR* ib ring test failed (-110).
Comment 1 Vedran Miletić 2018-11-13 14:34:34 UTC
dmesg with radeon for comparison:

[    0.000000] Command line: BOOT_IMAGE=/vmlinuz-4.18.18-300.fc29.x86_64 root=UUID=6721d330-05e1-4b6d-a862-ccc514fd41ff ro rhgb quiet radeon.cik_support=1 amdgpu.cik_support=0 LANG=hr_HR.UTF-8
[    0.000000] Kernel command line: BOOT_IMAGE=/vmlinuz-4.18.18-300.fc29.x86_64 root=UUID=6721d330-05e1-4b6d-a862-ccc514fd41ff ro rhgb quiet radeon.cik_support=1 amdgpu.cik_support=0
LANG=hr_HR.UTF-8
[    1.934388] [drm] radeon kernel modesetting enabled.
[    1.934429] fb: switching to radeondrmfb from EFI VGA
[    1.934855] radeon 0000:00:01.0: VRAM: 512M 0x0000000000000000 - 0x000000001FFFFFFF (512M used)                                                                                    
[    1.934857] radeon 0000:00:01.0: GTT: 2048M 0x0000000020000000 - 0x000000009FFFFFFF
[    1.941593] [drm] radeon: 512M of VRAM memory ready
[    1.941594] [drm] radeon: 2048M of GTT memory ready.
[    1.945188] [drm] radeon: dpm initialized
[    1.989743] radeon 0000:00:01.0: WB enabled
[    1.989761] radeon 0000:00:01.0: fence driver on ring 0 use gpu addr 0x0000000020000c00 and cpu addr 0x(____ptrval____)                                                            
[    1.989764] radeon 0000:00:01.0: fence driver on ring 1 use gpu addr 0x0000000020000c04 and cpu addr 0x(____ptrval____)                                                            
[    1.989766] radeon 0000:00:01.0: fence driver on ring 2 use gpu addr 0x0000000020000c08 and cpu addr 0x(____ptrval____)                                                            
[    1.989768] radeon 0000:00:01.0: fence driver on ring 3 use gpu addr 0x0000000020000c0c and cpu addr 0x(____ptrval____)                                                            
[    1.989770] radeon 0000:00:01.0: fence driver on ring 4 use gpu addr 0x0000000020000c10 and cpu addr 0x(____ptrval____)                                                            
[    1.990188] radeon 0000:00:01.0: fence driver on ring 5 use gpu addr 0x0000000000078d30 and cpu addr 0x(____ptrval____)                                                            
[    1.990342] radeon 0000:00:01.0: fence driver on ring 6 use gpu addr 0x0000000020000c18 and cpu addr 0x(____ptrval____)                                                            
[    1.990344] radeon 0000:00:01.0: fence driver on ring 7 use gpu addr 0x0000000020000c1c and cpu addr 0x(____ptrval____)                                                            
[    1.990394] radeon 0000:00:01.0: radeon: using MSI.
[    1.990418] [drm] radeon: irq initialized.
[    3.419140] [drm] Initialized radeon 2.50.0 20080528 for 0000:00:01.0 on minor 0
[    3.624369] [drm] amdgpu kernel modesetting enabled.
Comment 2 Alex Deucher 2018-11-13 14:54:01 UTC
Please attach your full dmesg output and xorg log if using X.
Comment 3 Vedran Miletić 2018-11-15 01:02:37 UTC
Created attachment 142468 [details]
Full dmesg

Full dmesg as requested. I'm not using X, but it would work with radeon since bug 99353 has been fixed. Not sure what would happen with amdgpu.
Comment 4 Vedran Miletić 2018-11-17 12:59:58 UTC
After upgrading the kernel to 4.19.2:

[    0.000000] Command line: BOOT_IMAGE=/vmlinuz-4.19.2-300.fc29.x86_64 root=UUID=6721d330-05e1-4b6d-a862-ccc514fd41ff ro rhgb quiet radeon.cik_support=0 amdgpu.cik_support=1 LANG=hr_HR.UTF-8
[    0.165807] Kernel command line: BOOT_IMAGE=/vmlinuz-4.19.2-300.fc29.x86_64 root=UUID=6721d330-05e1-4b6d-a862-ccc514fd41ff ro rhgb quiet radeon.cik_support=0 amdgpu.cik_support=1 LANG=hr_HR.UTF-8
[    2.163636] [drm] radeon kernel modesetting enabled.
[    2.163686] fb: switching to radeondrmfb from EFI VGA
[    2.164062] radeon 0000:00:01.0: CIK support disabled by module param
[    2.355575] [drm] amdgpu kernel modesetting enabled.
[    2.390577] amdgpu 0000:00:01.0: VRAM: 512M 0x000000F400000000 - 0x000000F41FFFFFFF (512M used)
[    2.390578] amdgpu 0000:00:01.0: GART: 1024M 0x0000000000000000 - 0x000000003FFFFFFF
[    2.390699] [drm] amdgpu: 512M of VRAM memory ready
[    2.390701] [drm] amdgpu: 2545M of GTT memory ready.
[    2.390834] [drm] amdgpu: dpm initialized
[    2.757608] [drm] Initialized amdgpu 3.27.0 20150101 for 0000:00:01.0 on minor 0

I'll check whether X/Wayland, 3D and video playback work and close the bug if it's all good.
Comment 5 Vedran Miletić 2018-11-19 18:48:06 UTC
Everything is good with 4.19.2.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.