Bug 88415

Summary: [NVA8] nouveau GPU lockup when reclocking
Product: xorg
Component: Driver/nouveau
Version: unspecified
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: NEW
Severity: normal
Priority: medium
Reporter: Benedikt Morbach <benedikt.morbach>
Assignee: Nouveau Project <nouveau>
QA Contact: Xorg Project Team <xorg-team>
CC: nouveau
Attachments:
Description            Flags
full dmesg             none
vbios                  none
dmesg of second try    none

Description Benedikt Morbach 2015-01-14 18:06:35 UTC
Created attachment 112240 [details]
full dmesg

kernel: 3.19-rc4
hardware: Thinkpad T510, Quadro NVS 3100M
full dmesg attached, nouveau parts below

What I did:
 1. Downclocked before starting X
    -> '[0000:01:00.0] invalid/missing rammap entry' in dmesg

 2. Started X
    -> worked fine (could run glxgears, didn't try much else)

 3. Returned to the default performance level
    -> fine

 4. Downclocked again
    -> lockup; broke out of it with SysRq and saved dmesg

Any other logs I should produce?
Do you want a more verbose dmesg?

dmesg | grep nouveau:
[   24.080412] nouveau  [  DEVICE][0000:01:00.0] BOOT0  : 0x0a8600a2
[   24.080417] nouveau  [  DEVICE][0000:01:00.0] Chipset: GT218 (NVA8)
[   24.080420] nouveau  [  DEVICE][0000:01:00.0] Family : NV50
[   24.112811] nouveau  [   VBIOS][0000:01:00.0] using image from PRAMIN
[   24.113012] nouveau  [   VBIOS][0000:01:00.0] BIT signature found
[   24.113016] nouveau  [   VBIOS][0000:01:00.0] version 70.18.45.00.09
[   24.114500] nouveau  [     PMC][0000:01:00.0] MSI interrupts enabled
[   24.114546] nouveau  [     PFB][0000:01:00.0] RAM type: DDR3
[   24.114549] nouveau  [     PFB][0000:01:00.0] RAM size: 512 MiB
[   24.114551] nouveau  [     PFB][0000:01:00.0]    ZCOMP: 960 tags
[   24.117885] nouveau  [    VOLT][0000:01:00.0] GPU voltage: 850000uv
[   24.147955] nouveau  [  PTHERM][0000:01:00.0] FAN control: none / external
[   24.147972] nouveau  [  PTHERM][0000:01:00.0] fan management: automatic
[   24.147978] nouveau  [  PTHERM][0000:01:00.0] internal sensor: yes
[   24.168063] nouveau  [     CLK][0000:01:00.0] 03: core 135 MHz shader 270 MHz memory 135 MHz
[   24.168070] nouveau  [     CLK][0000:01:00.0] 07: core 405 MHz shader 810 MHz memory 405 MHz
[   24.168074] nouveau  [     CLK][0000:01:00.0] 0f: core 606 MHz shader 1468 MHz memory 790 MHz
[   24.168117] nouveau  [     CLK][0000:01:00.0] --: core 405 MHz shader 810 MHz memory 405 MHz
[   24.168493] nouveau  [     DRM] VRAM: 512 MiB
[   24.168495] nouveau  [     DRM] GART: 1048576 MiB
[   24.168499] nouveau  [     DRM] TMDS table version 2.0
[   24.168501] nouveau  [     DRM] DCB version 4.0
[   24.168504] nouveau  [     DRM] DCB outp 00: 01800323 00010034
[   24.168506] nouveau  [     DRM] DCB outp 01: 02811300 00000000
[   24.168508] nouveau  [     DRM] DCB outp 02: 028223a6 0f220010
[   24.168510] nouveau  [     DRM] DCB outp 03: 02822362 00020010
[   24.168511] nouveau  [     DRM] DCB outp 04: 048333b6 0f220010
[   24.168513] nouveau  [     DRM] DCB outp 05: 04833372 00020010
[   24.168515] nouveau  [     DRM] DCB outp 06: 088443c6 0f220010
[   24.168517] nouveau  [     DRM] DCB outp 07: 08844382 00020010
[   24.168519] nouveau  [     DRM] DCB conn 00: 00000040
[   24.168521] nouveau  [     DRM] DCB conn 01: 00000100
[   24.168523] nouveau  [     DRM] DCB conn 02: 00101246
[   24.168524] nouveau  [     DRM] DCB conn 03: 00202346
[   24.168526] nouveau  [     DRM] DCB conn 04: 00410446
[   24.236201] nouveau  [     DRM] MM: using COPY for buffer copies
[   25.443491] nouveau  [     DRM] allocated 1600x900 fb: 0x70000, bo ffff8800b8decc00
[   25.443614] fbcon: nouveaufb (fb0) is primary device
[   25.763829] nouveau 0000:01:00.0: fb0: nouveaufb frame buffer device
[   25.763831] nouveau 0000:01:00.0: registered panic notifier
[   25.766694] [drm] Initialized nouveau 1.2.1 20120801 for 0000:01:00.0 on minor 0
[  107.585436] nouveau E[     PFB][0000:01:00.0] invalid/missing rammap entry
[  612.287829] nouveau E[     DRM] GPU lockup - switching to software fbcon
[  626.123229] nouveau E[  X[907]] failed to idle channel 0xcccc0000 [X[907]]
[  641.123107] nouveau E[  X[907]] failed to idle channel 0xcccc0000 [X[907]]
[  643.129470] nouveau E[   PFIFO][0000:01:00.0] channel 6 [X[907]] unload timeout
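
For anyone triaging a log like the one above, the error lines can be pulled out programmatically. The following is a minimal sketch, assuming the line layout seen in this dmesg (an `E` immediately before the bracketed subsystem marks an error). The names `ERROR_LINE` and `nouveau_errors` are hypothetical helpers, not part of nouveau or any existing tool.

```python
import re

# Matches nouveau error lines as they appear in the log above, e.g.
#   [  107.585436] nouveau E[     PFB][0000:01:00.0] invalid/missing rammap entry
# Assumption: the optional second bracket is the PCI address of the device.
ERROR_LINE = re.compile(
    r"^\[\s*(?P<time>\d+\.\d+)\] nouveau E\[\s*(?P<subsys>.+?)\]"
    r"(?:\[(?P<pci>[0-9a-f:.]+)\])? (?P<msg>.*)$"
)


def nouveau_errors(dmesg_text):
    """Return (timestamp, subsystem, message) for each nouveau error line."""
    errors = []
    for line in dmesg_text.splitlines():
        m = ERROR_LINE.match(line.strip())
        if m:
            errors.append((float(m["time"]), m["subsys"], m["msg"]))
    return errors
```

Feeding it the output of `dmesg` (for example, read from a saved attachment) gives a compact list of errors in timestamp order, which makes the gap between the rammap warning and the later lockup easy to see.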
Comment 1 Ilia Mirkin 2015-01-14 18:09:10 UTC
Please upload your VBIOS as well (cat /sys/kernel/debug/dri/0/vbios.rom > gt218-vbios.rom).

Which levels exactly did you clock between? 03 and 07? Or did you use 0f?
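
To make the level names concrete, the `CLK` lines in the dmesg above can be turned into a small table. This is only a sketch, assuming the exact `core ... shader ... memory ...` layout shown in this report; `CLK_LINE` and `perf_levels` are hypothetical names. The `--` entry is kept as-is; in this log it carries the same clocks as level 07, so it appears to report the clocks in effect at boot.

```python
import re

# Matches lines such as:
#   nouveau  [     CLK][0000:01:00.0] 0f: core 606 MHz shader 1468 MHz memory 790 MHz
CLK_LINE = re.compile(
    r"\[\s*CLK\]\[[^\]]+\] (?P<level>[0-9a-f]{2}|--): "
    r"core (?P<core>\d+) MHz shader (?P<shader>\d+) MHz memory (?P<memory>\d+) MHz"
)


def perf_levels(dmesg_text):
    """Map each level id to its (core, shader, memory) clocks in MHz."""
    levels = {}
    for line in dmesg_text.splitlines():
        m = CLK_LINE.search(line)
        if m:
            levels[m["level"]] = (
                int(m["core"]),
                int(m["shader"]),
                int(m["memory"]),
            )
    return levels
```

With the log from this report, the result has the keys `03`, `07`, `0f`, and `--`, so a transition such as 03 to 07 can be read off as a change from 135 MHz to 405 MHz memory clock.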
Comment 2 Benedikt Morbach 2015-01-14 18:13:33 UTC
Created attachment 112241 [details]
vbios
Comment 3 Benedikt Morbach 2015-01-14 18:14:06 UTC
> Which levels exactly did you clock between? 03 and 07? Or did you use 0f?

between 03 and 07
Comment 4 Benedikt Morbach 2015-01-14 18:18:42 UTC
Created attachment 112242 [details]
dmesg of second try

I just tried again, and this time I could reclock between 03 and 07 while X was running, albeit with quite a few errors in dmesg.

But when I tried to go to 0f, I got a lockup again, along with some minor corruption.
I used SysRq again to get out and dump dmesg (see attachment).
