Bug 71189

Summary: [r600g] GPU hangs with UT3
Product: xorg
Reporter: hej
Component: Driver/Radeon
Assignee: xf86-video-ati maintainers <xorg-driver-ati>
Status: RESOLVED INVALID
QA Contact: Xorg Project Team <xorg-team>
Severity: normal
Priority: medium
CC: christopher.m.penalver, jackdachef
Version: unspecified
Hardware: x86-64 (AMD64)
OS: Linux (All)
Whiteboard:
i915 platform:
i915 features:
Attachments:
Description (Flags)
syslog snippet and output from "uname -a" and "lspci" (none)
lspci output (none)
r600g_ut3_blackscreen-after-minutes_3.13-rc8+ (none)
r600g_full-debug-02_rayman-origins_no-info (none)
r600g_ut3_blackscreen-3.13-rc8 debug (none)

Description hej 2013-11-03 14:59:49 UTC
Created attachment 88564 [details]
syslog snippet and output from "uname -a" and "lspci"

While gaming, both through Wine and natively (Steam for Linux), the screens go into standby mode and X crashes or is closed.

I have attached the kernel and hardware specs and a snippet from the syslog that covers the time of the crash.
I am running Ubuntu 13.10, but I have also experienced this bug on Ubuntu 13.04. I'm using both the kernel and the radeon driver as shipped by Canonical.
Comment 1 Michel Dänzer 2013-11-12 03:44:28 UTC
I don't see a syslog snippet in the attachment.

Please attach the /var/log/Xorg.0.log file and the output of dmesg and glxinfo.
Comment 2 jackdachef 2014-01-19 01:11:59 UTC
got the same/similar behavior on my 5850:

the monitor turns off (behavior like when the gpu has turned off or in standby) and doesn't turn back on

switching to VT doesn't seem to make a change, the system keeps on running, the games too (e.g. Serious Sam 3 BFE from native Steam on Linux; or e.g. Rayman Origins from Steam via Wine 1.7.11)

the weird thing is that there's no error message in the syslog or kern.log


http://nopaste.info/380db9a43d.html


Unreal Tournament 3 worked fine several kernel versions back (as far as I know) when dpm got introduced

it's not really a regression, since in the past Serious Sam 3 BFE wouldn't even start up and the screen immediately turned black (+ monitor off/standby)

now game starts up and is playable for a few minutes before the screen goes off



same for Rayman Origins: playable for a few minutes - then suddenly the screen goes off


There must be some gpu error and/or unrecoverable issue (in the sense that gpu can't reset itself ?) involved

since after rebooting via Magic SYSRQ Key (this box doesn't have a Reset button /facepalm)

the Bios (Amibios) gives the following error signals:

long - short - short; gpu error

pcie_gen2 worked fine in the past, so that shouldn't be making things worse

tried it with radeon.audio=0 radeon.aspm=0 but that didn't make a change
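
For anyone retracing this step, it can help to confirm which radeon options actually reached the kernel by picking them out of the boot command line. The following is a minimal Python sketch, assuming the options were passed on the kernel command line (as above) rather than through a modprobe configuration file. The helper name `radeon_params` and the sample string are made up for illustration.

```python
def radeon_params(cmdline):
    """Return the radeon.* options found in a kernel command line string."""
    params = {}
    for token in cmdline.split():
        if token.startswith("radeon."):
            # "radeon.aspm=0" -> key "aspm", value "0"
            key, _, value = token[len("radeon."):].partition("=")
            params[key] = value
    return params


# Hypothetical command line; only the two radeon options come from this report.
sample = "BOOT_IMAGE=/vmlinuz root=/dev/sda1 ro radeon.audio=0 radeon.aspm=0"
print(radeon_params(sample))
```

On a running system the input string would come from /proc/cmdline.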

disabling HYPERZ also doesn't help (afaik it helped Unreal Tournament 3 to run flawlessly)


already thought about turning on some more debug info or trying out additional switches, but didn't find anything relevant while going through the bugtracker


help ?


thanks for your hard work - the driver keeps on getting better and better !
Comment 3 jackdachef 2014-01-19 01:15:03 UTC
http://pastebin.com/Nwh6gp32

<== /var/log/Xorg.0.log


no xorg.conf is being used as far as I can see
Comment 4 jackdachef 2014-01-19 01:22:41 UTC
to clarify "switching to VT doesn't seem to make a change":

the monitor stays in standby mode/black (off, *not* on - so *not* simply blanking)

had related issues where switching to VT and back revived the screen-output

but this is persistent: the box has to be rebooted and/or turned off to be usable again


@hej@synack.fo:

does this exactly describe what you're also seeing/experiencing ?
Comment 5 jackdachef 2014-01-19 01:43:26 UTC
Created attachment 92367 [details]
lspci output

ok, so it's not limited to AMD or Intel motherboards solely :/



Rayman Origins works fine on Windows 8/8.1 with catalyst drivers
Comment 6 jackdachef 2014-01-19 03:27:54 UTC
correction:

the sound of the game stops pretty soon after the gpu has locked up

also - got some news (recompiled the kernel, removed some potential config entries that also could lead to trouble by themselves according to bug reports + added more debug info and got something to investigate):

Jan 19 03:35:51 lupus kernel: [   35.941047] input: HDA ATI HDMI HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.0/0000:01:00.1/sound/card1/input15
Jan 19 03:35:51 lupus kernel: [   36.414164] [drm] radeon kernel modesetting enabled.
Jan 19 03:35:51 lupus kernel: [   36.414437] [drm] initializing kernel modesetting (CYPRESS 0x1002:0x6899 0x1002:0x0B00).
Jan 19 03:35:51 lupus kernel: [   36.414468] [drm] register mmio base: 0xFBCE0000
Jan 19 03:35:51 lupus kernel: [   36.414469] [drm] register mmio size: 131072
Jan 19 03:35:51 lupus kernel: [   36.414518] ATOM BIOS: CYPRESS
Jan 19 03:35:51 lupus kernel: [   36.414589] radeon 0000:01:00.0: VRAM: 1024M 0x0000000000000000 - 0x000000003FFFFFFF (1024M used)
Jan 19 03:35:51 lupus kernel: [   36.414591] radeon 0000:01:00.0: GTT: 1024M 0x0000000040000000 - 0x000000007FFFFFFF
Jan 19 03:35:51 lupus kernel: [   36.414592] [drm] Detected VRAM RAM=1024M, BAR=256M
Jan 19 03:35:51 lupus kernel: [   36.414593] [drm] RAM width 256bits DDR
Jan 19 03:35:51 lupus kernel: [   36.414638] [TTM] Zone  kernel: Available graphics memory: 4085752 kiB
Jan 19 03:35:51 lupus kernel: [   36.414640] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
Jan 19 03:35:51 lupus kernel: [   36.414641] [TTM] Initializing pool allocator
Jan 19 03:35:51 lupus kernel: [   36.414644] [TTM] Initializing DMA pool allocator
Jan 19 03:35:51 lupus kernel: [   36.414658] [drm] radeon: 1024M of VRAM memory ready
Jan 19 03:35:51 lupus kernel: [   36.414659] [drm] radeon: 1024M of GTT memory ready.
Jan 19 03:35:51 lupus kernel: [   36.455915] [drm] GART: num cpu pages 262144, num gpu pages 262144
Jan 19 03:35:51 lupus kernel: [   36.456953] [drm] enabling PCIE gen 2 link speeds, disable with radeon.pcie_gen2=0
Jan 19 03:35:51 lupus kernel: [   36.464686] [drm] Loading CYPRESS Microcode
Jan 19 03:35:51 lupus kernel: [   36.556155] [drm] PCIE GART of 1024M enabled (table at 0x000000000025D000).
Jan 19 03:35:51 lupus kernel: [   36.556279] radeon 0000:01:00.0: WB enabled
Jan 19 03:35:51 lupus kernel: [   36.556281] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000040000c00 and cpu addr 0xffff8800ba7f9c00
Jan 19 03:35:51 lupus kernel: [   36.556283] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000040000c0c and cpu addr 0xffff8800ba7f9c0c
Jan 19 03:35:51 lupus kernel: [   36.556877] radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x000000000005c418 and cpu addr 0xffffc9000411c418
Jan 19 03:35:51 lupus kernel: [   36.556878] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
Jan 19 03:35:51 lupus kernel: [   36.556879] [drm] Driver supports precise vblank timestamp query.
Jan 19 03:35:51 lupus kernel: [   36.556895] radeon 0000:01:00.0: irq 57 for MSI/MSI-X
Jan 19 03:35:51 lupus kernel: [   36.556904] radeon 0000:01:00.0: radeon: using MSI.
Jan 19 03:35:51 lupus kernel: [   36.556928] [drm] radeon: irq initialized.
Jan 19 03:35:51 lupus kernel: [   36.573355] [drm] ring test on 0 succeeded in 1 usecs
Jan 19 03:35:51 lupus kernel: [   36.573414] [drm] ring test on 3 succeeded in 1 usecs
Jan 19 03:35:51 lupus kernel: [   36.756639] [drm] ring test on 5 succeeded in 1 usecs
Jan 19 03:35:51 lupus kernel: [   36.756644] [drm] UVD initialized successfully.
Jan 19 03:35:51 lupus kernel: [   36.756822] [drm] Enabling audio 0 support
Jan 19 03:35:51 lupus kernel: [   36.756861] [drm] ib test on ring 0 succeeded in 0 usecs
Jan 19 03:35:51 lupus kernel: [   36.756892] [drm] ib test on ring 3 succeeded in 0 usecs
Jan 19 03:35:51 lupus kernel: [   36.905546] [drm] ib test on ring 5 succeeded
Jan 19 03:35:51 lupus kernel: [   36.906063] [drm] Radeon Display Connectors
Jan 19 03:35:51 lupus kernel: [   36.906065] [drm] Connector 0:
Jan 19 03:35:51 lupus kernel: [   36.906067] [drm]   DP-1
Jan 19 03:35:51 lupus kernel: [   36.906068] [drm]   HPD4
Jan 19 03:35:51 lupus kernel: [   36.906071] [drm]   DDC: 0x6430 0x6430 0x6434 0x6434 0x6438 0x6438 0x643c 0x643c
Jan 19 03:35:51 lupus kernel: [   36.906072] [drm]   Encoders:
Jan 19 03:35:51 lupus kernel: [   36.906074] [drm]     DFP1: INTERNAL_UNIPHY2
Jan 19 03:35:51 lupus kernel: [   36.906075] [drm] Connector 1:
Jan 19 03:35:51 lupus kernel: [   36.906076] [drm]   HDMI-A-1
Jan 19 03:35:51 lupus kernel: [   36.906078] [drm]   HPD5
Jan 19 03:35:51 lupus kernel: [   36.906080] [drm]   DDC: 0x6460 0x6460 0x6464 0x6464 0x6468 0x6468 0x646c 0x646c
Jan 19 03:35:51 lupus kernel: [   36.906081] [drm]   Encoders:
Jan 19 03:35:51 lupus kernel: [   36.906082] [drm]     DFP2: INTERNAL_UNIPHY2
Jan 19 03:35:51 lupus kernel: [   36.906083] [drm] Connector 2:
Jan 19 03:35:51 lupus kernel: [   36.906084] [drm]   DVI-I-1
Jan 19 03:35:51 lupus kernel: [   36.906086] [drm]   HPD1
Jan 19 03:35:51 lupus kernel: [   36.906087] [drm]   DDC: 0x6450 0x6450 0x6454 0x6454 0x6458 0x6458 0x645c 0x645c
Jan 19 03:35:51 lupus kernel: [   36.906088] [drm]   Encoders:
Jan 19 03:35:51 lupus kernel: [   36.906088] [drm]     DFP3: INTERNAL_UNIPHY1
Jan 19 03:35:51 lupus kernel: [   36.906090] [drm]     CRT2: INTERNAL_KLDSCP_DAC2
Jan 19 03:35:51 lupus kernel: [   36.906090] [drm] Connector 3:
Jan 19 03:35:51 lupus kernel: [   36.906091] [drm]   DVI-I-2
Jan 19 03:35:51 lupus kernel: [   36.906092] [drm]   HPD6
Jan 19 03:35:51 lupus kernel: [   36.906093] [drm]   DDC: 0x6470 0x6470 0x6474 0x6474 0x6478 0x6478 0x647c 0x647c
Jan 19 03:35:51 lupus kernel: [   36.906094] [drm]   Encoders:
Jan 19 03:35:51 lupus kernel: [   36.906095] [drm]     DFP4: INTERNAL_UNIPHY
Jan 19 03:35:51 lupus kernel: [   36.906096] [drm]     CRT1: INTERNAL_KLDSCP_DAC1
Jan 19 03:35:51 lupus kernel: [   36.906140] [drm] Internal thermal controller with fan control
Jan 19 03:35:51 lupus kernel: [   36.906168] == power state 0 ==
Jan 19 03:35:51 lupus kernel: [   36.906170] 	ui class: none
Jan 19 03:35:51 lupus kernel: [   36.906172] 	internal class: boot 
Jan 19 03:35:51 lupus kernel: [   36.906174] 	caps: video 
Jan 19 03:35:51 lupus kernel: [   36.906176] 	uvd    vclk: 0 dclk: 0
Jan 19 03:35:51 lupus kernel: [   36.906178] 		power level 0    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 0
Jan 19 03:35:51 lupus kernel: [   36.906180] 		power level 1    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 0
Jan 19 03:35:51 lupus kernel: [   36.906181] 		power level 2    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 0
Jan 19 03:35:51 lupus kernel: [   36.906182] 	status: c r b 
Jan 19 03:35:51 lupus kernel: [   36.906186] == power state 1 ==
Jan 19 03:35:51 lupus kernel: [   36.906187] 	ui class: performance
Jan 19 03:35:51 lupus kernel: [   36.906188] 	internal class: none
Jan 19 03:35:51 lupus kernel: [   36.906189] 	caps: single_disp video 
Jan 19 03:35:51 lupus kernel: [   36.906191] 	uvd    vclk: 0 dclk: 0
Jan 19 03:35:51 lupus kernel: [   36.906192] 		power level 0    sclk: 15700 mclk: 30000 vddc: 950 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906193] 		power level 1    sclk: 55000 mclk: 90000 vddc: 1038 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906194] 		power level 2    sclk: 72500 mclk: 100000 vddc: 1088 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906195] 	status: 
Jan 19 03:35:51 lupus kernel: [   36.906196] == power state 2 ==
Jan 19 03:35:51 lupus kernel: [   36.906197] 	ui class: none
Jan 19 03:35:51 lupus kernel: [   36.906198] 	internal class: uvd 
Jan 19 03:35:51 lupus kernel: [   36.906199] 	caps: video 
Jan 19 03:35:51 lupus kernel: [   36.906201] 	uvd    vclk: 54000 dclk: 40000
Jan 19 03:35:51 lupus kernel: [   36.906202] 		power level 0    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906203] 		power level 1    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906204] 		power level 2    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906205] 	status: 
Jan 19 03:35:51 lupus kernel: [   36.906206] == power state 3 ==
Jan 19 03:35:51 lupus kernel: [   36.906206] 	ui class: performance
Jan 19 03:35:51 lupus kernel: [   36.906207] 	internal class: none
Jan 19 03:35:51 lupus kernel: [   36.906208] 	caps: video 
Jan 19 03:35:51 lupus kernel: [   36.906210] 	uvd    vclk: 0 dclk: 0
Jan 19 03:35:51 lupus kernel: [   36.906211] 		power level 0    sclk: 40000 mclk: 100000 vddc: 1000 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906212] 		power level 1    sclk: 55000 mclk: 100000 vddc: 1038 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906213] 		power level 2    sclk: 72500 mclk: 100000 vddc: 1088 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906214] 	status: 
Jan 19 03:35:51 lupus kernel: [   36.906215] == power state 4 ==
Jan 19 03:35:51 lupus kernel: [   36.906216] 	ui class: none
Jan 19 03:35:51 lupus kernel: [   36.906217] 	internal class: acpi 
Jan 19 03:35:51 lupus kernel: [   36.906218] 	caps: 
Jan 19 03:35:51 lupus kernel: [   36.906219] 	uvd    vclk: 0 dclk: 0
Jan 19 03:35:51 lupus kernel: [   36.906220] 		power level 0    sclk: 15700 mclk: 30000 vddc: 950 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906221] 		power level 1    sclk: 15700 mclk: 30000 vddc: 950 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906222] 		power level 2    sclk: 15700 mclk: 30000 vddc: 950 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.906223] 	status: 
Jan 19 03:35:51 lupus kernel: [   36.911429] switching from power state:
Jan 19 03:35:51 lupus kernel: [   36.911431] 	ui class: none
Jan 19 03:35:51 lupus kernel: [   36.911432] 	internal class: boot 
Jan 19 03:35:51 lupus kernel: [   36.911433] 	caps: video 
Jan 19 03:35:51 lupus kernel: [   36.911435] 	uvd    vclk: 0 dclk: 0
Jan 19 03:35:51 lupus kernel: [   36.911437] 		power level 0    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 0
Jan 19 03:35:51 lupus kernel: [   36.911438] 		power level 1    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 0
Jan 19 03:35:51 lupus kernel: [   36.911440] 		power level 2    sclk: 40000 mclk: 90000 vddc: 1000 vddci: 0
Jan 19 03:35:51 lupus kernel: [   36.911441] 	status: c b 
Jan 19 03:35:51 lupus kernel: [   36.911443] switching to power state:
Jan 19 03:35:51 lupus kernel: [   36.911444] 	ui class: performance
Jan 19 03:35:51 lupus kernel: [   36.911445] 	internal class: none
Jan 19 03:35:51 lupus kernel: [   36.911446] 	caps: single_disp video 
Jan 19 03:35:51 lupus kernel: [   36.911449] 	uvd    vclk: 0 dclk: 0
Jan 19 03:35:51 lupus kernel: [   36.911450] 		power level 0    sclk: 15700 mclk: 30000 vddc: 950 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.911452] 		power level 1    sclk: 55000 mclk: 90000 vddc: 1038 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.911453] 		power level 2    sclk: 72500 mclk: 100000 vddc: 1088 vddci: 1100
Jan 19 03:35:51 lupus kernel: [   36.911454] 	status: r 
Jan 19 03:35:51 lupus kernel: [   36.917489] [drm] radeon: dpm initialized
Jan 19 03:35:51 lupus kernel: [   37.026943] [drm] fb mappable at 0xD045E000
Jan 19 03:35:51 lupus kernel: [   37.026945] [drm] vram apper at 0xD0000000
Jan 19 03:35:51 lupus kernel: [   37.026945] [drm] size 8294400
Jan 19 03:35:51 lupus kernel: [   37.026946] [drm] fb depth is 24
Jan 19 03:35:51 lupus kernel: [   37.026947] [drm]    pitch is 7680
Jan 19 03:35:51 lupus kernel: [   37.027186] fbcon: radeondrmfb (fb0) is primary device
Jan 19 03:35:51 lupus kernel: [   37.071960] Console: switching to colour frame buffer device 240x67
Jan 19 03:35:51 lupus kernel: [   37.075808] radeon 0000:01:00.0: fb0: radeondrmfb frame buffer device
Jan 19 03:35:51 lupus kernel: [   37.075809] radeon 0000:01:00.0: registered panic notifier
Jan 19 03:35:51 lupus kernel: [   37.075813] [drm] Initialized radeon 2.36.0 20080528 for 0000:01:00.0 on minor 0 
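
A note on reading the power state table above: as far as I know, the radeon driver prints sclk and mclk in units of 10 kHz and vddc/vddci in mV, so "sclk: 72500 mclk: 100000" would correspond to 725 MHz core and 1000 MHz memory. Below is a small Python sketch that converts one such line under that assumption. The function name `parse_power_level` and the dictionary keys are hypothetical.

```python
import re

LEVEL_RE = re.compile(
    r"power level (\d+)\s+sclk: (\d+) mclk: (\d+) vddc: (\d+) vddci: (\d+)"
)


def parse_power_level(line):
    """Parse one 'power level' log line; clocks are assumed to be in 10 kHz units."""
    m = LEVEL_RE.search(line)
    if m is None:
        return None
    level, sclk, mclk, vddc, vddci = (int(g) for g in m.groups())
    return {
        "level": level,
        "sclk_mhz": sclk / 100,  # 10 kHz units -> MHz
        "mclk_mhz": mclk / 100,
        "vddc_mv": vddc,
        "vddci_mv": vddci,
    }


line = "power level 2    sclk: 72500 mclk: 100000 vddc: 1088 vddci: 1100"
print(parse_power_level(line))
```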


[snip]



Jan 19 04:15:33 lupus kernel: [ 2455.325804] radeon 0000:01:00.0: GPU lockup CP stall for more than 10066msec
Jan 19 04:15:33 lupus kernel: [ 2455.325809] radeon 0000:01:00.0: GPU lockup (waiting for 0x000000000002feaa last fence id 0x000000000002fea7 on ring 0)
Jan 19 04:15:33 lupus kernel: [ 2455.325827] [drm:rv770_stop_dpm] *ERROR* Could not force DPM to low.
Jan 19 04:15:33 lupus kernel: [ 2455.325833] [drm] Disabling audio 0 support
Jan 19 04:15:33 lupus kernel: [ 2455.332730] BUG: unable to handle kernel paging request at ffffc90404980ffc
Jan 19 04:15:33 lupus kernel: [ 2455.332778] IP: [<ffffffffa03d931a>] radeon_ring_backup+0xda/0x140 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.332835] PGD 23e80e067 PUD 0 
Jan 19 04:15:33 lupus kernel: [ 2455.332862] Oops: 0000 [#1] PREEMPT SMP 
Jan 19 04:15:33 lupus kernel: [ 2455.332895] Modules linked in: snd_usb_audio snd_usbmidi_lib snd_rawmidi snd_seq_device xt_owner it87 hwmon_vid ip6t_REJECT xt_hl ip6t_rt ipt_REJECT xt_comment xt_LOG xt_recent xt_limit xt_addrtype xt_conntrack ip6table_filter nf_conntrack_netbios_ns nf_conntrack_broadcast nf_nat_ftp nf_nat nf_conntrack_ftp iptable_filter radeon cfbfillrect cfbimgblt snd_hda_codec_hdmi snd_hda_codec_realtek cfbcopyarea i2c_algo_bit ttm snd_hda_intel drm_kms_helper snd_hda_codec e1000e drm snd_hwdep snd_pcm snd_page_alloc ptp pps_core i2c_i801 coretemp snd_timer wmi snd soundcore e1000 zfs(PO) zcommon(PO) znvpair(PO) zavl(PO) zunicode(PO) spl(O) lockd sunrpc sl811_hcd xhci_hcd usb_storage [last unloaded: nfs]
Jan 19 04:15:33 lupus kernel: [ 2455.333392] CPU: 3 PID: 14551 Comm: UT3.exe Tainted: P           O 3.13.0-rc8 #6
Jan 19 04:15:33 lupus kernel: [ 2455.333431] Hardware name: Packard Bell ipower G3710/FMP55, BIOS P01-B2C0 03/29/2010
Jan 19 04:15:33 lupus kernel: [ 2455.333474] task: ffff8801243c0000 ti: ffff88000ed86000 task.ti: ffff88000ed86000
Jan 19 04:15:33 lupus kernel: [ 2455.333512] RIP: 0010:[<ffffffffa03d931a>]  [<ffffffffa03d931a>] radeon_ring_backup+0xda/0x140 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.333573] RSP: 0018:ffff88000ed87c38  EFLAGS: 00010257
Jan 19 04:15:33 lupus kernel: [ 2455.333605] RAX: 00000000000039c4 RBX: 0000000000000000 RCX: 0000000000000000
Jan 19 04:15:33 lupus kernel: [ 2455.333644] RDX: 0000000000000002 RSI: ffff8801b28b4000 RDI: ffffc90404980ffc
Jan 19 04:15:33 lupus kernel: [ 2455.333682] RBP: ffff88000ed87c68 R08: 00000000ffffffff R09: ffffea0005f0e840
Jan 19 04:15:33 lupus kernel: [ 2455.333722] R10: ffff88023fbf9dc0 R11: ffff880233709810 R12: ffff880231248f20
Jan 19 04:15:33 lupus kernel: [ 2455.333758] R13: ffff880231248ef8 R14: ffff88000ed87ca8 R15: 0000000000000e71
Jan 19 04:15:33 lupus kernel: [ 2455.333798] FS:  0000000081fb0000(0063) GS:ffff88023fcc0000(006b) knlGS:00000000202ffb40
Jan 19 04:15:33 lupus kernel: [ 2455.333839] CS:  0010 DS: 002b ES: 002b CR0: 0000000080050033
Jan 19 04:15:33 lupus kernel: [ 2455.333868] CR2: ffffc90404980ffc CR3: 000000000d762000 CR4: 00000000000007e0
Jan 19 04:15:33 lupus kernel: [ 2455.333907] DR0: 0000000000000045 DR1: 0000000000000000 DR2: 0000000000000000
Jan 19 04:15:33 lupus kernel: [ 2455.333942] DR3: 0000000000000005 DR6: 00000000ffff0ff0 DR7: 0000000000000600
Jan 19 04:15:33 lupus kernel: [ 2455.333981] Stack:
Jan 19 04:15:33 lupus kernel: [ 2455.333994]  ffff880231248000 ffff880231248000 0000000000000000 ffff880231248018
Jan 19 04:15:33 lupus kernel: [ 2455.334043]  0000000000000000 ffff88000ed87db0 ffff88000ed87d08 ffffffffa03aa4f2
Jan 19 04:15:33 lupus kernel: [ 2455.334094]  000000000ed87cd8 ffff880231248700 00000000ffffffdd 00000001243c0000
Jan 19 04:15:33 lupus kernel: [ 2455.334147] Call Trace:
Jan 19 04:15:33 lupus kernel: [ 2455.334171]  [<ffffffffa03aa4f2>] radeon_gpu_reset+0xa2/0x260 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.334212]  [<ffffffff8188f4d6>] ? _raw_spin_lock+0x16/0x40
Jan 19 04:15:33 lupus kernel: [ 2455.334247]  [<ffffffff8188f4d6>] ? _raw_spin_lock+0x16/0x40
Jan 19 04:15:33 lupus kernel: [ 2455.334278]  [<ffffffff8188f161>] ? _raw_spin_unlock+0x11/0x30
Jan 19 04:15:33 lupus kernel: [ 2455.334325]  [<ffffffffa03d7339>] radeon_gem_handle_lockup.part.5+0x9/0x20 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.334377]  [<ffffffffa03d7df8>] radeon_gem_wait_idle_ioctl+0xa8/0x100 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.334426]  [<ffffffffa02b5669>] drm_ioctl+0x529/0x660 [drm]
Jan 19 04:15:33 lupus kernel: [ 2455.334469]  [<ffffffffa03d7d50>] ? radeon_gem_busy_ioctl+0x120/0x120 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.334514]  [<ffffffff815d457b>] ? rpm_resume+0x1bb/0x670
Jan 19 04:15:33 lupus kernel: [ 2455.334546]  [<ffffffff81116120>] ? do_futex+0x100/0xb10
Jan 19 04:15:33 lupus kernel: [ 2455.334583]  [<ffffffffa03a807c>] radeon_drm_ioctl+0x5c/0xa0 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.334632]  [<ffffffffa0477a0f>] radeon_kms_compat_ioctl+0xf/0x30 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.334672]  [<ffffffff8120abd6>] compat_sys_ioctl+0x86/0x12a0
Jan 19 04:15:33 lupus kernel: [ 2455.334704]  [<ffffffff81116fc0>] ? compat_SyS_futex+0x90/0x1a0
Jan 19 04:15:33 lupus kernel: [ 2455.334741]  [<ffffffff810ce52c>] ? posix_ktime_get_ts+0xc/0x10
Jan 19 04:15:33 lupus kernel: [ 2455.334775]  [<ffffffff810cf899>] ? SyS_clock_gettime+0x49/0x70
Jan 19 04:15:33 lupus kernel: [ 2455.334810]  [<ffffffff81121b24>] ? compat_sys_clock_gettime+0x74/0x90
Jan 19 04:15:33 lupus kernel: [ 2455.334850]  [<ffffffff818919b5>] sysenter_dispatch+0x7/0x1a
Jan 19 04:15:33 lupus kernel: [ 2455.334878] Code: 74 5f 41 8d 47 ff 31 c9 48 8d 04 85 04 00 00 00 eb 0a 0f 1f 80 00 00 00 00 49 8b 36 49 8b 7c 24 08 41 89 d8 83 c3 01 4a 8d 3c 87 <8b> 3f 89 3c 0e 48 83 c1 04 41 23 5c 24 64 48 39 c1 75 db 4c 89 
Jan 19 04:15:33 lupus kernel: [ 2455.335183] RIP  [<ffffffffa03d931a>] radeon_ring_backup+0xda/0x140 [radeon]
Jan 19 04:15:33 lupus kernel: [ 2455.335234]  RSP <ffff88000ed87c38>
Jan 19 04:15:33 lupus kernel: [ 2455.335255] CR2: ffffc90404980ffc
Jan 19 04:15:33 lupus kernel: [ 2455.347643] ---[ end trace bb19deeaf6c5d792 ]---
Jan 19 04:15:55 lupus kernel: [ 2477.967242] SysRq : Keyboard mode set to system default
Jan 19 04:15:57 lupus kernel: [ 2479.276437]  ffff8802335b2ac0 00000000000130c0 ffff88000dc0dfd8 00000000000130c0
Jan 19 04:15:57 lupus kernel: [ 2479.276442]  ffff88023e8d0e40 ffff8802335b2ac0 ffff88000dc0dd58 ffff88001db44900
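
The second "GPU lockup" line above names the fence the driver was waiting for and the last fence id it had seen on ring 0. Below is a rough Python sketch for pulling those values out of a kern.log line. `parse_lockup` is a hypothetical helper, and reading the difference as "fences still outstanding" is my interpretation of the message, not something the log itself states.

```python
import re

LOCKUP_RE = re.compile(
    r"GPU lockup \(waiting for (0x[0-9a-fA-F]+) "
    r"last fence id (0x[0-9a-fA-F]+) on ring (\d+)\)"
)


def parse_lockup(line):
    """Extract the fence ids and ring number from a radeon 'GPU lockup' line."""
    m = LOCKUP_RE.search(line)
    if m is None:
        return None
    waiting = int(m.group(1), 16)
    last = int(m.group(2), 16)
    return {
        "ring": int(m.group(3)),
        "waiting": waiting,
        "last": last,
        "outstanding": waiting - last,  # assumed meaning: fences not yet reached
    }


sample = (
    "Jan 19 04:15:33 lupus kernel: [ 2455.325809] radeon 0000:01:00.0: "
    "GPU lockup (waiting for 0x000000000002feaa last fence id "
    "0x000000000002fea7 on ring 0)"
)
print(parse_lockup(sample))
```

The "CP stall for more than ...msec" line has a different shape and is deliberately not matched by this pattern.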
Comment 8 jackdachef 2014-01-19 03:44:24 UTC
Created attachment 92373 [details]
r600g_ut3_blackscreen-after-minutes_3.13-rc8+

using ZFS on /home, so unfortunately can't try/provide traces or kernel messages without ZFS/being tainted

but before migrating to ZFS the same behavior occurred - so it should be pretty representative
Comment 9 jackdachef 2014-01-19 14:55:16 UTC
Created attachment 92400 [details]
r600g_full-debug-02_rayman-origins_no-info

compiled a full-debug kernel and wanted to run UT3 again:

the installation is borked (had this issue in the past, too, with these kinds of lockups & magic sysrq usage) - fortunately Rayman Origins still works

after a few minutes of gaming (around 5) the screen turned black, too (around 830 seconds)

but there was no useful information in /var/log/kern.log (had radeon.audio=0 radeon.aspm=0 appended to the kernel command line this time, since there was some issue with pulseaudio claiming the radeon HDMI audio device as first & pulseaudio crashing; can this alter the kernel output and make it worthless ? )



so there might be a common issue in the background when opengl and/or directx via wine is used

but it only shows info with e.g. UT3


will try to get UT3 running again and see whether addr2line shows something (with the last partial debug kernel it didn't show anything)
Comment 10 jackdachef 2014-01-19 17:11:35 UTC
tried out drm-next-3.14 (merged it manually into 3.13-rc8) with radeon.hard_reset=1 radeon.audio=0 radeon.aspm=0

and launched UT3/Steam with R600_DEBUG=sb,nohyperz,sbsafemath,nollvm


it only took 1-2 minutes

the first thing that stopped was sound [like before], then the content of the monitor was shown [freeze-frame] for several seconds (or even half a minute)

then the monitor turned off (standby) again

box was hardlocked and didn't react to Magic SYSRQ Key


after turning the box off and on only the fans + harddrives started up - screen stayed black and no BIOS-beep/POST (this happens from time to time on this box and usually the BIOS posts normally after the next cold-boot, but it so far never has happened after a hardlock :/ )
Comment 11 jackdachef 2014-01-19 20:46:20 UTC
Created attachment 92411 [details]
r600g_ut3_blackscreen-3.13-rc8 debug

went back to an almost vanilla kernel of 3.13-rc8 with debug

a few moments after the start it already locked up

sound issues don't seem related or to be in connection with the lockup of the gpu (most probably when the sound stops pulseaudio crashed) - this time sound continued until magic sysrq key + e was issued


addr2line again didn't show anything:

??:0
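
One possible reason for the `??:0`: the faulting address in the oops (ffffffffa03d931a) is a runtime address inside the loaded radeon module, which a lookup against vmlinux cannot resolve. A commonly used alternative is to hand the symbol+offset pair from the oops to gdb, pointed at the module object built with debug info. The Python sketch below only parses a frame and builds that command string; it does not run gdb. The names `Frame`, `parse_frame` and `gdb_command` are hypothetical, and the `radeon.ko` path is an assumption that depends on where the module was built.

```python
import re
from typing import NamedTuple, Optional


class Frame(NamedTuple):
    symbol: str
    offset: int
    size: int
    module: Optional[str]


FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+([\w.]+)\+0x([0-9a-f]+)/0x([0-9a-f]+)(?:\s+\[(\w+)\])?"
)


def parse_frame(line):
    """Parse 'symbol+0xoff/0xsize [module]' out of an oops or call trace line."""
    m = FRAME_RE.search(line)
    if m is None:
        return None
    return Frame(m.group(1), int(m.group(2), 16), int(m.group(3), 16), m.group(4))


def gdb_command(frame, module_path=None):
    """Build a gdb invocation that lists the source around symbol+offset."""
    default = f"{frame.module}.ko" if frame.module else "vmlinux"
    path = module_path or default
    return f"gdb -batch -ex 'list *({frame.symbol}+{frame.offset:#x})' {path}"


line = "RIP  [<ffffffffa03d931a>] radeon_ring_backup+0xda/0x140 [radeon]"
print(gdb_command(parse_frame(line)))
```

Frames marked with "?" in the call trace are unreliable entries and are skipped by this pattern.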
Comment 12 jackdachef 2014-01-19 20:54:58 UTC
this time Steam/UT3 were launched with

vblank_mode=0 R600_DEBUG=sb


so nohyperz, hyperz don't seem to make a difference in this case
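
To make the different runs easier to repeat, the environment for each launch can be assembled in one place. The following is a minimal Python sketch, assuming the game or Steam is started from a script; the helper name `mesa_debug_env` is made up, and the variable names and flag values are the ones used in this report.

```python
import os


def mesa_debug_env(r600_debug, vblank_mode=None, base=None):
    """Return a copy of the environment with R600_DEBUG (and vblank_mode) set."""
    env = dict(os.environ if base is None else base)
    env["R600_DEBUG"] = ",".join(r600_debug)
    if vblank_mode is not None:
        env["vblank_mode"] = str(vblank_mode)
    return env


# The two combinations tried so far in this report.
run_a = mesa_debug_env(["sb", "nohyperz", "sbsafemath", "nollvm"])
run_b = mesa_debug_env(["sb"], vblank_mode=0)
print(run_a["R600_DEBUG"], "|", run_b["R600_DEBUG"], run_b["vblank_mode"])
```

The resulting dictionary can be passed as `env=` to `subprocess.run` together with whatever command starts the game.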


@hej@synack.fo:

have you tried launching your games with R600_DEBUG=nohyperz (or R600_DEBUG=sb,nohyperz) ?

does that make a change ?
Comment 13 Christopher M. Penalver 2016-02-25 07:31:21 UTC
hej@synack.fo, Ubuntu 13.10 reached EOL on July 17, 2014. For more on this, please see https://wiki.ubuntu.com/Releases .

If this is reproducible in a supported release, it will help immensely if you file a new report with Ubuntu. Please ensure you have the package xdiagnose installed, and click the Yes button for attaching additional debugging information after running the following from a terminal:
ubuntu-bug xorg

Also, please feel free to subscribe me to it.

For more on why this is helpful, please see https://wiki.ubuntu.com/ReportingBugs.
