Bug 34359

Summary: Texture corruption and crashes on RV730 Pro AGP
Product: Mesa Reporter: Stephen Kitt <steve>
Component: Drivers/Gallium/r600Assignee: Default DRI bug account <dri-devel>
Status: RESOLVED FIXED QA Contact:
Severity: major    
Priority: medium    
Version: git   
Hardware: x86 (IA32)   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:

Description Stephen Kitt 2011-02-16 14:08:31 UTC
Hi,

(I'm reproducing this from http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=613596 as suggested by Julien Cristau.)

Hi,

Since upgrading to libgl1-mesa-dri 7.10 with the Gallium driver, GL applications (e.g. xmoto) either suffer texture corruption or cause GPU lockups, sometimes recoverable, sometimes not. I haven't been able to get a screen capture with texture corruption; in xmoto for instance the red buttons have white lines through them.

Whether the game works with texture corruption or causes GPU lockups seems to be decided at launch time, i.e. if the game launches it doesn't crash later, but if it causes a GPU lockup it never gets to display anything. Killing the game sometimes allows the system to recover; in other instances it's necessary to reboot.

I tried running with mesa rebuilt from git master, and encountered the same problems.

When the GPU locks up, the Oops is as follows:

Feb 16 07:07:35 heffalump kernel: [26468.968055] ------------[ cut here ]------------
Feb 16 07:07:35 heffalump kernel: [26468.968113] WARNING: at /usr/src/linux-2.6-2.6.37/debian/build/source_i386_none/drivers/gpu/drm/radeon/radeon_fence.c:244 radeon_fence_wait+0x1f3/0x281 [radeon]()
Feb 16 07:07:35 heffalump kernel: [26468.968117] Hardware name: To Be Filled By O.E.M.
Feb 16 07:07:35 heffalump kernel: [26468.968120] GPU lockup (waiting for 0x0001672D last fence id 0x0001672A)
Feb 16 07:07:35 heffalump kernel: [26468.968123] Modules linked in: isofs vboxnetadp vboxnetflt vboxdrv sco bnep rfcomm l2cap binfmt_misc microcode fuse ext4 jbd2 crc16 sha256_generic aes_i586 aes_generic cbc iTCO_wdt iTCO_vendor_support tcp_diag inet_diag autofs4 loop grip w83627hf hwmon_vid dm_crypt usblp cx22702 cx88_dvb cx88_vp3054_i2c videobuf_dvb dvb_core snd_cs4236 snd_wavefront rc_hauppauge_new snd_hda_codec_hdmi snd_hda_intel tuner_simple snd_hda_codec tuner_types snd_intel8x0 snd_wss_lib snd_ac97_codec snd_usb_audio snd_opl3_lib sam_ba btusb snd_hwdep tda9887 ac97_bus snd_mpu401 snd_mpu401_uart tda8290 snd_usbmidi_lib bluetooth snd_seq_midi cx88_alsa pwc snd_rawmidi snd_pcm_oss snd_mixer_oss rfkill usbserial snd_seq_midi_event tuner joydev snd_pcm snd_seq cx8800 ir_lirc_codec lirc_dev cx8802 snd_timer ir_sony_decoder ir_jvc_decoder ir_rc6_decoder snd_seq_device cx88xx ir_rc5_decoder v4l2_common ir_nec_decoder snd videodev ir_common ir_core psmouse v4l1_compat videobuf_dma_sg tveeprom videobuf_core tpm_tis
Feb 16 07:07:35 heffalump kernel: snd_page_alloc parport_pc soundcore tpm serio_raw tpm_bios btcx_risc pcspkr ns558 gameport parport rng_core evdev i2c_i801 processor shpchp pci_hotplug thermal_sys ext3 jbd mbcache dm_mod raid1 raid0 md_mod osst st hid_logitech ff_memless usbhid hid sg radeon sd_mod sr_mod ttm cdrom crc_t10dif aic7xxx drm_kms_helper ata_generic uhci_hcd aic79xx ehci_hcd scsi_transport_spi 3w_xxxx ata_piix drm libata firewire_ohci i2c_algo_bit usbcore scsi_mod skge i2c_core firewire_core power_supply crc_itu_t button nls_base [last unloaded: scsi_wait_scan]
Feb 16 07:07:35 heffalump kernel: [26468.968274] Pid: 10077, comm: xmoto Not tainted 2.6.37-trunk-686-bigmem #1
Feb 16 07:07:35 heffalump kernel: [26468.968277] Call Trace:
Feb 16 07:07:35 heffalump kernel: [26468.968286]  [<c1036005>] ? warn_slowpath_common+0x6a/0x7b
Feb 16 07:07:35 heffalump kernel: [26468.968310]  [<f8b56f1a>] ? radeon_fence_wait+0x1f3/0x281 [radeon]
Feb 16 07:07:35 heffalump kernel: [26468.968316]  [<c103607c>] ? warn_slowpath_fmt+0x28/0x2c
Feb 16 07:07:35 heffalump kernel: [26468.968343]  [<f8b56f1a>] ? radeon_fence_wait+0x1f3/0x281 [radeon]
Feb 16 07:07:35 heffalump kernel: [26468.968358]  [<c104b7a6>] ? autoremove_wake_function+0x0/0x29
Feb 16 07:07:35 heffalump kernel: [26468.968377]  [<f8a0e5a4>] ? ttm_bo_wait+0xaa/0x132 [ttm]
Feb 16 07:07:35 heffalump kernel: [26468.968396]  [<f8a10bf6>] ? ttm_bo_move_accel_cleanup+0x76/0x240 [ttm]
Feb 16 07:07:35 heffalump kernel: [26468.968424]  [<f8b575ed>] ? radeon_move_blit+0x14b/0x16b [radeon]
Feb 16 07:07:35 heffalump kernel: [26468.968448]  [<f8b5789d>] ? radeon_bo_move+0x19b/0x1f8 [radeon]
Feb 16 07:07:35 heffalump kernel: [26468.968472]  [<f8b57702>] ? radeon_bo_move+0x0/0x1f8 [radeon]
Feb 16 07:07:35 heffalump kernel: [26468.968486]  [<f8a0e951>] ? ttm_bo_handle_move_mem+0x198/0x259 [ttm]
Feb 16 07:07:35 heffalump kernel: [26468.968501]  [<f8a0fedd>] ? ttm_bo_move_buffer+0xa1/0xc6 [ttm]
Feb 16 07:07:35 heffalump kernel: [26468.968515]  [<f8a0ffab>] ? ttm_bo_validate+0xa9/0xec [ttm]
Feb 16 07:07:35 heffalump kernel: [26468.968540]  [<f8b58705>] ? radeon_bo_list_validate+0x7e/0xd1 [radeon]
Feb 16 07:07:35 heffalump kernel: [26468.968565]  [<f8b6a445>] ? radeon_cs_ioctl+0x9c/0x167 [radeon]
Feb 16 07:07:35 heffalump kernel: [26468.968585]  [<f876c4c5>] ? drm_ioctl+0x224/0x2d7 [drm]
Feb 16 07:07:35 heffalump kernel: [26468.968613]  [<f8b6a3a9>] ? radeon_cs_ioctl+0x0/0x167 [radeon]
Feb 16 07:07:35 heffalump kernel: [26468.968621]  [<c10aa9b0>] ? handle_mm_fault+0x479/0xaa0
Feb 16 07:07:35 heffalump kernel: [26468.968638]  [<f876c2a1>] ? drm_ioctl+0x0/0x2d7 [drm]
Feb 16 07:07:35 heffalump kernel: [26468.968643]  [<c10cd629>] ? do_vfs_ioctl+0x49e/0x4e9
Feb 16 07:07:35 heffalump kernel: [26468.968650]  [<c12a0efb>] ? do_page_fault+0x353/0x36f
Feb 16 07:07:35 heffalump kernel: [26468.968654]  [<c12a0ee8>] ? do_page_fault+0x340/0x36f
Feb 16 07:07:35 heffalump kernel: [26468.968658]  [<c10cd6b8>] ? sys_ioctl+0x44/0x64
Feb 16 07:07:35 heffalump kernel: [26468.968664]  [<c1008b1f>] ? sysenter_do_call+0x12/0x28
Feb 16 07:07:35 heffalump kernel: [26468.968668] ---[ end trace 7d34de4a0d796cfb ]---
Feb 16 07:07:35 heffalump kernel: [26468.968680] [drm] Disabling audio support
Feb 16 07:07:35 heffalump kernel: [26468.984595] radeon 0000:01:00.0: f6e76800 unpin not necessary
Feb 16 07:07:35 heffalump kernel: [26468.984601] radeon 0000:01:00.0: GPU softreset 
Feb 16 07:07:35 heffalump kernel: [26468.984605] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xF5703028
Feb 16 07:07:35 heffalump kernel: [26468.984609] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00330302
Feb 16 07:07:35 heffalump kernel: [26468.984613] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200000C0
Feb 16 07:07:35 heffalump kernel: [26468.984625] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
Feb 16 07:07:35 heffalump kernel: [26468.999513] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
Feb 16 07:07:36 heffalump kernel: [26469.015398] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0x00003028
Feb 16 07:07:36 heffalump kernel: [26469.015402] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000002
Feb 16 07:07:36 heffalump kernel: [26469.015406] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200000C0
Feb 16 07:07:36 heffalump kernel: [26469.016407] radeon 0000:01:00.0: GPU reset succeed
Feb 16 07:07:36 heffalump kernel: [26469.035388] radeon 0000:01:00.0: WB disabled
Feb 16 07:07:36 heffalump kernel: [26469.081355] [drm] ring test succeeded in 1 usecs
Feb 16 07:07:36 heffalump kernel: [26469.081370] [drm] ib test succeeded in 1 usecs
Feb 16 07:07:36 heffalump kernel: [26469.081373] [drm] Enabling audio support


Let me know if you need any more info, or if you want me to try patches etc.

This could be https://bugs.freedesktop.org/show_bug.cgi?id=31532 but since similar symptoms don't always imply the same bug, I'll leave it up to you experts!

Regards, and thanks for your time,

Stephen
Comment 1 Stephen Kitt 2011-06-18 06:17:34 UTC
This no longer happens with 7.10.3.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.