Bug 25544 - KMS/R200: No texture bound to unit 1
Summary: KMS/R200: No texture bound to unit 1
Status: RESOLVED FIXED
Alias: None
Product: Mesa
Classification: Unclassified
Component: Drivers/DRI/r200 (show other bugs)
Version: git
Hardware: x86 (IA32) Linux (All)
: medium normal
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
: 31138 (view as bug list)
Depends on:
Blocks:
 
Reported: 2009-12-09 14:29 UTC by Alan Swanson
Modified: 2010-11-30 11:30 UTC (History)
4 users (show)

See Also:
i915 platform:
i915 features:


Attachments
possible fix (643 bytes, patch)
2009-12-10 07:23 UTC, Roland Scheidegger
Details | Splinter Review
drm_r200_disable_texture_check.patch (412 bytes, patch)
2010-02-28 08:01 UTC, Alan Swanson
Details | Splinter Review
fix the drm CS checker (3.40 KB, patch)
2010-10-26 14:21 UTC, Alex Deucher
Details | Splinter Review

Description Alan Swanson 2009-12-09 14:29:27 UTC
On a R200 (Radeon 8500) using KMS with current mesa_7_7_branch git e3fa700c178e11e6735430119232919176ab7b42, libdrm 2.4.16 and linux 2.6.32 Compiz is near continuously logging the following error;

CS section size missmatch start at (r200_state_init.c,tex_emit_mm,703) 5 vs 9
CS section end at (r200_state_init.c,tex_emit_mm,719)

However on a rv250 (Mobility 9000) with exactly the same version there are no errors so maybe due to the original two texture units per pipeline? No errors in the Xorg.log or dmesg. And mismatch is spelt with one "s" too!
Comment 1 Roland Scheidegger 2009-12-10 07:23:35 UTC
Created attachment 31941 [details] [review]
possible fix

does this patch help?
As you suspect there are differences due to the two physical texture units on r200. Well, in fact there shouldn't be any differences due to that, but there are hardware bugs so need to emit state for texture units 0/1 pairwise (these workarounds are insufficient btw at least when using ati_fragment_shader, but that's another story).
I think there's an issue in the current code if neither a texture is bound to a unit, nor is it really enabled, in this case it will try to subtract the dwords it doesn't need for relocation twice, which can only happen with r200 (on rv250 it would never submit the state for this texture unit in the first place).
Comment 2 Alan Swanson 2009-12-10 12:43:09 UTC
Yes, that patch resolves the error on R200 with no problems on rv250.

It looks like the texture unit differences and hardware bugs cause other issues on R200 not present on rv250; dark and strobed textures with Compiz alt-tab and cube effects whilst running most games crash on a "drmRadeonCmdBuffer: -22" error with the following dmesg.

[drm:r100_cs_track_texture_check] *ERROR* No texture bound to unit 1
[drm:radeon_cs_ioctl] *ERROR* Invalid command stream !

I'll open another bug after a bit more testing as I've just started using KMS.
Comment 3 Roland Scheidegger 2009-12-10 14:49:42 UTC
(In reply to comment #2)
> Yes, that patch resolves the error on R200 with no problems on rv250.
> 
> It looks like the texture unit differences and hardware bugs cause other issues
> on R200 not present on rv250; dark and strobed textures with Compiz alt-tab and
> cube effects whilst running most games crash on a "drmRadeonCmdBuffer: -22"
> error with the following dmesg.
> 
> [drm:r100_cs_track_texture_check] *ERROR* No texture bound to unit 1
> [drm:radeon_cs_ioctl] *ERROR* Invalid command stream !
> 
> I'll open another bug after a bit more testing as I've just started using KMS.

Looks like the command verifier isn't getting along with fake enable of texture unit 1. So in some cases when unit 0 is active and 1 is not, we need to enable unit 1 too due to hw bug, but set to LOOKUP_DISABLE. Since that'll not actually sample from the "enabled" unit, that's just fine and it's not necessary a texture is bound on that unit. But cs_track_texture_check can't deal with that since it only sees unit 1 is enabled but no texture is actually bound and hence complains... 

I don't know how to fix this though, could either make the track_texture_check more clever or maybe the userspace driver should just bind some valid texture (maybe just take the one from unit 0?) so it doesn't look special from the kernel side.
Comment 4 Alex Deucher 2009-12-22 10:32:44 UTC
Roland, any reason not to apply the patch in comment 1?
Comment 5 Roland Scheidegger 2009-12-22 18:41:04 UTC
(In reply to comment #4)
> Roland, any reason not to apply the patch in comment 1?

No that's just fine. It just doesn't help much since you'll still get the cs verification failures (I believe you could also get these with rv250 not using that r200 workaround code, by using ati_fragment_shader and "disabled lookups" - I think the verifier needs some work, it also won't catch invalid textures if they happen in the first pass of ati_fragment_shader anyway probably).
Comment 6 Alan Swanson 2010-02-28 08:01:51 UTC
Created attachment 33643 [details] [review]
drm_r200_disable_texture_check.patch

Until R200 fake texture enable is supported by CS checker, R200 users can just disable texture checking with attached patch to actually have working 3D.
Comment 7 Alan Swanson 2010-10-26 12:03:21 UTC
*** Bug 31138 has been marked as a duplicate of this bug. ***
Comment 8 Daniel Richard G. 2010-10-26 12:41:59 UTC
Alan, can texture checking be disabled for R200 in production code? Building a patched kernel module isn't a feasible option for most users, and the effect of this bug is to make the 3D hardware basically unusable.

Was this same checking even done in the past? Before all the KMS upheaval, the R200 driver was rock-solid. (R200 was, I believe, the last Radeon release to have full 3D specs before the long dry spell that only recently ended.)
Comment 9 Alan Swanson 2010-10-26 13:04:11 UTC
Not as far as I'm aware unfortunately. Texture checking came in with the KMS rewrite IIRC.

(Updated bug summary to match problem.)
Comment 10 Daniel Richard G. 2010-10-26 13:46:12 UTC
I'm looking at r100_cs_track_texture_check() in linux-2.6.36/drivers/gpu/drm/radeon/r100.c (starting at line 3217). What if instead of an unconditional return, you were to do e.g.

	/* XXX: Disable texture check until R200 fake texture enable supported */
	if (rdev->family == CHIP_R200)
		return 0;

? Other parts of that function are already conditionalized based on chipset.
Comment 11 Alex Deucher 2010-10-26 14:21:18 UTC
Created attachment 39799 [details] [review]
fix the drm CS checker

This patch should fix the issue.  Please test and I'll get it upstream and into stable.
Comment 12 Nicholas Miell 2010-10-26 18:31:21 UTC
I've tested the patch and it works. Thanks.
Comment 13 Alex Deucher 2010-10-26 22:03:19 UTC
I've sent the patch to Dave for upstream inclusion.
Comment 14 Daniel Richard G. 2010-10-26 22:36:23 UTC
Confirmed here too! gltron now runs perfectly, as does OpenArena. Thank you, Alex, for pouncing on this one.

There's now a GPU-lockup problem (that existed before this fix) that I've reported as bug #31150.
Comment 15 madbiologist 2010-11-02 06:31:49 UTC
This fix has been released in kernel 2.6.37-rc1, and queued for stable.
Comment 16 Jared Clark 2010-11-30 08:04:23 UTC
I would like to know how I can get this resolved as well. I am having all the same issues with a R100, I have tried running the latest kernel from Ubuntu reps. 2.6.37 rc3. I do not really know how to incorporate the attached patch that you have said resolved the issue. Any help is greatly appreciated.
Comment 17 Alex Deucher 2010-11-30 08:35:02 UTC
(In reply to comment #16)
> I would like to know how I can get this resolved as well. I am having all the
> same issues with a R100, I have tried running the latest kernel from Ubuntu
> reps. 2.6.37 rc3. I do not really know how to incorporate the attached patch
> that you have said resolved the issue. Any help is greatly appreciated.

The patch is not relevant to r100.  If you are seeing the same symptom on r1xx, you are seeing a different issue.  Please open a new bug and include the drm error messages and information about how to reproduce the issue.
Comment 18 Roland Scheidegger 2010-11-30 08:58:22 UTC
(In reply to comment #17)
> (In reply to comment #16)
> > I would like to know how I can get this resolved as well. I am having all the
> > same issues with a R100, I have tried running the latest kernel from Ubuntu
> > reps. 2.6.37 rc3. I do not really know how to incorporate the attached patch
> > that you have said resolved the issue. Any help is greatly appreciated.
> 
> The patch is not relevant to r100.  If you are seeing the same symptom on r1xx,
> you are seeing a different issue.  Please open a new bug and include the drm
> error messages and information about how to reproduce the issue.

I think one issue the r100 texture tracking still has is that the tex_coord_type is not reset to 0 if a 2d texture is used after a cube map (only reset after a clear). Not sure if that's the issue here though...
Comment 19 Daniel Richard G. 2010-11-30 09:02:31 UTC
(In reply to comment #17)
> 
> The patch is not relevant to r100.  If you are seeing the same symptom on r1xx,
> you are seeing a different issue.  Please open a new bug and include the drm
> error messages and information about how to reproduce the issue.

I would also point out that 2.6.37rc3 already has Alex's fix for this issue.
Comment 20 Jared Clark 2010-11-30 11:30:02 UTC
Thank you for your prompt comments. Since it looks like I am dealing with a persistent bug on the R100, even using the latest from the xorg-edgers PPA and the 2.6.37-rc3, I will submit a new bug ticket with hopefully the important details.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.