Created attachment 19479 [details]
Platform: Montevina, Mccreary
Packages: 2008Q3RC2, 2008Q3RC3
Initiate several glxgears windowns (5 or more), and then maxsize one of more, switch them and wait, you will find the OS and X crashed. not any response.
Or you can move one of glxgears window quickly, the problem is the same.
Since this case is easily reproduced, I don't upload xorg.conf.
Created attachment 19480 [details]
Jiewen, can you reproduce this on G45 or GM45?
Quanxian, have you tried other platforms? I want to know if this is GM45/G45 specific.
With Packages 2008Q3RC2, 2008Q3RC3, I can't see "the OS and X crash" on our GM45.But when dragging one of them , last for several seconds, others application windows are freezing except the one dragged, and everything works well as soon as release you mouse button. Maxsizing one of more is the same as dragging, freezing happens "only in the course of your operation". After your operation finish, everything works well.
The phenomena descripted above happen Not only on the app glxgear, like arbfplight.
(In reply to comment #3)
> With Packages 2008Q3RC2, 2008Q3RC3, I can't see "the OS and X crash" on our
> GM45.But when dragging one of them , last for several seconds, others
> application windows are freezing except the one dragged, and everything works
> well as soon as release you mouse button. Maxsizing one of more is the same as
> dragging, freezing happens "only in the course of your operation". After your
> operation finish, everything works well.
Do you open 5 or more glxgears windows? one window can not reproduce this. Also when dragging the window, please quickly drag, and drag to every place for about 30 seconds. :)
we are trying on T61(965GM) to check if it is specific for GM45 / G45.
Crash issue? I saw a frozen screen with my 965GM.
What is the kernel version used in SLES_11_beta2
(In reply to comment #7)
> What is the kernel version used in SLES_11_beta2
2.6.27 rc6 (without any GEM patches).
Could you set vblank_mode to 0 in your ~/.drirc, such as
<device screen="0" driver="i965">
<option name="vblank_mode" value="0" />
We have tried this. After the configuration, it is OK. :). Seems it is vblank_mode problem.
So it turns out a vblank issue.
Jesse has vblank-rework in anholt's drm-intel-next kernel. So I guess we'll follow up there in Q4 release.
Gordon, you can close this bug.
Thanks for your help
(In reply to comment #12)
> Gordon, you can close this bug.
No. This _is_ a bug.
Ok, Eric and I spent the day playing with vblank. We've got fixes in the 2D driver, mesa, libdrm and the kernel. Would be nice to know whether this bug remains a problem now.
(In reply to comment #14)
> Ok, Eric and I spent the day playing with vblank. We've got fixes in the 2D
> driver, mesa, libdrm and the kernel. Would be nice to know whether this bug
> remains a problem now.
It is great.
I have checked the email. There are two email with vblank issue.
1) [Intel-gfx] Several vblank swapbuffers fixes
2) [PATCH] [drm/i915] Protect vblank IRQ regaccess with spinlock
Is still others ? or just use the branch. If so, please tell me which branch is touched.
The problem still is there. We just disable vblank. I don't make where your commit for vblank are for libdrm, mesa, 2d and kernel.
It could be that this isn't a full machine hang, but rather a deadlock in the DRM. You may be able to run a script in the background that waits a few seconds and then captures the dmesg after a sysrq-t trigger, something like:
$ sleep 30; echo t > /proc/sysrq-trigger; dmesg > dmesg.out; sync &
<reproduce hang, wait 30s before rebooting>
After the hang, network is broken. We can not ssh to the machine to get the information.
We have tried RC5 packages on T61, Mccreary and Montevina, all of them have such problem.
I upgrade the priority of this issue since it is critical for 3D. If it is not works, it will block Novell-SLED11 3D release for new platforms and old platforms.
We have tried this on G33, T61, Montevina and Mccreary. Every machines which we touched will have such problem.
Also I have tried the vblank patch for libdrm and mesa, not works. For drm patches, it is for gem. Novell just use no gem for kernel(Q3 final release).
Bug 17963, is also the problems initialized by glxgears (black screen).
Such problems will be a block path for 3D support.
I've posted a patch to intel-gfx and dri-devel for review which makes vblank work on my machine at least.
Ok. I have checked the email. Seem it is the email with title [Intel-gfx] [PATCH] Manage PIPESTAT pending interrupt values to unblock vblank interrupts
I will try this.
any information, I will report.
I have checked the content of patch. It has the big difference with Q3 release.
Drm package of Q3 release is from linux kernel. However I checked the contents, seems it is for drm-gem. Novell drm doesn't support drm-gem kernel.
Also the interface has been changed more.
Any comments for that?
Keith or Jesse, are you planning to provide fix for Gem-classic? Vblank issue seems the root cause for many current critical 3D issues.
Also from Eric's concern in Bug # 17963, the Vblank code path should be reverted for 855/865. Will you also include that fix together?
any progress for that?
I've now disabled VBlank (by default) for i965 for openSUSE 11.1/SLE11. Did
this already for i915 before (Bug #18041). So no more VBlank for intel.
Stefan, disabling vblank altogether is a pretty big hammer, since apps depend on it to draw without tearing. There's another patch which might help this bug at http://lists.freedesktop.org/archives/intel-gfx/2008-November/000614.html, you might want to give it a try.
As for the backporting question, no we weren't planning on doing the backport, but someone in the OSV team or at Novell probably could. There have been a lot of changes though, so it won't be trivial.
Quanxiang, can you try Eric's for-airlied tree? Keith's fixes are included there, along with some additional fixes that have occurred since then. If that works at least we'll have an idea of whether backporting is needed.
We am try to backporting you vbalnk packages. Keith patch is based on your packages and also some others are also based on yours or gem. Also we will have a try on the latest branch including for-airlied. Seems it is very hard for upstream to provide a patch based on Q3 release. :( . I know you are very busy for gem. Wish This way can help.
1) for-airlied works.
2) After packageing your vblank package and adding Keith patch for "[drm] Move drm vblank initialization/cleanup to driver load/unload", the glxgears window becomes black just as bug 17963.
By the way, I tested them in GM965(T61).
shall we close this since for-airlied works?
I don't think. We still not find the solution for this. We ever included Jesse vblank patch and plus Keith patch, glxgears still be hang. For branch of for-airlied, it is based on v2.6.28 kernel. We ever try to backporting to v2.6.27. However it is stopped by io-mapping. There are much dependency on new kernel.
Therefore we still need to find a solution for non-gem branch.
There is no way for us to backporting gem to v2.6.27 since novell will not change kernel.
Any idea for that?
I have ported your patches for vblank and plus Keith patch. It doesn't work on 965GM.
Also I get the information from Novell, if we don't provide the solution for this. They will disable vblank.
This is still the blocker issue for Novell if we want them to enable vblank.
Disabling vblank by default is fine; it just means users will see tearing in some cases, but shouldn't affect correctness otherwise.
Fixed upstream and worked around in SuSE by disabling vblank sync.