Bug 10467 - RC3, Intel and DPMS hang
Summary: RC3, Intel and DPMS hang
Status: RESOLVED FIXED
Alias: None
Product: xorg
Classification: Unclassified
Component: Driver/intel (show other bugs)
Version: 7.2 (2007.02)
Hardware: Other All
: medium normal
Assignee: Eric Anholt
QA Contact: Xorg Project Team
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2007-03-29 14:40 UTC by Samuel Thibault
Modified: 2008-03-11 16:16 UTC (History)
5 users (show)

See Also:
i915 platform:
i915 features:


Attachments
my xorg.conf (928 bytes, text/plain)
2007-03-29 14:42 UTC, Samuel Thibault
no flags Details
Typical Xorg log just before complete box hang (56.86 KB, text/plain)
2007-03-29 14:48 UTC, Samuel Thibault
no flags Details
my dmesg (16.72 KB, text/plain)
2007-04-02 12:19 UTC, Samuel Thibault
no flags Details
my /proc/interrupts (557 bytes, text/plain)
2007-04-02 12:19 UTC, Samuel Thibault
no flags Details
xorg.conf (driver 2.0.0) (4.12 KB, application/octet-stream)
2007-04-24 18:07 UTC, Conn
no flags Details
Xorg.0.log (driver 2.0.0, after GDM bug, but no hang) (54.72 KB, text/plain)
2007-04-24 18:08 UTC, Conn
no flags Details
dmesg (driver 2.0.0) (19.70 KB, text/plain)
2007-04-24 18:09 UTC, Conn
no flags Details
.xsession-errors relating to strange GDM behaviour (driver 2.0.0) (861 bytes, text/plain)
2007-04-24 18:10 UTC, Conn
no flags Details
Crash log (actually probably not related to this bug) (85.02 KB, text/plain)
2007-06-19 04:28 UTC, Samuel Thibault
no flags Details

Description Samuel Thibault 2007-03-29 14:40:33 UTC
I tried RC3 or X server and intel driver.  When X enters DPMS mode (either from xscreensaver or by hand with xset s activate), the machine hangs shortly.
Comment 1 Samuel Thibault 2007-03-29 14:42:58 UTC
Created attachment 9371 [details]
my xorg.conf
Comment 2 Samuel Thibault 2007-03-29 14:48:43 UTC
Created attachment 9372 [details]
Typical Xorg log just before complete box hang
Comment 3 Michel Dänzer 2007-03-30 00:45:15 UTC
(In reply to comment #0)
> When X enters DPMS mode (either from xscreensaver or by hand with xset s
> activate), the machine hangs shortly.

Hangs as in not pingable?

Are you sure xset s activate goes to DPMS directly?

Is anything 3D running in your session? Which version of libgl1-mesa-dri is installed?
Comment 4 Samuel Thibault 2007-03-30 14:58:18 UTC
not pingable indeed.

xset q reports   Standby: 1800    Suspend: 1800    Off: 1800, so I guess it directly enters "off" mode yes. I'll try to set different values for determining if one particular level breaks.

I indeed usually have a 3D application running (simply because there is a xscreensaver). It looks like it's only some GL savers that poses problem, because the hang is not systematic (even after seeing several GL savers running).

I have libgl1-mesa-{dri,glx} 6.5.2-3
Comment 5 Samuel Thibault 2007-03-30 16:41:44 UTC
I found a reproducable way to trigger it: I start xinit /usr/bin/fvwm, then run xscreensaver-demo, let it start the daemon, ask for Preview of the "providence" saver, and then via ssh run xset s activate. Note: xscrensaver's standby/suspend/off times are all the same.
Comment 6 Samuel Thibault 2007-03-30 16:43:52 UTC
Note: the hang is not immediate, it usually takes a few dozens of seconds (during which the GL saver is still active I guess?).
Comment 7 Samuel Thibault 2007-03-30 17:03:52 UTC
Actually, standby/suspend/off times don't need to be the same. Even when set to 10/20/30 minutes I get the crash.
Comment 8 Michel Dänzer 2007-04-02 04:39:14 UTC
Does this also happen with an older kernel, say 2.6.18?
Comment 9 Samuel Thibault 2007-04-02 12:18:12 UTC
I tried the debian 2.6.18-4-686 kernel with same result.  I also tried the git version, same result but I get some debugging message when starting the Providence saver:  do_wait: drmWaitVBlank returned -1, IRQs don't seem to be working correctly. Try running with LIBGL_THROTTLE_REFRESH and LIBL_SYNC_REFRESH unset.  In case it might be helpful, I'll attach dmesg + /proc/interrupts 
Comment 10 Samuel Thibault 2007-04-02 12:19:30 UTC
Created attachment 9439 [details]
my dmesg
Comment 11 Samuel Thibault 2007-04-02 12:19:59 UTC
Created attachment 9440 [details]
my /proc/interrupts
Comment 12 Samuel Thibault 2007-04-02 13:08:19 UTC
Actually, the debugging message only shows up with a 2.6.18 kernel, not with the 2.6.20 kernel
Comment 13 Michel Dänzer 2007-04-02 22:47:16 UTC
(In reply to comment #9)
> do_wait: drmWaitVBlank returned -1, IRQs don't seem to be
> working correctly. Try running with LIBGL_THROTTLE_REFRESH and
> LIBL_SYNC_REFRESH unset.

Hmm. Does this and/or the hang also happen if you set the driconf setting vblank_mode ('Synchronization with vertical refresh (swap intervals)') to 0 ('Never synchronize with vertical refresh, ignore application's choice')?
Comment 14 Samuel Thibault 2007-04-03 12:18:23 UTC
When doing this the warning goes away and I can't reproduce the hang indeed.
Comment 15 Samuel Thibault 2007-04-04 01:48:10 UTC
Oops, sorry, I spoke too fast. I still have the warning and the hang. 
Comment 16 Michel Dänzer 2007-04-08 07:16:41 UTC
Would the patch attached to bug 10546 happen to help?
Comment 17 Samuel Thibault 2007-04-09 16:44:15 UTC
Unfortunately it doesn't.  I'm still getting the warning and the hang. 
Comment 18 JM Ibanez 2007-04-10 04:10:36 UTC
On my end, I'm encountering DPMS problems too, but this doesn't involve any 3D applications nor DRI. DRI is disabled (I have a too-large Virtual screen size).

Unfortunately, I haven't been able to save any Xorg logs yet.
Comment 19 Samuel Thibault 2007-04-17 12:57:57 UTC
xserver RC5 fixed the issue for me, whatever the version of xf86-video-intel (RC3 or current git). Would you like me to to bisect the git for discovering which commit fixed it?
Comment 20 JM Ibanez 2007-04-18 03:38:24 UTC
This bug seems to still hit me.
Comment 21 Samuel Thibault 2007-04-18 03:40:39 UTC
JM: how do you reproduce the bug?
Comment 22 Samuel Thibault 2007-04-20 01:18:56 UTC
Mmm, I also got a hang yesterday. That was during a long noon break during which I left xscreensaver running. I'm now trying without it, just to check whether it's related.
Comment 23 JM Ibanez 2007-04-20 05:40:39 UTC
Samuel: I'm seeing the problem occur sporadically. Hard to reproduce. However:

 - It seems that when I have an external monitor connected, it occurs more readily.
 - If when I run xrandr I'm told that there's an output connected to the VGA (even if there isn't), I'm also more susceptible to it.

I'll see if I can try isolating it.

Also: running 'xset dpms force off' doesn't hit me with the bug; however, lid closure does. (it could be a fluke though -- I'll investigate further).  
Comment 24 Conn 2007-04-22 09:08:48 UTC
I believe that I may be experiencing the same bug on my system (Dell Inspiron 510m laptop, 855GM graphics running Ubuntu Feisty and the 2.0.0 intel driver from git).

My system tends to hang both when I switch from X to a VT or run an application that changes the screen resolution, e.g. ppracer (DRI functions perfectly, that's not the issue). When the hang occurs, the screen blanks and the system becomes unresponsive and only a hard reset works. Is this is the same issue (with DPMS), or will I open another bug?
Comment 25 lkislaire 2007-04-24 05:01:26 UTC
I get the same lockup with my 855GM on a HP nx5000 running debian unstable
Kernel is 2.6.20.3
I _think_ the problem appeared when doing this upgrade
2007-04-20 12:57:41: - C        xserver-xorg-video-i810                                2:1.7.2-4                                         
2007-04-20 12:58:04:   U                     libxrandr2                              2:1.1.0.2-5                                2:1.2.1-1
2007-04-20 12:58:11:   U                        libdrm2                                  2.3.0-2                                  2.3.0-4
2007-04-20 12:58:11:   U                libgl1-mesa-dri                                6.5.1-0.6                                  6.5.2-4
2007-04-20 12:58:15:   U                libgl1-mesa-glx                                6.5.1-0.6                                  6.5.2-4
2007-04-20 12:58:41:   U                   libglu1-mesa                                6.5.1-0.6                                  6.5.2-4
2007-04-20 12:58:43:   U                     mesa-utils                                6.3.2-2.1                                  6.5.2-4
2007-04-20 12:58:44:   U              xserver-xorg-core                               2:1.1.1-21                         2:1.3.0.0.dfsg-1
2007-04-20 12:58:45: + I       xserver-xorg-video-intel                                                                         2:2.0.0-1

I will try the 2.6.21-rc7 and report if it make any difference and also try to downgrade some of the above packages.

Thank you for the work so far.
Comment 26 lkislaire 2007-04-24 07:21:26 UTC
Using 2.6.21-rc7 makes no difference
Pushing the laptop lid freezes the computer after 2 seconds (less than this, it comes back)
The regression appears when going from xserver-xorg-video-i810 1.9.94-1 -> 2.0.0-1
Adding acpi=off to the kernel inhibits the problem


Section "Module"
        Load    "bitmap"
        Load    "dbe"
        Load    "ddc"
        Load    "extmod"
        Load    "freetype"
        Disable "glx"
        Disable "dri"
        Load    "int10"
        Load    "vbe"
EndSection
Comment 27 Michel Dänzer 2007-04-24 07:52:44 UTC
(In reply to comment #26)
> The regression appears when going from xserver-xorg-video-i810 1.9.94-1 ->
> 2.0.0-1

This bug was originally reported against 1.9.93 though... Still, it would be great if you could isolate the guilty change with git-bisect.
Comment 28 Conn 2007-04-24 18:06:27 UTC
(In reply to comment #27)

> This bug was originally reported against 1.9.93 though... Still, it would be
> great if you could isolate the guilty change with git-bisect.
> 

I tried running version 1.9.94 on my system (also with ACPI disabled), but I can still reproduce the hang with that version of the driver. This bug is quite frustrating to isolate, because it's hard to anticipate when a lockup will occur and because it's a hard freeze no logs are recoverable.

Observations on my system (Dell Inspiron 510m, 855GM, Ubuntu Feisty):

Driver 1.7.4: Everything works perfectly - DRI, DPMS, XV output, switching to VT, closing lid, etc. I don't recall ever experiencing a system lockup with this driver (except perhaps from playing around with DRI on rare occasions, but it's not related to this).

Driver 1.9.94: Switching between VTs, deliberately killing/respawning X seems stable for a while, but after some time a hang will occur. DPMS function by pressing the close lid button briefly, but system hangs if pressed for several seconds or pressed repeatedly. XV playback will crash or lockup X (but fixable with temporary patches in bug #10645). Changing screen resolution will sometimes cause the server to restart. Running ppracer (fullscreen 800x600) will sometimes work initially, but then you can guarantee a hard freeze if you try to relaunch ppracer, switch to a VT or kill X afterwards.

Driver 2.0.0: This driver is quite erratic on my system. Sometimes when launched, GDM tries to set a virtual mode rather than the intended 1024x768, and  crashes with an .xsession-errors message (will post logs). If this behaviour doesn't occur, once logged into GNOME I experience virtually the same behaviour as the 1.9.94 driver.

I can't ssh into the laptop when the freezes occur, and the logs aren't written to disk in time, so I'm unsure how I can provide good logs for troubleshooting. In the meantime I'm attaching logs from my system when the 2.0.0 driver is active and exhibiting the strange GDM bug, maybe it'll give some clues.
Comment 29 Conn 2007-04-24 18:07:36 UTC
Created attachment 9727 [details]
xorg.conf (driver 2.0.0)
Comment 30 Conn 2007-04-24 18:08:49 UTC
Created attachment 9728 [details]
Xorg.0.log (driver 2.0.0, after GDM bug, but no hang)
Comment 31 Conn 2007-04-24 18:09:36 UTC
Created attachment 9729 [details]
dmesg (driver 2.0.0)
Comment 32 Conn 2007-04-24 18:10:38 UTC
Created attachment 9730 [details]
.xsession-errors relating to strange GDM behaviour (driver 2.0.0)
Comment 33 Michel Dänzer 2007-04-25 03:00:15 UTC
(In reply to comment #28)
> Driver 2.0.0: This driver is quite erratic on my system. Sometimes when
> launched, GDM tries to set a virtual mode rather than the intended 1024x768,
> and  crashes with an .xsession-errors message (will post logs).

Please don't mix up several issues in a single report but file a new one if this hasn't been reported yet (it sounds like this could be related to intermittent detection of a non-existant external display reported elsewhere).


> I can't ssh into the laptop when the freezes occur, and the logs aren't written
> to disk in time, so I'm unsure how I can provide good logs for troubleshooting.

Remounting the filesystem containing the logs with the sync option might help.
Comment 34 lkislaire 2007-04-25 05:24:50 UTC
(In reply to comment #27)
> (In reply to comment #26)
> > The regression appears when going from xserver-xorg-video-i810 1.9.94-1 ->
> > 2.0.0-1
> 
> This bug was originally reported against 1.9.93 though... Still, it would be
> great if you could isolate the guilty change with git-bisect.
> 
Short comment (because I lack time..):
While bisecting from freedesktop git (thanks for the http-export btw), I discovered that 1.9.94 is indeed bad for me. No time to check why the same debian version didn't show the problem.
I tried to bisect between 1.9.93 and 1.9.94 but the compilation failed due to changes in libxrandr. I hope to get today again on the problem as soon as I have more time.

Comment 35 Vincent Bernat 2007-05-13 07:20:24 UTC
I have the same problem here. X.org from Debian Unstable. Going back to xserver-xorg-video-i810 from Etch solves the problem. I do not use 3D (but DRI is enabled). I may give more details on thursday.
Comment 36 lkislaire 2007-05-16 08:18:39 UTC
Some news... Since a few days I'm fighting with this lockup and my last discovery:

Building xf86-video-intel 1.9.91 against xserver 1.2.99.902 already shows the problem (no dri, no mesa, not loading module dri and glx). Can somebody confirm?
Building xf86-video-intel 1.7.4 against the same internal xserver does not show this lockup.

At the moment when I build an intermediate version (300 commits between the 1.7.4 and 1.9.91..) I get a signal 11 after entering the preinit of i810_drv.so, and this for several versions of xf86-video-intel.. ??

I will try some "manual bisect",meanwhile, if somebody has at least two commit-ids for xserver and xf86-video-input that I should try, that would help me.
Comment 37 Samuel Thibault 2007-06-19 04:28:41 UTC
Created attachment 10373 [details]
Crash log (actually probably not related to this bug)


I don't seem to have hangs any more, but I have experienced a server crash when leaving xscreensaver with DPMS enabled (I don't seem to have any problem with DPMS but no xscreensaver or xscreensaver without DPMS). Here is the log file.
Comment 38 Colin Macdonald 2007-07-11 16:12:51 UTC
I'm seeing something like this with 2.1.0 on my thinkpad r31 (i830).  DRI is disabled.  This is the first -intel driver I could use b/c of #10389 in 2.0.0.  

The locks have happened twice since upgrading to 2.1.0 last night.  Both times I left my laptop for sometime and returned to find it locked up (as above, not replying to pings even sysrq is unresponsive).

I have reduced my gnome-screensaver times to 1 min til blanking and 2 mins to dpms off (in gnome-power-manager) but have not yet seen it.  I haven't not yet tracked down how exactly to reproduce it but will post once/if I have something more definite.
Comment 39 lkislaire 2007-07-27 02:47:30 UTC
Hi,
I took the time to pull the latest git and until now, I haven't experienced any freeze when putting down the lid button of my laptop (855GM)

As a recall, the xorg.conf is a default one with only these modules:
Section "Module"
        Disable "dri"
        Disable "glx"
EndSection

I guess it's maybe the BIOS timing problem that Keith corrected but haven't verified (yet).

That's really cool! I'm very happy to be using this driver!

Comment 40 Raúl 2007-07-28 17:02:53 UTC
lkislaire the lid locking issue you mention is a different problem of the reported here. The problem that Conn has (and myself) is also different and reported as https://bugs.freedesktop.org/show_bug.cgi?id=11432. This seems to be related with the Dell laptops BIOS.

The lid issue lkislaire is reporting to be fixed maybe the one solved with commit http://gitweb.freedesktop.org/?p=xorg/driver/xf86-video-intel.git;a=commit;h=d957c6b8e1dde8e11c1db3431e0ff58c5d984880 

I'm not a perfect bug reporter, but as I've been told is better to create a new bug report and merge afterwards than mess several issues on the same bug report. Let's help developers! ;)
Comment 41 Michael Fu 2007-08-23 19:03:23 UTC
there are too many reports in this bug. As the original bug reporter said this bug has been fixed (no hang). I will mark this bug as fixed and please open new bug if you experience other error.
Comment 42 Colin Macdonald 2008-03-11 16:16:20 UTC
Just in case anyone is looking at, my issue in comment #38 seems to be related to the Pipe A quirk and is reported in bug #14960


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.