Bug 80456 - [IVB/HSW Regression] X server crash when Lightsmark v2008 run 10~15 times
Summary: [IVB/HSW Regression] X server crash when Lightsmark v2008 run 10~15 times
Status: VERIFIED FIXED
Alias: None
Product: xorg
Classification: Unclassified
Component: Driver/intel
Version: git
Hardware: All Linux (All)
Importance: high major
Assignee: Chris Wilson
QA Contact: Intel GFX Bugs mailing list
Duplicates: 83176
Reported: 2014-06-24 03:08 UTC by liulei
Modified: 2014-09-19 21:17 UTC

Attachments
dmesg (122.71 KB, text/plain), 2014-06-24 03:08 UTC, liulei
Xorg.log (15.84 KB, text/plain), 2014-06-24 03:10 UTC, liulei
reopen-xorg.log (22.50 KB, text/plain), 2014-07-21 06:44 UTC, liulei
reopen-dmesg.log (73.36 KB, text/plain), 2014-07-21 06:59 UTC, liulei
gdb.log (6.74 KB, text/plain), 2014-07-22 01:42 UTC, liulei

Description liulei 2014-06-24 03:08:55 UTC
Created attachment 101622
dmesg

==Environment==
--------------------------
Platform: IVB/HSW
Libdrm: (master)libdrm-2.4.54-9-g8fc62ca8ac010659023bb63c4759eb683de4f9af
Mesa: (master)39cdf1621efaebcefc74403b0ef8fa1f699e4b97
Xserver: (master)xorg-server-1.15.99.902-121-g2f5cf9ff9a0f713b7e038636484c77f113a5f10a
Kernel version:
-nightly (fails)
e9ce9f961282205b5954a0da4e48e96884190a77
drm-intel-nightly: 2014y-06m-16d-21h-22m-19s integration manifest

==Bug detailed description==
-----------------------------
The X server crashes after Lightsmark v2008 has been run 6~8 times on IVB/HSW. The problem occurs both on raw X and under gnome-session. It is a kernel regression; please see Xorg.0.log and dmesg.
-nightly   (fails)
e9ce9f961282205b5954a0da4e48e96884190a77
drm-intel-nightly: 2014y-06m-16d-21h-22m-19s integration manifest

-next-queued  (fails)
8c0f92e1f87149bd2429015c9d0600a13a34d607
drm/i915: Fix comment about our plane remapping on gen2/3

-fixes (works)
223a6f2b975ab35d93270ea1d4fb6e0ac6b27fe6
drm/i915/bdw: remove erroneous chv specific workarounds from bdw code

-next-queued (good)
Good commit:868d665b43473e230d560d5186535270a3d57a19
            drm/i915: Fix memory leak in intel_dsi_init() error path

Output log
-------------------------
Penumbra quality: 8/8 on Mesa DRI Intel(R) Ivybridge Mobile .
Loading Lightsmark2008.cfg...
  Loading objects/I_Robot_female_HD.3DS...
(EE)
(EE) Backtrace:
(EE) 0: X (xorg_backtrace+0x48) [0x46db28]
(EE) 1: X (0x400000+0x73aa9) [0x473aa9]
(EE) 2: /usr/lib64/libpthread.so.0 (0x3f7a200000+0xefa0) [0x3f7a20efa0]
(EE) 3: X (0x400000+0x1364dd) [0x5364dd]
(EE) 4: X (0x400000+0x1367b8) [0x5367b8]
(EE) 5: X (0x400000+0x13a1d1) [0x53a1d1]
(EE) 6: X (0x400000+0x13165c) [0x53165c]
(EE) 7: X (0x400000+0x132291) [0x532291]
(EE) 8: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f9abf880000+0xf37d1) [0x7f9abf9737d1]
(EE) 9: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f9abf880000+0x6b0a5) [0x7f9abf8eb0a5]
(EE) 10: X (WakeupHandler+0x9c) [0x43e68c]
(EE) 11: X (WaitForSomething+0x1b6) [0x477636]
(EE) 12: X (0x400000+0x398ea) [0x4398ea]
(EE) 13: X (0x400000+0x3db2a) [0x43db2a]
(EE) 14: /usr/lib64/libc.so.6 (__libc_start_main+0xf5) [0x3f79e21b75]
(EE) 15: X (0x400000+0x282b1) [0x4282b1]
(EE)
(EE) Segmentation fault at address 0x1000007bb
(EE)
Fatal server error:
(EE) Caught signal 11 (Segmentation fault). Server aborting
(EE)
(EE)
Please consult the The X.Org Foundation support
         at http://wiki.x.org
 for help.
(EE) Please also check the log file at "/opt/X11R7/var/log/Xorg.0.log" for additional information.
(EE)
(II) AIGLX: Suspending AIGLX clients for VT switch
  Loading scenes/wop_padattic/wop_padatticBB.ani...
  Detection quality: auto->high.
  Loading scene scenes/wop_padattic/wop_padatticBB.bsp...
(EE) Server terminated with error (1). Closing log file.
xterm: fatal IO error 11 (Resource temporarily unavailable) or KillClient on X server ":0"
xinit: connection to X server lost
XIO:  fatal IO error 104 (Connection reset by peer) on X server ":0.0"
      after 77 requests (74 known processed) with 0 events remaining.
[1]+  Done                    xinit

==Reproduce steps==
---------------------------- 
1. xinit &
2. Start Lightsmark
Comment 1 liulei 2014-06-24 03:10:10 UTC
Created attachment 101623
Xorg.log
Comment 2 Chris Wilson 2014-06-24 06:02:36 UTC
I'll be impressed if you can bisect this to a kernel commit...
Comment 3 liulei 2014-06-25 03:42:35 UTC
(In reply to comment #2)
> I'll be impressed if you can bisect this to a kernel commit...
This issue is hard to reproduce. Lightsmark may crash only after ten or even fifteen runs. After increasing my attempts to fifteen runs, I found that it is not a kernel regression.
Comment 4 Chris Wilson 2014-06-26 07:04:51 UTC

*** This bug has been marked as a duplicate of bug 80157 ***
Comment 5 Chris Wilson 2014-07-21 05:53:32 UTC
If you plan on reopening a bug, make sure you include the up-to-date information, i.e. treat it as a fresh bug report to at least convince us that it is the same bug.
Comment 6 liulei 2014-07-21 06:39:54 UTC
==Environment==
--------------------------
Platform: IVB/HSW
Libdrm: (master)libdrm-2.4.54-19-gc0b34dca2632a774249cfa3b969c3f7ce9df33e1
Mesa: (master)48deb4dbf278c03ea49aeaa7f9d9eda231def817
Xserver: (master)xorg-server-1.15.99.902-121-g2f5cf9ff9a0f713b7e038636484c77f113a5f10a
Xf86_video_intel: (master)2.99.912-233-gf33d44f41ef0f287375b7a6b1c117abff5a23b19
Cairo: (master)f4dbba26d4fc571e0621f3c7a98196c501f8bcc5
Libva: (master)c61d8c6ce9ffc27320e9e177c1e1123d5f1b5014
Libva_intel_driver: (master)c5cb17ea86f0065a939d3636dd26651c93d497c8

Kernel: drm-intel-nightly
8734408c113bb38234ed03ec51c723b3deff579b
drm-intel-nightly: 2014y-07m-18d-15h-21m-24s integration manifest
Comment 7 liulei 2014-07-21 06:41:59 UTC
I found this issue still exists.
(EE) Backtrace:
(EE) 0: X (xorg_backtrace+0x48) [0x469fd8]
(EE) 1: X (0x400000+0x6fd69) [0x46fd69]
(EE) 2: /usr/lib64/libpthread.so.0 (0x3f7a200000+0xefa0) [0x3f7a20efa0]
(EE) 3: /usr/lib64/libc.so.6 (0x3f79e00000+0x148593) [0x3f79f48593]
(EE) 4: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f1ee49d9000+0x61f3b) [0x7f1ee4a3af3b]
(EE) 5: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f1ee49d9000+0x62c81) [0x7f1ee4a3bc81]
(EE) 6: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f1ee49d9000+0x41a4c) [0x7f1ee4a1aa4c]
(EE) 7: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f1ee49d9000+0x52cb2) [0x7f1ee4a2bcb2]
(EE) 8: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f1ee49d9000+0x324b4) [0x7f1ee4a0b4b4]
(EE) 9: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f1ee49d9000+0x327ab) [0x7f1ee4a0b7ab]
(EE) 10: X (0x400000+0x128aba) [0x528aba]
(EE) 11: X (0x400000+0x120a5c) [0x520a5c]
(EE) 12: X (0x400000+0x121592) [0x521592]
(EE) 13: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f1ee49d9000+0xf7f71) [0x7f1ee4ad0f71]
(EE) 14: /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so (0x7f1ee49d9000+0x6c12f) [0x7f1ee4a4512f]
(EE) 15: X (WakeupHandler+0x9c) [0x43da6c]
(EE) 16: X (WaitForSomething+0x1bf) [0x4738cf]
(EE) 17: X (0x400000+0x38e71) [0x438e71]
(EE) 18: X (0x400000+0x3cf4a) [0x43cf4a]
(EE) 19: /usr/lib64/libc.so.6 (__libc_start_main+0xf5) [0x3f79e21b75]
(EE) 20: X (0x400000+0x286c1) [0x4286c1]
(EE)
(EE) Segmentation fault at address 0x7f1ee2fb6000
(EE)
Fatal server error:
(EE) Caught signal 11 (Segmentation fault). Server aborting
(EE)
(EE)
Please consult the The X.Org Foundation support
         at http://wiki.x.org
 for help.
(EE) Please also check the log file at "/opt/X11R7/var/log/Xorg.0.log" for additional information.
(EE)
(II) AIGLX: Suspending AIGLX clients for VT switch
Comment 8 liulei 2014-07-21 06:44:33 UTC
Created attachment 103167
reopen-xorg.log
Comment 9 Chris Wilson 2014-07-21 06:47:15 UTC
(In reply to comment #7)
> I found this issue still exists.
> (EE) Backtrace:
[snip]
> Fatal server error:
> (EE) Caught signal 11 (Segmentation fault). Server aborting

Which doesn't look a lot like the original. Either capture the bt in gdb or run it through addr2line.
Comment 10 liulei 2014-07-21 06:59:01 UTC
Created attachment 103169
reopen-dmesg.log
Comment 11 liulei 2014-07-21 07:18:37 UTC
(In reply to comment #9)
> (In reply to comment #7)
> > I found this issue still exists.
> > (EE) Backtrace:
> [snip]
> > Fatal server error:
> > (EE) Caught signal 11 (Segmentation fault). Server aborting
> 
> Which doesn't look a lot like the original. Either capture the bt in gdb or
> run it through addr2line.
Is this the same issue as the bug below?
https://bugs.freedesktop.org/show_bug.cgi?id=81215
Comment 12 Chris Wilson 2014-07-21 07:25:21 UTC
That's what I suspect, but I don't have enough information to be sure.
Comment 13 liulei 2014-07-21 07:37:44 UTC
> Which doesn't look a lot like the original. Either capture the bt in gdb or
> run it through addr2line.
Sorry, I haven't done this before. Could you please give me the steps to run it through addr2line?
Comment 14 Chris Wilson 2014-07-21 07:43:00 UTC
You have to specify the object to read the debug information from and the relative addresses:

addr2line -e /opt/X11R7/lib/xorg/modules/drivers/intel_drv.so -i 0x61f3b 0x62c81 0x41a4c 0x52cb2 0x324b4 0x327ab 0xf7f71 0x6c12f

It is not as good as using gdb and inspecting the fault in place, but it is better than nothing.
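
The relative addresses in that command are the +0x... offsets from the intel_drv.so frames of the backtrace in comment 7. As a minimal sketch (not part of the original comment), the Python helper below groups those offsets per shared object and builds one addr2line command for each. The function names offsets_by_object and addr2line_commands are hypothetical, and the regular expression assumes the "(EE) N: path (0xBASE+0xOFFSET) [0xADDR]" frame layout seen in this report. Frames in the main X binary are skipped, because the log gives only the name X rather than a path.

```python
import re

# Assumed frame layout, taken from the backtraces in this report:
#   (EE) 4: /path/to/object.so (0x7f1ee49d9000+0x61f3b) [0x7f1ee4a3af3b]
# Frames of the form "X (WakeupHandler+0x9c)" carry a symbol name instead of
# a load base, so they deliberately do not match.
FRAME_RE = re.compile(
    r"\(EE\)\s+\d+:\s+(\S+)\s+\((0x[0-9a-fA-F]+)\+(0x[0-9a-fA-F]+)\)"
)


def offsets_by_object(log_text):
    """Map each object path to its load-relative offsets, in frame order."""
    grouped = {}
    for line in log_text.splitlines():
        match = FRAME_RE.search(line)
        if match is None:
            continue
        obj, _base, offset = match.groups()
        # Skip the main binary: the log names it "X", which is not a path
        # that addr2line could open.
        if not obj.startswith("/"):
            continue
        grouped.setdefault(obj, []).append(offset)
    return grouped


def addr2line_commands(log_text):
    """Build one 'addr2line -e OBJECT -i OFFSETS...' command per object."""
    return [
        "addr2line -e {} -i {}".format(obj, " ".join(offsets))
        for obj, offsets in offsets_by_object(log_text).items()
    ]
```

Fed the backtrace from comment 7, the intel_drv.so entry yields the same eight offsets as the command above. Whether a bare offset resolves correctly depends on how the object was linked and loaded, so the output for other libraries should be treated as a starting point only.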
Comment 15 liulei 2014-07-22 01:42:48 UTC
Created attachment 103245
gdb.log
Comment 16 Chris Wilson 2014-07-22 05:56:53 UTC
(In reply to comment #15)
> Created attachment 103245
> gdb.log

Oh, if you want to print the locals at each frame, just use "bt full". That stack is bug 81215 so I don't think we have a re-occurrence of the DRI2 bug here.
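
As a rough sketch of capturing such a backtrace non-interactively (an assumption, not a procedure given in this thread), the Python snippet below builds a gdb command line that attaches to a running X server, continues until the process stops on a signal such as SIGSEGV, and then prints "bt full". The helper names and the PID are hypothetical; -p, -batch, and -ex are standard gdb options, and the SIGPIPE line is a common precaution so that gdb does not stop early on that signal. This is usually run from a remote login such as ssh, since stopping the X server from a terminal it is displaying would freeze that terminal.

```python
import shlex


def gdb_bt_full_argv(pid):
    """Build a gdb argv that attaches to `pid` and dumps 'bt full' on a stop.

    Hypothetical helper: the PID must be that of the running X server.
    """
    return [
        "gdb",
        "-p", str(pid),   # attach to the running process
        "-batch",         # run the -ex commands below, then exit
        "-ex", "handle SIGPIPE nostop noprint pass",
        "-ex", "continue",  # resume until the process stops on a signal
        "-ex", "bt full",   # backtrace with the locals of every frame
    ]


def as_shell(argv):
    """Render an argv list as a single shell-quoted command line."""
    return " ".join(shlex.quote(arg) for arg in argv)
```

Printing as_shell(gdb_bt_full_argv(pid)) gives a line that can be pasted into a root shell; redirecting its output to a file produces a log comparable to the attached gdb.log.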
Comment 17 Jacques Guilleron 2014-09-19 21:17:52 UTC
*** Bug 83176 has been marked as a duplicate of this bug. ***

