Bug 48913 - [SNB] Suspend-to-disk occasionally fails to suspend.
Summary: [SNB] Suspend-to-disk occasionally fails to suspend.
Status: CLOSED INVALID
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: Other All
: medium normal
Assignee: Daniel Vetter
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2012-04-19 01:43 UTC by Da Fox
Modified: 2017-07-24 23:02 UTC (History)
5 users (show)

See Also:
i915 platform:
i915 features:


Attachments
Kernel logs for a full working day, ending in failure to suspend-to-disk (247.15 KB, text/plain)
2012-04-19 01:43 UTC, Da Fox
no flags Details

Description Da Fox 2012-04-19 01:43:26 UTC
Created attachment 60299 [details]
Kernel logs for a full working day, ending in failure to suspend-to-disk

Occasionally suspend-to-disk will fail, leaving the laptop with a black screen, getting very hot (fan starts to ramp up to maximum speed). Suspend-to-ram seems unaffected, and always works.

The issue has been present with every kernel I've used on this laptop,
which goes back almost one year. Recent kernels which I have used include 
3.0.0, 3.1.0, 3.2.0, 3.2.6, 3.3.0, and now 3.4.0-rc2.

This seems unrelated to RC6, or to the failures to resume (bug #48912), and happens with both RC6 on and off.



Hardware is Dell XPS 15 (L502x) with:
00:02.0 VGA compatible controller: Intel Corporation Device 0116 (rev 09)
[    13.803] (II) intel(0): Integrated Graphics Chipset: Intel(R) Sandybridge
Mobile (GT2)

Software versions (current):
xf86-video-intel: fd81408b978c9b57c046ee43d2d32e1370e83a7d
xorg-server: 1.11.2-r2
kernel: 3.4-rc2

I'm attaching yesterday's kernel messages, which ended in a suspend failure. Note that the next item to appear in the logs is the laptop booting this morning (not included).
Comment 1 Da Fox 2012-05-13 06:11:01 UTC
(In reply to comment #0)
> Created attachment 60299 [details]
> Kernel logs for a full working day, ending in failure to suspend-to-disk
> 
> Occasionally suspend-to-disk will fail, leaving the laptop with a black screen,
> getting very hot (fan starts to ramp up to maximum speed). Suspend-to-ram seems
> unaffected, and always works.
> 
> The issue has been present with every kernel I've used on this laptop,
> which goes back almost one year. Recent kernels which I have used include 
> 3.0.0, 3.1.0, 3.2.0, 3.2.6, 3.3.0, and now 3.4.0-rc2.
> 
> This seems unrelated to RC6, or to the failures to resume (bug #48912), and
> happens with both RC6 on and off.
> 
> 
> 
> Hardware is Dell XPS 15 (L502x) with:
> 00:02.0 VGA compatible controller: Intel Corporation Device 0116 (rev 09)
> [    13.803] (II) intel(0): Integrated Graphics Chipset: Intel(R) Sandybridge
> Mobile (GT2)
> 
> Software versions (current):
> xf86-video-intel: fd81408b978c9b57c046ee43d2d32e1370e83a7d
> xorg-server: 1.11.2-r2
> kernel: 3.4-rc2
> 
> I'm attaching yesterday's kernel messages, which ended in a suspend failure.
> Note that the next item to appear in the logs is the laptop booting this
> morning (not included).

I experienced this again yesterday with kernel 3.4.0-rc6-00197-ga98e48e (v3.4-rx6 + drm-intel-next + the rps patch from bug #48912), so that patch doesn't seem to fix this issue. I checked the system logs again, and there still was nothing in there.
Comment 2 Jesse Barnes 2012-12-12 18:47:09 UTC
Looks like i915 is getting suspended (the "suspend of devices complete after 1128.954 msecs" seems to indicate that at least).

We can try to be sure though by removing the i915 device.  Can you try using a stress script like http://ubuntu.5.n6.nabble.com/PATCH-0-1-Jaunty-suspend-resume-stress-test-scripts-td368223.html and isolate which module might be breaking things?  It could also be an ACPI bug, in which case a BIOS upgrade may help...
Comment 3 Da Fox 2013-01-14 17:56:26 UTC
(In reply to comment #2)
> Looks like i915 is getting suspended (the "suspend of devices complete after
> 1128.954 msecs" seems to indicate that at least).
> 
> We can try to be sure though by removing the i915 device.  Can you try using
> a stress script like
> http://ubuntu.5.n6.nabble.com/PATCH-0-1-Jaunty-suspend-resume-stress-test-
> scripts-td368223.html and isolate which module might be breaking things?  It
> could also be an ACPI bug, in which case a BIOS upgrade may help...

I don't have any modules, except for 'bbswitch'. This is because everything is compiled into the kernel. Would doing a hibernate-stresstest without X running be of any help in pin-pointing the problem?

The BIOS is at the the second-to-last version (at least, it was the second-to-last version the last time I checked). This is because of a power-management bug, which is (hopefully) almost resolved. After that I will test the newer version of the BIOS again, however it appeared that windows might also have had a higher power consumption under that BIOS, in which case I will need stick with my current BIOS version.
Comment 4 Jani Nikula 2013-10-10 15:10:05 UTC
Sorry, this bug seems to have been neglegted a bit. Are you still seeing this with recent kernels?

If yes, please try drm-intel-nightly branch of git://people.freedesktop.org/~danvet/drm. Attach dmesg with drm.debug=0xe parameter set.

Also worth checking is building i915 as module; while built-in should of course work it doesn't get nearly as much test coverage as module.
Comment 5 Jani Nikula 2013-12-16 14:08:39 UTC
(In reply to comment #4)
> Sorry, this bug seems to have been neglegted a bit. Are you still seeing
> this with recent kernels?
> 
> If yes, please try drm-intel-nightly branch of
> git://people.freedesktop.org/~danvet/drm. Attach dmesg with drm.debug=0xe
> parameter set.
> 
> Also worth checking is building i915 as module; while built-in should of
> course work it doesn't get nearly as much test coverage as module.

Da Fox, ping for test.
Comment 6 Chris Wilson 2013-12-30 13:26:36 UTC
Cause unknown.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.