Bug 96442 - Hibernating while utilising i915 with modeset=1 results in crash.
Summary: Hibernating while utilising i915 with modeset=1 results in crash.
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-06-08 18:39 UTC by Volker Gronau
Modified: 2017-07-24 22:41 UTC (History)
1 user (show)

See Also:
i915 platform: HSW
i915 features: power/suspend-resume


Attachments
ZIP files containing screenshots and dmesg of system startup (3.72 MB, application/zip)
2016-06-08 18:39 UTC, Volker Gronau
no flags Details
dmesg of hibernate and resume cycles (1.12 MB, text/plain)
2016-06-09 19:53 UTC, Volker Gronau
no flags Details

Description Volker Gronau 2016-06-08 18:39:13 UTC
Created attachment 124405 [details]
ZIP files containing screenshots and dmesg of system startup

Suspending and resuming a Toshiba Satellite C50-B-1CJ laptop (latest BIOS, EFI start or BIOS start) randomly causes system freezes.

Hours of testing revealed that these freezes were caused by the i915 driver. When using nomodeset they do not happen. Using windows hibernate work perfectly fine. Sometimes it happens after just one hibernate cycle, sometimes it takes like (felt) 20 times. 

I tested all stock kernels from 3.9 to the today's nightly code. I enabled and disabled all drivers options to no effect. This behaviour persists. Using kernel DRI results in the fact that hibernate will freeze the machine sooner or later. Staying on console and not starting X does not resolve the problem.

Very seldom the display comes up after this and a kernel message could be seen. I took a screenshot (literally) because you cannot write to the hard disk anymore. Whatever you do causes the system to fully die. 

I bought this laptop as a present and will have access to it till the end of august. If you want me to try anything i can happily do. I can provide ssh access if you like.

Sadly I saw the instruction for posting bug reports utilising drm.debug=0x1e too late. As I already mentioned it is a rarity to capture a kernel panic at all but I try to catch one with this mode set.

In my personal opinion the driver somehow corrupts the memory as the errors before system death looked very different in the past. I have the impression if you let glxgears run it make the problem appear sooner.

(If somebody in the future experiences the same problem and it was never fixed with this https://github.com/vjay82/Linux-i915-Backlight-Control you can at least control the backlight but anyway you will loose acceleration.)
Comment 1 Volker Gronau 2016-06-08 18:57:57 UTC
Correction:

kernels from 3.9 -> Kernel 3.19 

I found this, somebody reporting the same issue having a Toshiba laptop too. https://bugs.freedesktop.org/show_bug.cgi?id=94203

Maybe it helps. On 01.org it is advised to open a separate report anyway.
Comment 2 Jani Nikula 2016-06-09 07:45:25 UTC
(In reply to Volker Gronau from comment #1)
> I found this, somebody reporting the same issue having a Toshiba laptop too.
> https://bugs.freedesktop.org/show_bug.cgi?id=94203
> 
> Maybe it helps. On 01.org it is advised to open a separate report anyway.

That one is Ironlake, this one is Haswell.
Comment 3 Jani Nikula 2016-06-09 07:58:37 UTC
(In reply to Volker Gronau from comment #0)
> Created attachment 124405 [details]
> ZIP files containing screenshots and dmesg of system startup

For future reference, please attach each file directly, unpacked.

> Very seldom the display comes up after this and a kernel message could be
> seen. I took a screenshot (literally) because you cannot write to the hard
> disk anymore. Whatever you do causes the system to fully die. 

Unfortunately, the screenshots do not contain the full backtrace. :(

> Sadly I saw the instruction for posting bug reports utilising drm.debug=0x1e
> too late. As I already mentioned it is a rarity to capture a kernel panic at
> all but I try to catch one with this mode set.

Please attach such a dmesg running drm-intel-nightly over a hibernate-resume sequence even if it doesn't result in a crash; it may provide valueable information anyway.

> (If somebody in the future experiences the same problem and it was never
> fixed with this https://github.com/vjay82/Linux-i915-Backlight-Control you
> can at least control the backlight but anyway you will loose acceleration.)

I can't recommend anyone using intel_reg for anything except testing and debugging.
Comment 4 Volker Gronau 2016-06-09 19:53:28 UTC
Created attachment 124425 [details]
dmesg of hibernate and resume cycles

I did add the proposed kernel parameters "drm.debug=0x1e log_buf_len=1M", took a dmesg of the system startup and let it continually log into the file.
Each time i cycled i echoed it into the file too.
Does this help? Can I provide sth else?
Comment 5 Volker Gronau 2016-06-09 20:01:50 UTC
I forgot to mention it: As you can see in the truncated file, after the third cycle the system did not come back. The screen stayed black and it did not answer when pinged. 

I was asking if I can provide more specific information because I prefer contributing that over randomly crashing the machine and hoping for another chance to screenshot it.
Comment 6 Volker Gronau 2016-08-21 21:40:24 UTC
fixed with today's nightly


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.