Bug 105577 - When resuming from hibernation the gpu hangs
Summary: When resuming from hibernation the gpu hangs
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-03-18 12:03 UTC by Tero Kivinen
Modified: 2018-04-27 12:31 UTC (History)
1 user (show)

See Also:
i915 platform: KBL
i915 features: GPU hang


Attachments
/sys/class/drm/card0/error (126.89 KB, application/cap)
2018-03-18 12:03 UTC, Tero Kivinen
no flags Details

Description Tero Kivinen 2018-03-18 12:03:20 UTC
Created attachment 138179 [details]
/sys/class/drm/card0/error

When resuming from the hibernation the system boots up, shows up the session for about 30 seconds and then the either crashes, or the X11 gets killed (in which case I get back to the login screen). I do not have log entries from the crashes, as it does not manage to write log entries to disk before crashing (there is about kilobyte worth of zeroes in the log file).

When it just kills the X11 there is following text in the log:

[drm] GPU HANG: ecode 9:0:0x3cc4e213, in Xorg [563], reason: Hang on render ring, action: reset
[drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[drm] GPU crash dump saved to /sys/class/drm/card0/error
drm/i915: Resetting chip after gpu hang
[drm] RC6 on
[drm] GuC firmware load skipped
nouveau 0000:01:00.0: DRM: resuming kernel object tree...
nouveau 0000:01:00.0: priv: HUB0: 10ecc0 ffffffff (1a40822c)
nouveau 0000:01:00.0: DRM: resuming client object trees...
nouveau 0000:01:00.0: DRM: evicting buffers...
nouveau 0000:01:00.0: DRM: waiting for kernel channels to go idle...
nouveau 0000:01:00.0: DRM: suspending client object trees...
nouveau 0000:01:00.0: DRM: suspending kernel object tree...
drm/i915: Resetting chip after gpu hang
[drm] RC6 on
[drm] GuC firmware load skipped
nouveau 0000:01:00.0: DRM: resuming kernel object tree...
nouveau 0000:01:00.0: priv: HUB0: 10ecc0 ffffffff (1a40822c)
nouveau 0000:01:00.0: DRM: resuming client object trees...


Information about system:

Linux kivitaulu 4.9.0-6-amd64 #1 SMP Debian 4.9.82-1+deb9u3 (2018-03-02) x86_64 GNU/Linux


00:00.0 Host bridge: Intel Corporation Device 5904 (rev 02)
00:02.0 VGA compatible controller: Intel Corporation Device 5916 (rev 02)
00:04.0 Signal processing controller: Intel Corporation Skylake Processor Thermal Subsystem (rev 02)
00:14.0 USB controller: Intel Corporation Sunrise Point-LP USB 3.0 xHCI Controller (rev 21)
00:14.2 Signal processing controller: Intel Corporation Sunrise Point-LP Thermal subsystem (rev 21)
00:15.0 Signal processing controller: Intel Corporation Sunrise Point-LP Serial IO I2C Controller #0 (rev 21)
00:15.1 Signal processing controller: Intel Corporation Sunrise Point-LP Serial IO I2C Controller #1 (rev 21)
00:16.0 Communication controller: Intel Corporation Sunrise Point-LP CSME HECI #1 (rev 21)
00:17.0 SATA controller: Intel Corporation Sunrise Point-LP SATA Controller [AHCI mode] (rev 21)
00:1c.0 PCI bridge: Intel Corporation Device 9d10 (rev f1)
00:1c.5 PCI bridge: Intel Corporation Sunrise Point-LP PCI Express Root Port #6 (rev f1)
00:1f.0 ISA bridge: Intel Corporation Device 9d58 (rev 21)
00:1f.2 Memory controller: Intel Corporation Sunrise Point-LP PMC (rev 21)
00:1f.3 Audio device: Intel Corporation Device 9d71 (rev 21)
00:1f.4 SMBus: Intel Corporation Sunrise Point-LP SMBus (rev 21)
01:00.0 3D controller: NVIDIA Corporation GM107M [GeForce GTX 960M] (rev a2)
02:00.0 Network controller: Intel Corporation Wireless 8260 (rev 3a)
Comment 1 Elizabeth 2018-03-20 21:10:26 UTC
Hi, is it possible that you try to replicate this with the latest stable kernel https://www.kernel.org? Also have you tried disabling rc6 with i915_enable_rc6=0 parameter on grub?
Comment 2 Jani Saarinen 2018-03-29 07:11:24 UTC
First of all. Sorry about spam.
This is mass update for our bugs. 

Sorry if you feel this annoying but with this trying to understand if bug still valid or not.
If bug investigation still in progress, please ignore this and I apologize!

If you think this is not anymore valid, please comment to the bug that can be closed.
If you haven't tested with our latest pre-upstream tree(drm-tip), can you do that also to see if issue is valid there still and if you cannot see issue there, please comment to the bug.
Comment 3 Jani Saarinen 2018-04-20 19:04:39 UTC
Tero, have you tried with latest drm-tip: https://cgit.freedesktop.org/drm-tip?
Comment 4 Jani Saarinen 2018-04-27 12:31:32 UTC
Closing, please re-open is issue still exists.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.