Bug 103602 - Interface crash and reboot
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: XOrg git
Hardware: Other All
Importance: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
Reported: 2017-11-06 22:02 UTC by Bob Kelso 1962
Modified: 2018-04-20 14:50 UTC

i915 platform: KBL
i915 features: GPU hang


Attachments
/sys/class/drm/card0/error (780.27 KB, text/plain)
2017-11-06 22:02 UTC, Bob Kelso 1962

Description Bob Kelso 1962 2017-11-06 22:02:06 UTC
Created attachment 135276
/sys/class/drm/card0/error

Hi

About once or twice a week the interface crashes. Everything freezes except the mouse, and after some time everything closes and I am sent back to the login screen.
I'm using Debian 9 with the MATE desktop on an i7-7700K, using its integrated graphics.
The computer also has a GeForce GT 1030, but I'm using it for PCI passthrough.

Here are the dmesg errors:
[90246.510770] perf: interrupt took too long (2507 > 2500), lowering kernel.perf_event_max_sample_rate to 79750
[92338.237194] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[126401.404853] perf: interrupt took too long (3176 > 3133), lowering kernel.perf_event_max_sample_rate to 62750
[131590.917381] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[134922.237206] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[143144.252221] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[150424.038125] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[155128.509145] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[173355.669281] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[178418.933712] perf: interrupt took too long (3975 > 3970), lowering kernel.perf_event_max_sample_rate to 50250
[178695.228970] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[213049.357223] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[226689.413147] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[248727.117065] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[252349.434958] kvm: zapping shadow pages for mmio generation wraparound
[252349.440794] kvm: zapping shadow pages for mmio generation wraparound
[252451.784263] kvm: zapping shadow pages for mmio generation wraparound
[252451.790097] kvm: zapping shadow pages for mmio generation wraparound
[252581.542937] kvm: zapping shadow pages for mmio generation wraparound
[252581.549497] kvm: zapping shadow pages for mmio generation wraparound
[252669.731958] kvm: zapping shadow pages for mmio generation wraparound
[252669.737886] kvm: zapping shadow pages for mmio generation wraparound
[252887.289536] kvm: zapping shadow pages for mmio generation wraparound
[252887.300674] kvm: zapping shadow pages for mmio generation wraparound
[252986.203383] nfs: server 192.168.1.2 not responding, still trying
[252986.468431] nfs: server 192.168.1.2 OK
[260637.245052] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[294000.348610] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[303679.069817] [drm] GPU HANG: ecode 9:0:0x85dffffd, in Xorg [986], reason: Hang on render ring, action: reset
[303679.069820] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[303679.069821] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[303679.069822] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[303679.069823] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[303679.069824] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[303679.069888] drm/i915: Resetting chip after gpu hang
[303679.069956] [drm] RC6 on
[303679.087398] [drm] GuC firmware load skipped
[303691.069459] drm/i915: Resetting chip after gpu hang
[303691.069529] [drm] RC6 on
[303691.084890] [drm] GuC firmware load skipped

I'm also attaching the /sys/class/drm/card0/error file.
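
For anyone triaging a similar log, here is a minimal Python sketch that tallies the i915 messages seen above (FIFO underruns, GPU hangs, chip resets). It only assumes the `[timestamp] message` layout shown in this report; the function name `summarize_dmesg` and the category names are made up for illustration and are not part of any i915 tooling.

```python
import re
from collections import Counter

# Matches a dmesg line such as "[92338.237194] some message".
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(.*)$")

def summarize_dmesg(text):
    """Count FIFO underruns, GPU hangs and resets; return (counts, hangs).

    `hangs` is a list of (timestamp_seconds, message) for each GPU HANG line.
    The category names are hypothetical labels chosen for this sketch.
    """
    counts = Counter()
    hangs = []
    for line in text.splitlines():
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # skip lines without a leading timestamp
        ts, msg = float(m.group(1)), m.group(2)
        if "FIFO underrun" in msg:
            counts["fifo_underrun"] += 1
        elif "GPU HANG" in msg:
            counts["gpu_hang"] += 1
            hangs.append((ts, msg))
        elif "Resetting chip after gpu hang" in msg:
            counts["gpu_reset"] += 1
    return counts, hangs

# A few lines copied from the log above.
sample = """\
[92338.237194] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[131590.917381] [drm:gen8_irq_handler [i915]] *ERROR* CPU pipe A FIFO underrun
[303679.069817] [drm] GPU HANG: ecode 9:0:0x85dffffd, in Xorg [986], reason: Hang on render ring, action: reset
[303679.069888] drm/i915: Resetting chip after gpu hang
"""
counts, hangs = summarize_dmesg(sample)
print(counts["fifo_underrun"], counts["gpu_hang"], counts["gpu_reset"])
```

Feeding it the full output of `dmesg` instead of `sample` would give a quick picture of how often the underruns occur relative to the hang.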
Comment 1 Elizabeth 2017-11-07 17:17:40 UTC
HEAD and ACTHD are different, so the problem was outside the batch. The error state shows:

00000000_febef000 20971520 36 00 [ 119ecc6 00 00 00 00 ] 119ecc6 X dirty render uncached

Could you try the latest stable kernel:
https://www.kernel.org

This commit may help:
commit 1d033beb20d6d5885587a02a393b6598d766a382
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Tue Oct 31 10:36:07 2017 +0000

    drm/i915: Check incoming alignment for unfenced buffers (on i915gm)

Also, you could try adding intel_iommu=igfx_off to the kernel command line in GRUB.
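
After rebooting with the new option, it helps to confirm that the kernel actually received it. The sketch below is one way to check, assuming the standard Linux `/proc/cmdline` file and assuming that `intel_iommu=` accepts comma-separated options (for example `intel_iommu=on,igfx_off`, which is likely relevant here since PCI passthrough needs the IOMMU enabled). The helper names and the example command line are hypothetical.

```python
def has_kernel_param(cmdline, name, value=None):
    """Return True if `name` (optionally with `value`) is on the command line.

    Values are split on commas so that "intel_iommu=on,igfx_off" counts as
    containing "igfx_off" (assumption: the option takes a comma-separated list).
    """
    for token in cmdline.split():
        key, _, val = token.partition("=")
        if key != name:
            continue
        if value is None or value in val.split(","):
            return True
    return False

def read_cmdline(path="/proc/cmdline"):
    """Read the running kernel's command line (Linux only)."""
    with open(path) as f:
        return f.read()

# Hypothetical command line, for illustration only.
example = "BOOT_IMAGE=/vmlinuz root=/dev/sda1 ro quiet intel_iommu=on,igfx_off"
print(has_kernel_param(example, "intel_iommu", "igfx_off"))
```

On the affected machine, `has_kernel_param(read_cmdline(), "intel_iommu", "igfx_off")` would perform the same check against the live system.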
Comment 2 Jani Saarinen 2018-03-29 07:10:30 UTC
First of all, sorry about the spam.
This is a mass update for our bugs.

Sorry if you find this annoying, but we are trying to understand whether this bug is still valid.
If the bug investigation is still in progress, please ignore this, and I apologize!

If you think this bug is no longer valid, please comment on the bug that it can be closed.
If you haven't tested with our latest pre-upstream tree (drm-tip), could you do that as well to see whether the issue is still valid there? If you cannot reproduce the issue there, please comment on the bug.
Comment 3 Jani Saarinen 2018-04-20 14:50:04 UTC
Closing; please re-open if this still occurs.