Bug 90319 - [drm] GPU HANG: ecode 8:0:0x00dffffe, reason: Ring hung, action: reset (after insmod)
Summary: [drm] GPU HANG: ecode 8:0:0x00dffffe, reason: Ring hung, action: reset (after...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium major
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-05-05 16:43 UTC by Stefan Sydow
Modified: 2016-11-18 13:44 UTC (History)
2 users (show)

See Also:
i915 platform: BDW
i915 features: GPU hang


Attachments
Dump from /sys/class/drm/card0/error (2.66 MB, text/plain)
2015-05-05 16:43 UTC, Stefan Sydow
no flags Details

Description Stefan Sydow 2015-05-05 16:43:08 UTC
Created attachment 115549 [details]
Dump from /sys/class/drm/card0/error

demesg:

[    4.661136] [drm] Initialized drm 1.1.0 20060810
[    4.661292] e1000e 0000:00:19.0 enp0s25: renamed from eth0
[    4.664887] rtsx_pci 0000:02:00.0: rtsx_pci_acquire_irq: pcr->msi_en = 1, pci->irq = 46
[    4.665488] snd_hda_intel 0000:00:03.0: Applying patch firmware 'x250.fw'
[    4.665580] snd_hda_intel 0000:00:1b.0: Applying patch firmware 'x250.fw'
[    4.682402] systemd-udevd[497]: renamed network interface eth0 to enp0s25
[    4.685352] snd_hda_codec_realtek hdaudioC2D0: autoconfig for ALC3232: line_outs=1 (0x16/0x0/0x0/0x0/0x0) type:line
[    4.685355] snd_hda_codec_realtek hdaudioC2D0:    speaker_outs=1 (0x14/0x0/0x0/0x0/0x0)
[    4.685357] snd_hda_codec_realtek hdaudioC2D0:    hp_outs=1 (0x15/0x0/0x0/0x0/0x0)
[    4.685357] snd_hda_codec_realtek hdaudioC2D0:    mono: mono_out=0x0
[    4.685358] snd_hda_codec_realtek hdaudioC2D0:    inputs:
[    4.685359] snd_hda_codec_realtek hdaudioC2D0:      Dock Mic=0x19
[    4.685361] snd_hda_codec_realtek hdaudioC2D0:      Mic=0x1a
[    4.685362] snd_hda_codec_realtek hdaudioC2D0:      Internal Mic=0x12
[    4.686971] iwlwifi 0000:03:00.0: loaded firmware version 25.15.12.0 op_mode iwlmvm
[    4.689886] i915 0000:00:02.0: enabling device (0006 -> 0007)
[    4.693391] iwlwifi 0000:03:00.0: Detected Intel(R) Dual Band Wireless AC 7265, REV=0x210
[    4.707798] iwlwifi 0000:03:00.0: L1 Enabled - LTR Enabled
[    4.708241] iwlwifi 0000:03:00.0: L1 Enabled - LTR Enabled
[    4.708370] [drm] Memory usable by graphics device = 4096M
[    4.708372] [drm] VT-d active for gfx access
[    4.708374] checking generic (c0000000 7e9000) vs hw (c0000000 20000000)
[    4.708376] fb: switching to inteldrmfb from simple
[    4.708394] Console: switching to colour dummy device 80x25
[    4.708482] [drm] Replacing VGA console driver
[    4.709311] input: HDA Digital PCBeep as /devices/pci0000:00/0000:00:1b.0/sound/card2/input8
[    4.710472] input: HDA Intel PCH Dock Mic as /devices/pci0000:00/0000:00:1b.0/sound/card2/input9
[    4.710541] input: HDA Intel PCH Mic as /devices/pci0000:00/0000:00:1b.0/sound/card2/input10
[    4.710611] input: HDA Intel PCH Dock Headphone as /devices/pci0000:00/0000:00:1b.0/sound/card2/input11
[    4.710666] input: HDA Intel PCH Headphone as /devices/pci0000:00/0000:00:1b.0/sound/card2/input12
[    4.715899] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
[    4.715900] [drm] Driver supports precise vblank timestamp query.
[    4.716347] vgaarb: device changed decodes: PCI:0000:00:02.0,olddecodes=io+mem,decodes=io+mem:owns=mem
[    4.727899] ACPI: Video Device [VID] (multi-head: yes  rom: no  post: no)
[    4.728107] input: Video Bus as /devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A08:00/LNXVIDEO:00/input/input13
[    4.728205] snd_hda_intel 0000:00:03.0: bound 0000:00:02.0 (ops i915_audio_component_bind_ops [i915])
[    4.728208] [drm] Initialized i915 1.6.0 20150327 for 0000:00:02.0 on minor 0
[    4.736029] input: HDA Intel HDMI HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.0/sound/card1/input14
[    4.736087] input: HDA Intel HDMI HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.0/sound/card1/input15
[    4.736141] input: HDA Intel HDMI HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.0/sound/card1/input16
[    4.771973] ieee80211 phy0: Selected rate control algorithm 'iwl-mvm-rs'
[    4.775503] iwlwifi 0000:03:00.0 wlp3s0: renamed from wlan0
[    4.794389] systemd-udevd[492]: renamed network interface wlan0 to wlp3s0
[    6.158061] random: nonblocking pool is initialized
[    6.379273] IPv6: ADDRCONF(NETDEV_UP): enp0s25: link is not ready
[    7.813177] psmouse serio2: trackpoint: IBM TrackPoint firmware: 0x0e, buttons: 3/3
[    8.007157] input: TPPS/2 IBM TrackPoint as /devices/platform/i8042/serio1/serio2/input/input7
[    9.098702] e1000e: enp0s25 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
[    9.098728] IPv6: ADDRCONF(NETDEV_CHANGE): enp0s25: link becomes ready
[   10.790671] iwlwifi 0000:03:00.0: L1 Enabled - LTR Enabled
[   10.791115] iwlwifi 0000:03:00.0: L1 Enabled - LTR Enabled
[   10.820926] IPv6: ADDRCONF(NETDEV_UP): wlp3s0: link is not ready
[   10.833906] [drm] stuck on render ring
[   10.834519] [drm] GPU HANG: ecode 8:0:0x00dffffe, reason: Ring hung, action: reset
[   10.834521] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[   10.834522] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[   10.834522] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[   10.834523] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[   10.834524] [drm] GPU crash dump saved to /sys/class/drm/card0/error
[   10.836600] drm/i915: Resetting chip after gpu hang
[   16.861616] [drm] stuck on render ring
[   16.862254] [drm] GPU HANG: ecode 8:0:0x00dffffe, reason: Ring hung, action: reset
[   16.862308] [drm:i915_context_is_banned [i915]] *ERROR* gpu hanging too fast, banning!
[   16.864380] drm/i915: Resetting chip after gpu hang
[   22.865290] [drm] stuck on render ring
[   22.865925] [drm] GPU HANG: ecode 8:0:0x00dffffe, reason: Ring hung, action: reset
[   22.868026] drm/i915: Resetting chip after gpu hang
[   28.868967] [drm] stuck on render ring
[   28.869595] [drm] GPU HANG: ecode 8:0:0x00dffffe, reason: Ring hung, action: reset
[   28.871691] drm/i915: Resetting chip after gpu hang
[   34.872648] [drm] stuck on render ring
[   34.873287] [drm] GPU HANG: ecode 8:0:0x00dffffe, reason: Ring hung, action: reset
[   34.875394] drm/i915: Resetting chip after gpu hang
[   40.876322] [drm] stuck on render ring
[   40.876958] [drm] GPU HANG: ecode 8:0:0x00dffffe, reason: Ring hung, action: reset
Comment 1 yann 2016-09-28 13:46:22 UTC
We seem to have neglected the bug a bit, apologies.

from gpu crash dump we can notice that there is a failure with Protected Audio/Video Path :
ERROR: 0x00000028
    Invalid physical address in ROSTRM interface (PAVP)
    Invalid physical address in WRITE interface (PAVP)

There were improvements pushed in kernel and Mesa that will benefit to your system, so please re-test with latest kernel & Mesa to see if this issue is still occurring.
Comment 2 Jozef Chudy 2016-10-06 09:48:46 UTC
Oct  6 10:45:19 oc1068714158 kernel: [drm] stuck on render ring
Oct  6 10:45:19 oc1068714158 kernel: [drm] GPU HANG: ecode 8:0:0xfffffffe, in Xorg [1906], reason: Ring hung, action: reset
Oct  6 10:45:19 oc1068714158 kernel: [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
Oct  6 10:45:19 oc1068714158 kernel: [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
Oct  6 10:45:19 oc1068714158 kernel: [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
Oct  6 10:45:19 oc1068714158 kernel: [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
Oct  6 10:45:19 oc1068714158 kernel: [drm] GPU crash dump saved to /sys/class/drm/card0/error
Oct  6 10:45:19 oc1068714158 kernel: drm/i915: Resetting chip after gpu hang
Oct  6 10:45:24 oc1068714158 sh: abrt-watch-log: Warning, '/usr/bin/abrt-dump-xorg' did not process its input
Oct  6 10:45:25 oc1068714158 kernel: [drm] stuck on render ring
Oct  6 10:45:25 oc1068714158 kernel: [drm] GPU HANG: ecode 8:0:0xfffffffe, in Xorg [1906], reason: Ring hung, action: reset
Oct  6 10:45:25 oc1068714158 kernel: [drm:i915_set_reset_status [i915]] *ERROR* gpu hanging too fast, banning!
Oct  6 10:45:25 oc1068714158 kernel: drm/i915: Resetting chip after gpu hang
Oct  6 10:46:14 oc1068714158 kernel: INFO: rcu_sched detected stalls on CPUs/tasks: { 1} (detected by 3, t=60006 jiffies, g=404605, c=404604, q=0)
Oct  6 10:46:14 oc1068714158 kernel: sending NMI to all CPUs:
Oct  6 10:46:14 oc1068714158 kernel: NMI backtrace for cpu 0
Oct  6 10:46:14 oc1068714158 kernel: CPU: 0 PID: 0 Comm: swapper/0 Tainted: P           OE  ------------   3.10.0-327.36.1.el7.x86_64 #1
Oct  6 10:46:14 oc1068714158 kernel: Hardware name: LENOVO 20BTS0WX15/20BTS0WX15, BIOS N14ET37W (1.15 ) 09/06/2016
Oct  6 10:46:14 oc1068714158 kernel: task: ffffffff81951440 ti: ffffffff8193c000 task.ti: ffffffff8193c000
Oct  6 10:46:14 oc1068714158 kernel: RIP: 0010:[<ffffffff8135e1a7>]  [<ffffffff8135e1a7>] intel_idle+0xd7/0x160
Oct  6 10:46:14 oc1068714158 kernel: RSP: 0018:ffffffff8193fe18  EFLAGS: 00000046
Oct  6 10:46:14 oc1068714158 kernel: RAX: 0000000000000050 RBX: 0000000000000040 RCX: 0000000000000001
Oct  6 10:46:14 oc1068714158 kernel: RDX: 0000000000000000 RSI: ffffffff8193ffd8 RDI: 000000000194a000
Oct  6 10:46:14 oc1068714158 kernel: RBP: ffffffff8193fe48 R08: 000000000000a98e R09: 0000000000000018
Oct  6 10:46:14 oc1068714158 kernel: R10: 0000000000002a65 R11: 0000000000000000 R12: ffffffff8193ffd8
Oct  6 10:46:14 oc1068714158 kernel: R13: 0000000000000007 R14: 0000000000000050 R15: ffffffff819fef40
Oct  6 10:46:14 oc1068714158 kernel: FS:  0000000000000000(0000) GS:ffff88022dc00000(0000) knlGS:0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct  6 10:46:14 oc1068714158 kernel: CR2: 00007f2aaa841000 CR3: 000000000194a000 CR4: 00000000003407f0
Oct  6 10:46:14 oc1068714158 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Oct  6 10:46:14 oc1068714158 kernel: Stack:
Oct  6 10:46:14 oc1068714158 kernel: 000000008193fe48 ba5f6f28b1056d45 ffff88022dc1dd00 ffffffff819fecc0
Oct  6 10:46:14 oc1068714158 kernel: 00000d33c0fb387d 0000000000000007 ffffffff8193fe80 ffffffff814d4ab0
Oct  6 10:46:14 oc1068714158 kernel: ffff88022dc1dd00 0000000000000007 0000000000000007 ffffffff819fecc0
Oct  6 10:46:14 oc1068714158 kernel: Call Trace:
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff814d4ab0>] cpuidle_enter_state+0x40/0xc0
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff814d4c09>] cpuidle_idle_call+0xd9/0x210
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8101e4ee>] arch_cpu_idle+0xe/0x30
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810d64a5>] cpu_startup_entry+0x245/0x290
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81625f67>] rest_init+0x77/0x80
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81a90057>] start_kernel+0x429/0x44a
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81a8fa37>] ? repair_env_string+0x5c/0x5c
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81a8f120>] ? early_idt_handlers+0x120/0x120
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81a8f5ee>] x86_64_start_reservations+0x2a/0x2c
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81a8f742>] x86_64_start_kernel+0x152/0x175
Oct  6 10:46:14 oc1068714158 kernel: Code: 31 d2 65 48 8b 34 25 b8 b7 00 00 48 89 d1 48 8d 86 38 c0 ff ff 0f 01 c8 48 8b 86 38 c0 ff ff a8 08 75 08 b1 01 4c 89 f0 0f 01 c9 <65> 48 8b 04 25 b8 b7 00 00 f0 80 a0 3a c0 ff ff 7f 85 1d fa 0a 
Oct  6 10:46:14 oc1068714158 kernel: NMI backtrace for cpu 1
Oct  6 10:46:14 oc1068714158 kernel: CPU: 1 PID: 19341 Comm: kworker/1:0 Tainted: P           OE  ------------   3.10.0-327.36.1.el7.x86_64 #1
Oct  6 10:46:14 oc1068714158 kernel: Hardware name: LENOVO 20BTS0WX15/20BTS0WX15, BIOS N14ET37W (1.15 ) 09/06/2016
Oct  6 10:46:14 oc1068714158 kernel: Workqueue: events e1000e_systim_overflow_work [e1000e]
Oct  6 10:46:14 oc1068714158 kernel: task: ffff880121b6f300 ti: ffff88005d62c000 task.ti: ffff88005d62c000
Oct  6 10:46:14 oc1068714158 kernel: RIP: 0010:[<ffffffffa0155b1d>]  [<ffffffffa0155b1d>] e1000e_cyclecounter_read+0x1d/0xd0 [e1000e]
Oct  6 10:46:14 oc1068714158 kernel: RSP: 0018:ffff88005d62fda0  EFLAGS: 00000006
Oct  6 10:46:14 oc1068714158 kernel: RAX: 00000000ffffffff RBX: ffff88021ce23860 RCX: 0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: RDX: 0000000000000000 RSI: ffff88005d62fdf8 RDI: ffff88021ce23848
Oct  6 10:46:14 oc1068714158 kernel: RBP: ffff88005d62fda0 R08: 0000000000000246 R09: dff714793ea23790
Oct  6 10:46:14 oc1068714158 kernel: R10: dff714793ea23790 R11: 0000000000000001 R12: ffff88021ce23890
Oct  6 10:46:14 oc1068714158 kernel: R13: ffff88021ce23840 R14: 0000000000000246 R15: 0000000000000040
Oct  6 10:46:14 oc1068714158 kernel: FS:  0000000000000000(0000) GS:ffff88022dc40000(0000) knlGS:0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct  6 10:46:14 oc1068714158 kernel: CR2: 00007f688f3974a0 CR3: 000000000194a000 CR4: 00000000003407e0
Oct  6 10:46:14 oc1068714158 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Oct  6 10:46:14 oc1068714158 kernel: Stack:
Oct  6 10:46:14 oc1068714158 kernel: ffff88005d62fdb8 ffffffff810dc7c5 ffff88005d62fdf8 ffff88005d62fde8
Oct  6 10:46:14 oc1068714158 kernel: ffffffffa01627ef ffff88021ce23790 ffff88021fcd9200 ffff88022dc56000
Oct  6 10:46:14 oc1068714158 kernel: ffff88022dc5a300 ffff88005d62fe18 ffffffffa01629e1 ffffffff8163b3b8
Oct  6 10:46:14 oc1068714158 kernel: Call Trace:
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810dc7c5>] timecounter_read+0x15/0x60
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffffa01627ef>] e1000e_phc_gettime+0x2f/0x80 [e1000e]
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffffa01629e1>] e1000e_systim_overflow_work+0x31/0xa0 [e1000e]
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8163b3b8>] ? __schedule+0x2d8/0x900
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8109d69b>] process_one_work+0x17b/0x470
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8109e46b>] worker_thread+0x11b/0x400
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8109e350>] ? rescuer_thread+0x400/0x400
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810a5b8f>] kthread+0xcf/0xe0
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810a5ac0>] ? kthread_create_on_node+0x140/0x140
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81646958>] ret_from_fork+0x58/0x90
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810a5ac0>] ? kthread_create_on_node+0x140/0x140
Oct  6 10:46:14 oc1068714158 kernel: Code: 00 00 00 0f 4e c8 e9 63 ff ff ff 0f 1f 00 0f 1f 44 00 00 55 48 89 e5 0f 1f 80 00 00 00 00 48 8b 87 f8 d5 ff ff 8b 80 00 b6 00 00 <89> c0 48 3d ff ff ff 3f 77 e9 48 8b 97 f8 d5 ff ff 8b 8a 04 b6 
Oct  6 10:46:14 oc1068714158 kernel: NMI backtrace for cpu 2
Oct  6 10:46:14 oc1068714158 kernel: CPU: 2 PID: 0 Comm: swapper/2 Tainted: P           OE  ------------   3.10.0-327.36.1.el7.x86_64 #1
Oct  6 10:46:14 oc1068714158 kernel: Hardware name: LENOVO 20BTS0WX15/20BTS0WX15, BIOS N14ET37W (1.15 ) 09/06/2016
Oct  6 10:46:14 oc1068714158 kernel: task: ffff880222503980 ti: ffff88022251c000 task.ti: ffff88022251c000
Oct  6 10:46:14 oc1068714158 kernel: RIP: 0010:[<ffffffff8135e1a7>]  [<ffffffff8135e1a7>] intel_idle+0xd7/0x160
Oct  6 10:46:14 oc1068714158 kernel: RSP: 0018:ffff88022251fe10  EFLAGS: 00000046
Oct  6 10:46:14 oc1068714158 kernel: RAX: 0000000000000032 RBX: 0000000000000010 RCX: 0000000000000001
Oct  6 10:46:14 oc1068714158 kernel: RDX: 0000000000000000 RSI: ffff88022251ffd8 RDI: 000000000194a000
Oct  6 10:46:14 oc1068714158 kernel: RBP: ffff88022251fe40 R08: 0000000000001e3a R09: 0000000000000018
Oct  6 10:46:14 oc1068714158 kernel: R10: 0000000000001682 R11: 0000000000000000 R12: ffff88022251ffd8
Oct  6 10:46:14 oc1068714158 kernel: R13: 0000000000000005 R14: 0000000000000032 R15: ffffffff819fee90
Oct  6 10:46:14 oc1068714158 kernel: FS:  0000000000000000(0000) GS:ffff88022dc80000(0000) knlGS:0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct  6 10:46:14 oc1068714158 kernel: CR2: 000018c49c372028 CR3: 000000000194a000 CR4: 00000000003407e0
Oct  6 10:46:14 oc1068714158 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Oct  6 10:46:14 oc1068714158 kernel: Stack:
Oct  6 10:46:14 oc1068714158 kernel: 000000022251fe40 5adf106a9ea165ca ffff88022dc9dd00 ffffffff819fecc0
Oct  6 10:46:14 oc1068714158 kernel: 00000d33c1f569ea 0000000000000005 ffff88022251fe78 ffffffff814d4ab0
Oct  6 10:46:14 oc1068714158 kernel: ffff88022dc9dd00 0000000000000005 0000000000000005 ffffffff819fecc0
Oct  6 10:46:14 oc1068714158 kernel: Call Trace:
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff814d4ab0>] cpuidle_enter_state+0x40/0xc0
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff814d4c09>] cpuidle_idle_call+0xd9/0x210
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8101e4ee>] arch_cpu_idle+0xe/0x30
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810d64a5>] cpu_startup_entry+0x245/0x290
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8104768a>] start_secondary+0x1ba/0x230
Oct  6 10:46:14 oc1068714158 kernel: Code: 31 d2 65 48 8b 34 25 b8 b7 00 00 48 89 d1 48 8d 86 38 c0 ff ff 0f 01 c8 48 8b 86 38 c0 ff ff a8 08 75 08 b1 01 4c 89 f0 0f 01 c9 <65> 48 8b 04 25 b8 b7 00 00 f0 80 a0 3a c0 ff ff 7f 85 1d fa 0a 
Oct  6 10:46:14 oc1068714158 kernel: NMI backtrace for cpu 3
Oct  6 10:46:14 oc1068714158 kernel: CPU: 3 PID: 0 Comm: swapper/3 Tainted: P           OE  ------------   3.10.0-327.36.1.el7.x86_64 #1
Oct  6 10:46:14 oc1068714158 kernel: Hardware name: LENOVO 20BTS0WX15/20BTS0WX15, BIOS N14ET37W (1.15 ) 09/06/2016
Oct  6 10:46:14 oc1068714158 kernel: task: ffff880222504500 ti: ffff880222520000 task.ti: ffff880222520000
Oct  6 10:46:14 oc1068714158 kernel: RIP: 0010:[<ffffffff8101c800>]  [<ffffffff8101c800>] hw_breakpoint_pmu_read+0x10/0x10
Oct  6 10:46:14 oc1068714158 kernel: RSP: 0018:ffff88022dcc3d90  EFLAGS: 00000046
Oct  6 10:46:14 oc1068714158 kernel: RAX: 0000000000000003 RBX: 00000000a6364244 RCX: 0000000000000008
Oct  6 10:46:14 oc1068714158 kernel: RDX: 00000000a636426a RSI: 0000000000000008 RDI: 0000000000230367
Oct  6 10:46:14 oc1068714158 kernel: RBP: ffff88022dcc3db0 R08: 0000000000000092 R09: 000000000000073d
Oct  6 10:46:14 oc1068714158 kernel: R10: 0000000000000000 R11: ffff88022dcc3ad6 R12: 0000000000230367
Oct  6 10:46:14 oc1068714158 kernel: R13: 0000000000000003 R14: 0000000000000003 R15: ffffffff819a7040
Oct  6 10:46:14 oc1068714158 kernel: FS:  0000000000000000(0000) GS:ffff88022dcc0000(0000) knlGS:0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct  6 10:46:14 oc1068714158 kernel: CR2: 000018c49c36d000 CR3: 000000000194a000 CR4: 00000000003407e0
Oct  6 10:46:14 oc1068714158 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Oct  6 10:46:14 oc1068714158 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Oct  6 10:46:14 oc1068714158 kernel: Stack:
Oct  6 10:46:14 oc1068714158 kernel: ffffffff8130073a 0000000000002710 0000000000000001 0000000000000008
Oct  6 10:46:14 oc1068714158 kernel: ffff88022dcc3dc0 ffffffff81300688 ffff88022dcc3e00 ffffffff8104b3ca
Oct  6 10:46:14 oc1068714158 kernel: ffffffff8101cd45 ffffffff81a69420 ffffffff819a6f40 ffffffff819a6f40
Oct  6 10:46:14 oc1068714158 kernel: Call Trace:
Oct  6 10:46:14 oc1068714158 kernel: <IRQ> #001d [<ffffffff8130073a>] ? delay_tsc+0x4a/0x80
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81300688>] __const_udelay+0x28/0x30
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8104b3ca>] arch_trigger_all_cpu_backtrace+0x12a/0x2d0
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8101cd45>] ? native_sched_clock+0x35/0x80
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81126e1d>] rcu_check_callbacks+0x5bd/0x610
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810e08c0>] ? tick_sched_handle.isra.14+0x60/0x60
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8108e967>] update_process_times+0x47/0x80
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810e0885>] tick_sched_handle.isra.14+0x25/0x60
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810e0901>] tick_sched_timer+0x41/0x70
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810a9db2>] __hrtimer_run_queues+0xd2/0x260
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810aa350>] hrtimer_interrupt+0xb0/0x1e0
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810495c7>] local_apic_timer_interrupt+0x37/0x60
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff81648f8f>] smp_apic_timer_interrupt+0x3f/0x60
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8164765d>] apic_timer_interrupt+0x6d/0x80
Oct  6 10:46:14 oc1068714158 kernel: <EOI> #001d [<ffffffff8108e79c>] ? get_next_timer_interrupt+0xec/0x270
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff814d4ac2>] ? cpuidle_enter_state+0x52/0xc0
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff814d4c09>] cpuidle_idle_call+0xd9/0x210
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8101e4ee>] arch_cpu_idle+0xe/0x30
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff810d64a5>] cpu_startup_entry+0x245/0x290
Oct  6 10:46:14 oc1068714158 kernel: [<ffffffff8104768a>] start_secondary+0x1ba/0x230
Oct  6 10:46:14 oc1068714158 kernel: Code: 48 c7 43 f8 00 00 00 00 41 83 ec 01 75 e6 5b 41 5c 5d c3 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 5d c3 0f 1f 44 00 00 <55> 48 89 e5 0f 31 89 c0 48 c1 e2 20 48 09 c2 48 89 d0 5d c3 66
Comment 3 yann 2016-10-06 09:53:03 UTC
Jozef, can you attach gpu crash dump (located at /sys/class/drm/card0/error) to confirm this is same issue. And please share also which version of Mesa you are using.
Comment 4 yann 2016-11-18 13:44:24 UTC
(In reply to yann from comment #3)
> Jozef, can you attach gpu crash dump (located at /sys/class/drm/card0/error)
> to confirm this is same issue. And please share also which version of Mesa
> you are using.

Timeout. Assuming that it is fixed by now. If this is not the case, please re-test with latest kernel & Mesa (12-13) to see if this issue is still occurring since there were improvements pushed in kernel and Mesa that will benefit to your system, and fill a new bug.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.