Bug 102426 - [SNB] WARNING: CPU: 1 PID: 1630 at drivers/gpu/drm/i915/intel_uncore.c:224 __gen6_gt_wait_for_fifo+0xb5/0xc0 [i915]
Summary: [SNB] WARNING: CPU: 1 PID: 1630 at drivers/gpu/drm/i915/intel_uncore.c:224 __...
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2017-08-27 10:29 UTC by samuel.rakitnican
Modified: 2018-03-29 07:29 UTC (History)
1 user (show)

See Also:
i915 platform: SNB
i915 features: GEM/Other


Attachments
Full log from journalctl at the time it happens (14.65 KB, application/gzip)
2017-08-27 10:29 UTC, samuel.rakitnican
no flags Details
journalctl --dmesg (24.01 KB, application/gzip)
2017-08-27 11:32 UTC, samuel.rakitnican
no flags Details

Description samuel.rakitnican 2017-08-27 10:29:22 UTC
Created attachment 133811 [details]
Full log from journalctl at the time it happens

Happens when user is logged in and using the DE.

Kernel: 4.11.4-1.el7.elrepo.x86_64
OS: CentOS 7
CPU: i3-2100


Aug 25 16:25:16 hostname kernel: WARNING: CPU: 1 PID: 1630 at drivers/gpu/drm/i915/intel_uncore.c:224 __gen6_gt_wait_for_fifo+0xb5/0xc0 [i915]
Aug 25 16:25:16 hostname kernel: WARN_ON(loop < 0 && fifo <= 20)
Aug 25 16:25:16 hostname kernel: Modules linked in: nls_utf8 isofs fuse usblp rpcsec_gss_krb5 xt_set xt_multiport xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun ip_set_hash_ip nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_rpfilter ipt_REJECT nf_reject_ipv4 ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink ebtable_nat ebtable_broute ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack libcrc32c iptable_mangle iptable_security iptable_raw ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter dm_mirror dm_region_hash dm_log dm_mod vfat fat rc_hauppauge ir_kbd_i2c x86_pkg_temp_thermal intel_powerclamp tuner_simple tuner_types coretemp kvm_intel kvm irqbypass tuner
Aug 25 16:25:16 hostname kernel:  crct10dif_pclmul crc32_pclmul tda7432 tvaudio ghash_clmulni_intel cryptd snd_hda_codec_hdmi intel_cstate msp3400 iTCO_wdt ppdev iTCO_vendor_support snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel intel_rapl_perf snd_hda_codec snd_hda_core snd_bt87x snd_hwdep snd_seq snd_seq_device bttv snd_pcm tea575x tveeprom videobuf_dma_sg videobuf_core pcspkr v4l2_common snd_timer input_leds videodev i2c_i801 parport_pc nuvoton_cir sg mei_me snd parport soundcore rc_core mei shpchp lpc_ich mfd_core nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables ext4 jbd2 mbcache sr_mod cdrom sd_mod i915 ahci libahci video i2c_algo_bit serio_raw drm_kms_helper syscopyarea sysfillrect r8169 sysimgblt fb_sys_fops libata mii crc32c_intel drm floppy
Aug 25 16:25:16 hostname kernel: CPU: 1 PID: 1630 Comm: Xorg Not tainted 4.11.4-1.el7.elrepo.x86_64 #1
Aug 25 16:25:16 hostname kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./H67M, BIOS P1.40 02/18/2011
Aug 25 16:25:16 hostname kernel: Call Trace:
Aug 25 16:25:16 hostname kernel:  dump_stack+0x63/0x87
Aug 25 16:25:16 hostname kernel:  __warn+0xd1/0xf0
Aug 25 16:25:16 hostname kernel:  warn_slowpath_fmt+0x5f/0x80
Aug 25 16:25:16 hostname kernel:  __gen6_gt_wait_for_fifo+0xb5/0xc0 [i915]
Aug 25 16:25:16 hostname kernel:  gen6_write32+0x129/0x180 [i915]
Aug 25 16:25:16 hostname kernel:  gen6_irq_disable+0x33/0x50 [i915]
Aug 25 16:25:16 hostname kernel:  irq_disable+0x25/0x40 [i915]
Aug 25 16:25:16 hostname kernel:  __intel_breadcrumbs_disable_irq+0x3f/0x50 [i915]
Aug 25 16:25:16 hostname kernel:  intel_engine_remove_wait+0x1a5/0x1b0 [i915]
Aug 25 16:25:16 hostname kernel:  i915_wait_request+0x1ab/0x450 [i915]
Aug 25 16:25:16 hostname kernel:  ? remove_wait_queue+0x60/0x60
Aug 25 16:25:16 hostname kernel:  i915_gem_object_wait_fence+0x7b/0x170 [i915]
Aug 25 16:25:16 hostname kernel:  i915_gem_object_wait_reservation+0x169/0x1d0 [i915]
Aug 25 16:25:16 hostname kernel:  i915_gem_set_domain_ioctl+0xa3/0x230 [i915]
Aug 25 16:25:16 hostname kernel:  drm_ioctl+0x2b7/0x4c0 [drm]
Aug 25 16:25:16 hostname kernel:  ? __set_task_blocked+0x3c/0xa0
Aug 25 16:25:16 hostname kernel:  ? i915_gem_obj_prepare_shmem_write+0x1c0/0x1c0 [i915]
Aug 25 16:25:16 hostname kernel:  ? do_signal+0x198/0x6a0
Aug 25 16:25:16 hostname kernel:  do_vfs_ioctl+0xa7/0x5f0
Aug 25 16:25:16 hostname kernel:  ? trace_event_raw_event_signal_deliver+0xeb/0x100
Aug 25 16:25:16 hostname kernel:  SyS_ioctl+0x79/0x90
Aug 25 16:25:16 hostname kernel:  do_syscall_64+0x67/0x180
Aug 25 16:25:16 hostname kernel:  entry_SYSCALL64_slow_path+0x25/0x25
Aug 25 16:25:16 hostname kernel: RIP: 0033:0x7fc69f67f537
Aug 25 16:25:16 hostname kernel: RSP: 002b:00007ffd1f760608 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
Aug 25 16:25:16 hostname kernel: RAX: ffffffffffffffda RBX: 00007fc6a1b5f700 RCX: 00007fc69f67f537
Aug 25 16:25:16 hostname kernel: RDX: 00007ffd1f760640 RSI: 00000000400c645f RDI: 0000000000000009
Aug 25 16:25:16 hostname kernel: RBP: 00007ffd1f760640 R08: 000000000000000c R09: 00000000ffffffff
Aug 25 16:25:16 hostname kernel: R10: 0000000000000000 R11: 0000000000003246 R12: 00000000400c645f
Aug 25 16:25:16 hostname kernel: R13: 0000000000000009 R14: 0000563feb6b41a0 R15: 0000563feb6b04c0


See also: bug #54226
Comment 1 Chris Wilson 2017-08-27 11:13:54 UTC
The GPU died completely, we could do with the logs earlier to this as well.

Aug 25 16:25:16 hostname kernel: [drm:fw_domains_get [i915]] *ERROR* render: timed out waiting for forcewake ack request.
Aug 25 16:25:16 hostname kernel: [drm:__gen6_gt_wait_for_thread_c0 [i915]] *ERROR* GT thread status wait timed out
Aug 25 16:25:16 hostname kernel: [drm:fw_domains_get [i915]] *ERROR* render: timed out waiting for forcewake ack request.
Aug 25 16:25:16 hostname kernel: [drm:__gen6_gt_wait_for_thread_c0 [i915]] *ERROR* GT thread status wait timed out

But what caused those?...
Comment 2 samuel.rakitnican 2017-08-27 11:32:13 UTC
Created attachment 133813 [details]
journalctl --dmesg

Attached full kernel log for the current boot. Not sure if there is something else there except this two lines which seems the same so not sure what you are looking for.

Jul 28 23:31:10 hostname kernel: [drm:fw_domains_get [i915]] *ERROR* render: timed out waiting for forcewake ack request.
Jul 28 23:31:10 hostname kernel: [drm:__gen6_gt_wait_for_thread_c0 [i915]] *ERROR* GT thread status wait timed out

Please note that this machine is mostly just sitting at GDM login screen is used mostly as server.
Comment 3 samuel.rakitnican 2017-08-27 11:53:26 UTC
As what happened prior to segfault, I've logged in into GNOME desktop, and opened Vinagre and connected to a local VNC server, and maybe even opened gnome-terminal. This happened few seconds after. I am not sure if it is reproducible with these actions, probably not from my past experience.
Comment 4 Chris Wilson 2017-08-27 12:00:00 UTC
I was hoping it was preceded by a GPU hang -- but that report still be pending, do you also have the next couple of minutes of dmesg?
Comment 5 samuel.rakitnican 2017-08-27 12:10:56 UTC
The attached log at comment 2 is full log, there is nothing after that. Machine is still running.
Comment 6 Elizabeth 2017-10-26 20:52:40 UTC
Hello Samuel, any change with latest stable or mainline?? https://www.kernel.org
Comment 7 Jani Saarinen 2018-03-29 07:10:51 UTC
First of all. Sorry about spam.
This is mass update for our bugs. 

Sorry if you feel this annoying but with this trying to understand if bug still valid or not.
If bug investigation still in progress, please ignore this and I apologize!

If you think this is not anymore valid, please comment to the bug that can be closed.
If you haven't tested with our latest pre-upstream tree(drm-tip), can you do that also to see if issue is valid there still and if you cannot see issue there, please comment to the bug.
Comment 8 samuel.rakitnican 2018-03-29 07:16:40 UTC
I think this was solved by disabling semaphores: 
    https://bugs.freedesktop.org/show_bug.cgi?id=54226#c264

I can't see the issue anymore on newer kernels. 

# uname -r
4.15.12-1.el7.elrepo.x86_64
Comment 9 Jani Saarinen 2018-03-29 07:29:25 UTC
OK, thanks you for fast response. Resolving with this information


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.