Bug 112144

Summary: Crash on i915
Product: DRI
Reporter: Brian J. Murrell <brian.murrell>
Component: DRM/Intel
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: RESOLVED NOTOURBUG
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: blocker
Priority: not set
CC: intel-gfx-bugs
Version: XOrg git
Hardware: x86-64 (AMD64)
OS: Linux (All)
Whiteboard:
i915 platform:
i915 features:

Description Brian J. Murrell 2019-10-27 21:19:46 UTC
Per dmesg:

GPU HANG: ecode 7:1:0xfffffffe, in gnome-shell [23430], hang on rcs0
Kernel: 5.3.6-200.fc30.x86_64 x86_64
Time: 1572202521 s 617897 us
Boottime: 201042 s 6705 us
Uptime: 201039 s 772255 us
Epoch: 4495703040 jiffies (1000 HZ)
Capture: 4495709056 jiffies; 7734088 ms ago, 6016 ms after epoch
Active process (on ring rcs0): gnome-shell [23430]
Reset count: 0
Suspend count: 0
Platform: HASWELL
Subplatform: 0x0
PCI ID: 0x0402
PCI Revision: 0x06
PCI Subsystem: 1458:d000
IOMMU enabled?: 0
GT awake: yes
RPM wakelock: yes
PM suspended: no
EIR: 0x00000000
IER: 0xfc080421
GTIER[0]: 0x00401821
PGTBL_ER: 0x00000000
FORCEWAKE: 0x00000001
DERRMR: 0xffffffff
CCID: 0x7fba010d
  fence[0] = 00000000
  fence[1] = 00000000
  fence[2] = 00000000
  fence[3] = 6bf3053057f0001
  fence[4] = 00000000
  fence[5] = 00000000
  fence[6] = 00000000
  fence[7] = 00000000
  fence[8] = 00000000
  fence[9] = 00000000
  fence[10] = 00000000
  fence[11] = 00000000
  fence[12] = 00000000
  fence[13] = 00000000
  fence[14] = 00000000
  fence[15] = 00000000
  fence[16] = 00000000
  fence[17] = 00000000
  fence[18] = 00000000
  fence[19] = 00000000
  fence[20] = 00000000
  fence[21] = 00000000
  fence[22] = 00000000
  fence[23] = 00000000
  fence[24] = 00000000
  fence[25] = 00000000
  fence[26] = 00000000
  fence[27] = 00000000
  fence[28] = 00000000
  fence[29] = 00000000
  fence[30] = 00000000
  fence[31] = 00000000
ERROR: 0x00000001
DONE_REG: 0xffffffff
ERR_INT: 0x00000000
rcs0 command stream:
  IDLE?: no
  START: 0x00001000
  HEAD:  0xa520fd48 [0x0000fd48]
  TAIL:  0x000102c0 [0x0000fe48, 0x0000fe60]
  CTL:   0x0001f001
  MODE:  0x00004000
  HWS:   0x7fffe000
  ACTHD: 0x00000000 a520fd48
  IPEIR: 0x00000000
  IPEHR: 0x780c0000
  INSTDONE: 0xffcfffff
  SC_INSTDONE: 0xffffffff
  SAMPLER_INSTDONE[0][0]: 0xffffffff
  ROW_INSTDONE[0][0]: 0xffffffff
  BBADDR: 0x00000000_073fe32c
  BB_STATE: 0x00000000
  INSTPS: 0x80000100
  INSTPM: 0x00006080
  FADDR: 0x00000000 00010f00
  RC PSMI: 0x00000010
  FAULT_REG: 0x800050c1
  GFX_MODE: 0x00002a00
  PP_DIR_BASE: 0x03fe0000
  ring->head: 0x0000fd48
  ring->tail: 0x000102c0
  hangcheck timestamp: 0ms (4495703040; epoch)
  engine reset count: 0
  Active context: gnome-shell[23430] hw_id 0, prio 0, guilty 0 active 0
bcs0 command stream:
  IDLE?: yes
  START: 0x00021000
  HEAD:  0x00000618 [0x00000000]
  TAIL:  0x00000618 [0x00000000, 0x00000000]
  CTL:   0x0001f001
  MODE:  0x00000200
  HWS:   0x7fffd000
  ACTHD: 0x00000000 00000618
  IPEIR: 0x00000000
  IPEHR: 0x00000000
  INSTDONE: 0xfffffffe
  BBADDR: 0x00000000_00000000
  BB_STATE: 0x00000000
  INSTPS: 0x00000000
  INSTPM: 0x00000000
  FADDR: 0x00000000 00021618
  RC PSMI: 0x00000010
  FAULT_REG: 0x00000000
  GFX_MODE: 0x00000200
  PP_DIR_BASE: 0x7fde0000
  ring->head: 0x00000000
  ring->tail: 0x00000000
  hangcheck timestamp: -200735744ms (0)
  engine reset count: 0
  Active context: [0] hw_id 0, prio 0, guilty 0 active 0
vcs0 command stream:
  IDLE?: yes
  START: 0x00041000
  HEAD:  0x00000618 [0x00000000]
  TAIL:  0x00000618 [0x00000000, 0x00000000]
  CTL:   0x0001f001
  MODE:  0x00000200
  HWS:   0x7fffc000
  ACTHD: 0x00000000 00000618
  IPEIR: 0x00000000
  IPEHR: 0x00000000
  INSTDONE: 0xfffffffe
  BBADDR: 0x00000000_00000000
  BB_STATE: 0x00000000
  INSTPS: 0x00000000
  INSTPM: 0x00000000
  FADDR: 0x00000000 00041618
  RC PSMI: 0x00000010
  FAULT_REG: 0x00000000
  GFX_MODE: 0x00000200
  PP_DIR_BASE: 0x7fde0000
  ring->head: 0x00000000
  ring->tail: 0x00000000
  hangcheck timestamp: -200735744ms (0)
  engine reset count: 0
  Active context: [0] hw_id 0, prio 0, guilty 0 active 0
vecs0 command stream:
  IDLE?: yes
  START: 0x00061000
  HEAD:  0x00000618 [0x00000000]
  TAIL:  0x00000618 [0x00000000, 0x00000000]
  CTL:   0x0001f001
  MODE:  0x00000200
  HWS:   0x7fffb000
  ACTHD: 0x00000000 00000618
  IPEIR: 0x00000000
  IPEHR: 0x00000000
  INSTDONE: 0xfffffffe
  BBADDR: 0x00000000_00000000
  BB_STATE: 0x00000000
  INSTPS: 0x00000000
  INSTPM: 0x00000000
  FADDR: 0x00000000 00061618
  RC PSMI: 0x00000010
  FAULT_REG: 0x00000000
  GFX_MODE: 0x00000200
  PP_DIR_BASE: 0x7fde0000
  ring->head: 0x00000000
  ring->tail: 0x00000000
  hangcheck timestamp: -200735744ms (0)
  engine reset count: 0
  Active context: [0] hw_id 0, prio 0, guilty 0 active 0
Active (rcs0) [76]:
    00000000_0b1eb000  5242880 3e 02 dirty LLC
    00000000_0452e000  8388608 3f 00 Y dirty LLC
    00000000_0b6eb000  1835008 3f 00 X dirty LLC
    00000000_0cc09000     4096 3f 00 Y dirty LLC
    00000000_169bb000     8192 3f 00 Y dirty LLC
    00000000_0b196000   114688 3f 00 X dirty LLC
    00000000_0eb27000    40960 3f 00 Y dirty LLC
    00000000_105f3000  1310720 3f 00 X dirty LLC
    00000000_14cfa000 20971520 3e 02 Y dirty LLC
    00000000_174fa000  2621440 3e 02 Y dirty LLC
    00000000_174f5000    16384 3f 00 dirty LLC
    00000000_160fa000  8388608 3f 00 X dirty LLC
    00000000_1897a000  2097152 3f 00 Y dirty LLC
    00000000_0bfca000  4194304 3e 00 X dirty LLC
    00000000_0cc0a000  1310720 3f 00 Y dirty LLC
    00000000_168fa000    12288 3f 00 Y dirty LLC
    00000000_00000000     4096 3e 02 dirty LLC
    00000000_00005000    16384 3f 00 dirty LLC
    00000000_0f29c000   458752 3e 02 Y dirty LLC
    00000000_0f35c000   327680 3e 02 Y dirty LLC
    00000000_0f4f1000  8388608 3f 00 Y dirty LLC
    00000000_0b9f1000    16384 3f 00 dirty purgeable LLC
    00000000_030bb000     8192 3e 02 Y dirty LLC
    00000000_0c42c000     4096 3e 02 Y dirty LLC
    00000000_169ea000     4096 3e 02 Y dirty LLC
    00000000_014ba000    12288 3e 02 X dirty LLC
    00000000_0ba3f000     4096 3e 02 Y dirty LLC
    00000000_169e9000     4096 3e 02 Y dirty LLC
    00000000_16f6f000     4096 3e 02 dirty LLC
    00000000_0c3ca000   327680 3e 02 Y dirty LLC
    00000000_0b9fc000    40960 3e 02 Y dirty LLC
    00000000_01bfb000    81920 3e 02 dirty LLC
    00000000_04d2e000  4194304 3f 00 Y dirty LLC
    00000000_0f45a000     4096 3f 00 dirty LLC
    00000000_107f4000     4096 3f 00 Y dirty LLC
    00000000_0f45f000     4096 3f 00 Y dirty LLC
    00000000_0fcf1000     4096 3f 00 Y dirty LLC
    00000000_0fcf2000     4096 3f 00 Y dirty LLC
    00000000_0f459000     4096 3e 00 Y dirty LLC
    00000000_174f9000     4096 3e 00 Y dirty LLC
    00000000_030ba000     4096 3e 02 Y dirty LLC
    00000000_0f4f0000     4096 3e 00 Y dirty LLC
    00000000_0c42b000     4096 3f 00 Y dirty LLC
    00000000_19271000     4096 3f 00 Y dirty LLC
    00000000_0c42a000     4096 3e 00 Y dirty LLC
    00000000_01c2d000 20971520 3e 02 X dirty LLC
    00000000_169e6000     4096 3e 00 dirty LLC
    00000000_0f4de000     8192 3e 02 Y LLC
    00000000_014b9000     4096 3f 00 LLC
    00000000_0c4dd000    12288 3f 00 LLC
    00000000_0f460000   458752 3e 02 Y LLC
    00000000_0f4d0000    57344 3e 02 Y LLC
    00000000_0f324000   114688 3e 02 LLC
    00000000_18793000     4096 3f 00 Y LLC
    00000000_18794000     4096 3f 00 Y LLC
    00000000_053f3000    81920 3f 00 LLC
    00000000_0a677000  1310720 3f 00 X LLC
    00000000_031fd000  4194304 3f 00 X LLC
    00000000_0512e000  1048576 3f 00 Y LLC
    00000000_0b9f5000    20480 3f 00 dirty purgeable LLC
    00000000_0c41a000    65536 3f 00 LLC
    00000000_0b9fa000     4096 3f 00 LLC
    00000000_0be0a000  1835008 3f 00 X LLC
    00000000_1927a000    12288 3f 00 Y LLC
    00000000_0c660000  5242880 3f 00 X LLC
    00000000_18e5a000  1310720 3f 00 Y LLC
    00000000_0b8ac000  1310720 3f 00 X LLC
    00000000_09e62000  6291456 3f 00 X LLC
    00000000_0c4e0000  1572864 3f 00 Y LLC
    00000000_169ec000  1835008 3f 00 X LLC
    00000000_0f455000    16384 3f 00 Y LLC
    00000000_1777a000  6291456 3e 02 X LLC
    00000000_0d5aa000  1572864 3f 00 Y LLC
    00000000_0b9ec000    20480 3f 00 dirty LLC
    00000000_0f45b000    16384 3f 00 purgeable LLC
    00000000_0f4e0000    65536 3f 00 LLC
Pinned (global) [18]:
    00000000_7ffff000     4096 41 00 LLC
    00000000_7fffe000     4096 01 01 purgeable LLC
    00000000_7fffd000     4096 01 01 purgeable LLC
    00000000_7fffc000     4096 01 01 purgeable LLC
    00000000_7fffb000     4096 01 01 purgeable LLC
    00000000_00001000   131072 41 00 LLC
    00000000_7ffea000    69632 01 01 dirty LLC
    00000000_00021000   131072 41 00 LLC
    00000000_00041000   131072 41 00 LLC
    00000000_00061000   131072 41 00 LLC
    00000000_00081000  9216000 41 00 uncached
    00000000_057f0000 20971520 3e 00 X dirty uncached (fence: 3)
    00000000_06bf4000  8388608 3e 02 X dirty uncached
    00000000_0094b000   262144 41 00 dirty uncached
    00000000_033e0000    69632 01 01 dirty LLC
    00000000_7fdc0000    69632 01 01 dirty LLC
    00000000_7ffe4000    20480 3f 00 dirty purgeable LLC
    00000000_7fba0000    69632 01 01 dirty LLC
rcs0 --- 5 requests
  pid 23430, seqno        4:009bd81d+, prio -2147483648, emitted 99ms, start 00001000, head 0000fd48, tail 0000fe60
  pid 23073, seqno        4:009bd81e+, prio -2147483648, emitted 100ms, start 00001000, head 0000fe60, tail 0000ff78
  pid 28719, seqno        4:009bd81f, prio -2147483648, emitted 100ms, start 00001000, head 0000ff78, tail 00010090
  pid 23430, seqno        4:009bd820, prio -2147483648, emitted 552ms, start 00001000, head 00010090, tail 000101a8
  pid 23073, seqno        4:009bd821+, prio -2147483648, emitted 656ms, start 00001000, head 000101a8, tail 000102c0
Num Pipes: 2
PWR_WELL_CTL2: c0000000
Pipe [0]:
  Power: on
  SRC: 063f04af
  STAT: 00000000
Plane [0]:
  CNTR: d8000400
  STRIDE: 00002a00
  SURF: 05cb2000
  TILEOFF: 00070000
Cursor [0]:
  CNTR: 04000023
  POS: 02070196
  BASE: 0094b000
Pipe [1]:
  Power: on
  SRC: 077f0437
  STAT: 00000000
Plane [1]:
  CNTR: d9000400
  STRIDE: 00001e00
  SURF: 06bf4000
  TILEOFF: 00000000
Cursor [1]:
  CNTR: 00000000
  POS: 00000000
  BASE: 00000000
CPU transcoder: A
  Power: on
  CONF: c0000000
  HTOTAL: 086f063f
  HBLANK: 086f063f
  HSYNC: 073f067f
  VTOTAL: 04e104af
  VBLANK: 04e104af
  VSYNC: 04b304b0
CPU transcoder: B
  Power: on
  CONF: c0000000
  HTOTAL: 0897077f
  HBLANK: 0897077f
  HSYNC: 080307d7
  VTOTAL: 04640437
  VBLANK: 04640437
  VSYNC: 0440043b
CPU transcoder: C
  Power: on
  CONF: 00000000
  HTOTAL: 00000000
  HBLANK: 00000000
  HSYNC: 00000000
  VTOTAL: 00000000
  VBLANK: 00000000
  VSYNC: 00000000
CPU transcoder: EDP
  Power: on
  CONF: 00000000
  HTOTAL: 00000000
  HBLANK: 00000000
  HSYNC: 00000000
  VTOTAL: 00000000
  VBLANK: 00000000
  VSYNC: 00000000
is_mobile: no
is_lp: no
require_force_probe: no
has_64bit_reloc: no
gpu_reset_clobbers_display: no
has_reset_engine: no
has_fpga_dbg: yes
has_guc: no
has_l3_dpf: yes
has_llc: yes
has_logical_ring_contexts: no
has_logical_ring_elsq: no
has_logical_ring_preemption: no
has_pooled_eu: no
has_rc6: yes
has_rc6p: no
has_rps: yes
has_runtime_pm: yes
has_snoop: no
has_coherent_ggtt: yes
unfenced_needs_alignment: no
hws_needs_physical: no
cursor_needs_physical: no
has_csr: no
has_ddi: yes
has_dp_mst: yes
has_fbc: yes
has_gmch: no
has_hotplug: yes
has_ipc: no
has_overlay: no
has_psr: yes
overlay_needs_physical: no
supports_tv: no
Has logical contexts? yes
scheduler: 0
slice0: 1 subslice(s) (0x1):
	subslice0: 10 EUs (0x3ff)
i915.vbt_firmware=(null)
i915.modeset=-1
i915.lvds_channel_mode=0
i915.panel_use_ssc=-1
i915.vbt_sdvo_panel_type=-1
i915.enable_dc=-1
i915.enable_fbc=0
i915.enable_psr=-1
i915.disable_power_well=1
i915.enable_ips=1
i915.invert_brightness=0
i915.enable_guc=0
i915.guc_log_level=0
i915.guc_firmware_path=(null)
i915.huc_firmware_path=(null)
i915.dmc_firmware_path=(null)
i915.mmio_debug=1
i915.edp_vswing=0
i915.reset=2
i915.inject_load_failure=0
i915.fastboot=-1
i915.force_probe=
i915.alpha_support=no
i915.enable_hangcheck=yes
i915.prefault_disable=no
i915.load_detect_test=no
i915.force_reset_modeset_test=no
i915.error_capture=yes
i915.disable_display=no
i915.verbose_state_checks=yes
i915.nuclear_pageflip=no
i915.enable_dp_mst=yes
i915.enable_dpcd_backlight=no
i915.enable_gvt=no

Hopefully this is enough info.  Happy to provide whatever else may be needed.
Comment 1 Brian J. Murrell 2019-10-27 21:25:12 UTC
This sort of thing was only hitting randomly every few weeks until just now.

Now I can't stay logged into a GNOME session for more than a few minutes before this hits me.

Obviously this makes my machine pretty useless as this is my main desktop machine.

Any help would be greatly appreciated.
Comment 2 Lakshmi 2019-10-30 11:40:04 UTC
rcs0 command stream:
  IDLE?: no
  START: 0x00001000
  HEAD:  0xa520fd48 [0x0000fd48]
  TAIL:  0x000102c0 [0x0000fe48, 0x0000fe60]
  CTL:   0x0001f001
  MODE:  0x00004000
  HWS:   0x7fffe000
  ACTHD: 0x00000000 a520fd48
  IPEIR: 0x00000000
  IPEHR: 0x780c0000

Considering that HEAD != TAIL and that IPEHR 0x780c0000 is 3DSTATE_VF, is this NOTOURBUG? A Mesa issue?
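
The two checks above can be reproduced from the register values in the dump. The following is a minimal sketch, not the driver's own decoder: the helper name `decode_gfxpipe_header` and the constant `HEAD_ADDR_MASK` are hypothetical, the bit positions assume the usual 3D command header layout (type in bits 31:29, subtype in 28:27, opcode in 26:24, sub-opcode in 23:16), and the mask value is an assumption chosen because it reproduces the bracketed offset `[0x0000fd48]` printed next to HEAD.

```python
# Sketch under stated assumptions: field positions and the mask are not taken
# from this report; only the register values are.

HEAD_ADDR_MASK = 0x001FFFFC  # assumed mask separating the ring offset from the wrap count


def decode_gfxpipe_header(dword):
    """Split a 3D command header dword into its assumed bit fields."""
    return {
        "command_type": (dword >> 29) & 0x7,
        "subtype": (dword >> 27) & 0x3,
        "opcode": (dword >> 24) & 0x7,
        "subopcode": (dword >> 16) & 0xFF,
    }


# Values copied from the rcs0 section of the error state.
HEAD = 0xA520FD48
TAIL = 0x000102C0
IPEHR = 0x780C0000

head_offset = HEAD & HEAD_ADDR_MASK
tail_offset = TAIL & HEAD_ADDR_MASK

# If the offsets differ, the ring still had commands queued past HEAD.
ring_has_pending_work = head_offset != tail_offset

fields = decode_gfxpipe_header(IPEHR)

print(f"head offset: {head_offset:#010x}")
print(f"tail offset: {tail_offset:#010x}")
print(f"pending work: {ring_has_pending_work}")
print(f"IPEHR fields: {fields}")
```

Under these assumptions the header splits into type 3, subtype 3, opcode 0, sub-opcode 0x0c, which is the encoding the comment reads as 3DSTATE_VF; mapping those fields to a command name still requires the hardware command reference.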

Mesa issues can be reported here.
https://gitlab.freedesktop.org/mesa/mesa/issues
