Bug 103381 - GPU hang, Kicking stuck wait on rcs0
Summary: GPU hang, Kicking stuck wait on rcs0
Status: CLOSED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-10-21 00:51 UTC by fkyleung
Modified: 2018-05-04 09:55 UTC (History)
3 users (show)

See Also:
i915 platform: KBL
i915 features: GPU hang


Attachments
GPU crash dump (73.73 KB, text/x-log)
2017-10-21 00:51 UTC, fkyleung
no flags Details

Description fkyleung 2017-10-21 00:51:53 UTC
Created attachment 134954 [details]
GPU crash dump

Following crash report instructions in dmesg. This was encountered during a play session of Starbound through Steam. Working with a Xeon E3 1275 V6.

[78854.543221] [drm] GPU HANG: ecode 9:-1:0x00000000, reason: Kicking stuck wait on rcs0, action: continue
[78854.543222] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
[78854.543222] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
[78854.543223] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
[78854.543223] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
[78854.543224] [drm] GPU crash dump saved to /sys/class/drm/card1/error
[78858.379785] asynchronous wait on fence i915:gnome-shell[26449]/1:2b3d0 timed out
Comment 1 Elizabeth 2017-10-24 14:39:27 UTC
From error state: 

ERROR: 0x00000001
FAULT_TLB_DATA: 0x00000019 0xd4113ff9
    Address 0x00009d4113ff9000 GGTT
DONE_REG: 0xdfffffff
render command stream:
...
  INSTDONE: 0xffdffffe
    busy: CS
...
FAULT_REG: 0x00000819
    Valid
    Invalid PTE Fault
    Engine GFX
    Source ID 3
...
  seqno: 0x0053e773
  last_seqno: 0x0053f04a
  waiting: yes
  ring->head: 0x00001b80
  ring->tail: 0x00002320
  hangcheck stall: no
  hangcheck action: active subunits
  hangcheck action timestamp: 4318531840, 1483370 ms ago
  ELSP[0]:  pid 2391, ban score 0, seqno        2:0053f049, emitted 1483780ms ago, head 00002520, tail 000025a0
  ELSP[1]:  pid 19465, ban score 0, seqno       1e:0053f04a, emitted 1483786ms ago, head 000022a8, tail 00002320
  Active context: starbound[19465] user_handle 1 hw_id 30, ban score 0 guilty 0 active 0
...
batch (rcs0 (submitted by starbound [19465], ctx 1 [30], score 0)) at 0x00000000_000a7000
Bad count in PIPE_CONTROL
0x000a7000:      0x7a000004: PIPE_CONTROL: no write, no depth stall, no RC write flush, no inst flush
0x000a7004:      0x00105021:    destination address
0x000a7008:      0x00000000:    immediate dword low
0x000a700c:      0x00000000:    immediate dword high
Bad length 19 in STATE_BASE_ADDRESS, expected 6-10
0x000a7018:      0x61010011: STATE_BASE_ADDRESS
Bad count in STATE_BASE_ADDRESS
0x000a701c:      0x00000041:    general state base address 0x00000040
0x000a7020:      0x00000000:    surface state base not updated
0x000a7024:      0x00040000:    indirect state base not updated
0x000a7028:      0x000a7041:    general state upper bound 0x000a7040
0x000a702c:      0x00000000:    indirect state upper bound not updated
Bad count in PIPE_CONTROL
0x000a7064:      0x7a000004: PIPE_CONTROL: no write, no depth stall, no RC write flush, no inst flush
0x000a7068:      0x00000c04:    destination address
0x000a706c:      0x00000000:    immediate dword low
0x000a7070:      0x00000000:    immediate dword high
0x000a707c:      0x78230000: 3D UNKNOWN: 3d_965 opcode = 0x7823
0x000a7080:      0x00007fe0: MI_NOOP
0x000a7084:      0x78210000: 3D UNKNOWN: 3d_965 opcode = 0x7821
0x000a7088:      0x00007f80: MI_NOOP
0x000a708c:      0x78240000: 3D UNKNOWN: 3d_965 opcode = 0x7824
0x000a7090:      0x00007f41: MI_NOOP
0x000a7094:      0x780e0000: 3D UNKNOWN: 3d_965 opcode = 0x780e
0x000a7098:      0x00007f01: MI_NOOP
0x000a709c:      0x78150009: 3D UNKNOWN: 3d_965 opcode = 0x7815
0x000a70a0:      0x00000000: MI_NOOP
0x000a70a4:      0x00030000: MI_NOOP
0x000a70a8:      0x00000000: MI_NOOP
0x000a70ac:      0x00000000: MI_NOOP
0x000a70b0:      0x00000000: MI_NOOP
0x000a70b4:      0x00000000: MI_NOOP
0x000a70b8:      0x00000000: MI_NOOP
0x000a70bc:      0x00000000: MI_NOOP
0x000a70c0:      0x0180e000: MI_WAIT_FOR_EVENT, pipe B hblank wait
0x000a70c4:      0x00000000: MI_NOOP
0x000a70c8:      0x78170009: 3D UNKNOWN: 3d_965 opcode = 0x7817
0x000a70cc:      0x00000000: MI_NOOP
0x000a70d0:      0x00010000: MI_NOOP
0x000a70d4:      0x00000000: MI_NOOP
0x000a70d8:      0x00000000: MI_NOOP
0x000a70dc:      0x00000000: MI_NOOP
0x000a70e0:      0x00000000: MI_NOOP
0x000a70e4:      0x00000000: MI_NOOP
0x000a70e8:      0x00000000: MI_NOOP
0x000a70ec:      0x0180e060: MI_WAIT_FOR_EVENT, pipe B hblank wait, pipe A hblank wait
0x000a70f0:      0x00000000: MI_NOOP
0x000a70f4:      0x78260000: 3DSTATE_BINDING_TABLE_POINTERS_VS
0x000a70f8:      0x00000000:    dword 1
0x000a70fc:      0x782a0000: 3DSTATE_BINDING_TABLE_POINTERS_PS
0x000a7100:      0x00007e20:    dword 1
0x000a7104:      0x782f0000: 3DSTATE_SAMPLER_STATE_POINTERS_PS
0x000a7108:      0x00007e00:    dword 1
Bad length 9 in 3DSTATE_VS, expected 6-6
0x000a710c:      0x78100007: 3DSTATE_VS
0x000a7110:      0x00000d80:    kernel pointer
0x000a7114:      0x00000000:    SPF=0, VME=0, Sampler Count 0, Binding table count 0
0x000a7118:      0x00000000:    scratch offset
0x000a711c:      0x00000000:    Dispatch GRF start 0, VUE read length 0, VUE read offset 0
0x000a7120:      0x00000000:    Max Threads 1, Vertex Cache enable, VS func disable
Bad length 9 in 3DSTATE_HS, expected 7-7
0x000a7130:      0x781b0007: 3DSTATE_HS
0x000a7134:      0x00000000:    dword 1
0x000a7138:      0x00000000:    dword 2
0x000a713c:      0x00000000:    dword 3
0x000a7140:      0x00000000:    dword 4
0x000a7144:      0x00000000:    dword 5
0x000a7148:      0x00000000:    dword 6
0x000a714c:      0x00000000:    dword 7
0x000a7150:      0x00000000:    dword 8
Bad length 11 in 3DSTATE_DS, expected 6-6
0x000a7154:      0x781d0009: 3DSTATE_DS
0x000a7158:      0x00000000:    dword 1
0x000a715c:      0x00000000:    dword 2
0x000a7160:      0x00000000:    dword 3
0x000a7164:      0x00000000:    dword 4
0x000a7168:      0x00000000:    dword 5
0x000a716c:      0x00000000:    dword 6
0x000a7170:      0x00000000:    dword 7
0x000a7174:      0x00000000:    dword 8
0x000a7178:      0x00000000:    dword 9
0x000a717c:      0x00000000:    dword 10
Bad length 10 in 3DSTATE_GS, expected 7-7
0x000a7180:      0x78110008: 3DSTATE_GS
0x000a7184:      0x00000000:    kernel pointer
0x000a7188:      0x00000000:    SPF=0, VME=0, Sampler Count 0, Binding table count 0
0x000a718c:      0x00000000:    scratch offset
0x000a7190:      0x00000000:    Dispatch GRF start 0, VUE read length 0, VUE read offset 0
0x000a7194:      0x00000000:    Max Threads 1, Rendering disable
0x000a7198:      0x00000000:    Reorder disable, Discard Adjaceny disable, GS disable
Bad length 5 in 3DSTATE_STREAMOUT, expected 3-3
0x000a71a8:      0x781e0003: 3DSTATE_STREAMOUT
0x000a71ac:      0x00000000:    dword 1
0x000a71b0:      0x00000000:    dword 2
0x000a71b4:      0x00000000:    dword 3
0x000a71b8:      0x00000000:    dword 4
Bad length 12 in 3DSTATE_PS, expected 8-8
0x000a71bc:      0x7820000a: 3DSTATE_PS
0x000a71c0:      0x00000400:    dword 1
0x000a71c4:      0x00000000:    dword 2
0x000a71c8:      0x480c0000:    dword 3
0x000a71cc:      0x00000000:    dword 4
0x000a71d0:      0x00000000:    dword 5
0x000a71d4:      0x1f800803:    dword 6
0x000a71d8:      0x00040006:    dword 7
0x000a71dc:      0x00000000:    dword 8
0x000a71e0:      0x00000000:    dword 9
0x000a71e4:      0x000008c0:    dword 10
0x000a71e8:      0x00000000:    dword 11
0x000a71ec:      0x780f0000: 3DSTATE_SCISSOR_POINTERS
0x000a71f0:      0x00007de0:    scissor rect offset
Bad length 8 in 3DSTATE_URB, expected 3-3
0x000a71f4:      0x78050006: 3DSTATE_URB
0x000a71f8:      0x204c27ff:    VS entries 10239, alloc size 77 (1024bit row)
0x000a71fc:      0x1076e000:    GS entries 736, alloc size 1 (1024bit row)
0x000a7214:      0x78070003: 3D UNKNOWN: 3d_965 opcode = 0x7807
0x000a7218:      0x080009ff: MI UNKNOWN
0x000a721c:      0x00282000: MI_NOOP
0x000a7220:      0x00000000: MI_NOOP
0x000a7224:      0x00000168: MI_NOOP
Bad length 5 in 3DSTATE_STENCIL_BUFFER, expected 3-3
0x000a7228:      0x78060003: 3DSTATE_STENCIL_BUFFER
0x000a722c:      0x810013ff:    dword 1
0x000a7230:      0x1176e000:    dword 2
0x000a7234:      0x00000000:    dword 3
0x000a7238:      0x00000168:    dword 4
0x000a723c:      0x78040001: 3DSTATE_CLEAR_PARAMS
0x000a7240:      0x00000000:    dword 1
0x000a7244:      0x00000001:    dword 2
0x000a7248:      0x784a0000: 3D UNKNOWN: 3d_965 opcode = 0x784a
0x000a724c:      0x00000000: MI_NOOP
0x000a7250:      0x78080003: 3DSTATE_VERTEX_BUFFERS
0x000a7254:      0x00044018:    buffer 0: sequential, pitch 24b
0x000a7258:      0x132cf000:    buffer address
0x000a725c:      0x00000000:    max index
0x000a7260:      0x00023550:    mbz
0x000a7264:      0x78090007: 3DSTATE_VERTEX_ELEMENTS
0x000a7268:      0x02850000:    buffer 0: invalid, type 0x0085, src offset 0x0000 bytes
0x000a726c:      0x11230000:    (X, Y, 0.0, 1.0), dst offset 0x00 bytes
0x000a7270:      0x02850008:    buffer 0: invalid, type 0x0085, src offset 0x0008 bytes
0x000a7274:      0x11230000:    (X, Y, 0.0, 1.0), dst offset 0x00 bytes
0x000a7278:      0x02c70010:    buffer 0: invalid, type 0x00c7, src offset 0x0010 bytes
0x000a727c:      0x11110000:    (X, Y, Z, W), dst offset 0x00 bytes
0x000a7280:      0x02d80014:    buffer 0: invalid, type 0x00d8, src offset 0x0014 bytes
0x000a7284:      0x12230000:    (X, 0.0, 0.0, 1.0), dst offset 0x00 bytes
0x000a7288:      0x78490001: 3D UNKNOWN: 3d_965 opcode = 0x7849
0x000a728c:      0x00000000: MI_NOOP
0x000a7290:      0x00000000: MI_NOOP
0x000a7294:      0x78490001: 3D UNKNOWN: 3d_965 opcode = 0x7849
0x000a7298:      0x00000001: MI_NOOP
0x000a729c:      0x00000000: MI_NOOP
0x000a72a0:      0x78490001: 3D UNKNOWN: 3d_965 opcode = 0x7849
0x000a72a4:      0x00000002: MI_NOOP
0x000a72a8:      0x00000000: MI_NOOP
0x000a72ac:      0x78490001: 3D UNKNOWN: 3d_965 opcode = 0x7849
0x000a72b0:      0x00000003: MI_NOOP
0x000a72b4:      0x00000000: MI_NOOP
Bad length 7 in (null), expected 6-6
0x000a72b8:      0x7b000005: 3DPRIMITIVE: fail sequential
0x000a72bc:      0x00000004:    vertex count
0x000a72c0:      0x00000036:    start vertex
0x000a72c4:      0x00000000:    instance count
0x000a72c8:      0x00000001:    start instance
0x000a72cc:      0x00000000:    index bias
0x000a72d0:      0x00000000: MI_NOOP
0x000a72d4:      0x05000000: MI_BATCH_BUFFER_END
Comment 2 Jani Saarinen 2018-03-29 07:10:01 UTC
First of all. Sorry about spam.
This is mass update for our bugs. 

Sorry if you feel this annoying but with this trying to understand if bug still valid or not.
If bug investigation still in progress, please ignore this and I apologize!

If you think this is not anymore valid, please comment to the bug that can be closed.
If you haven't tested with our latest pre-upstream tree(drm-tip), can you do that also to see if issue is valid there still and if you cannot see issue there, please comment to the bug.
Comment 3 Jani Saarinen 2018-04-25 08:20:14 UTC
Mika, Chris, any advice here?
Comment 4 Jani Saarinen 2018-05-04 09:54:58 UTC
No updates, Closing, please re-open if occurs again.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.