Bug 86150 - [ivb] GPU HANG: ecode 2:0xfffffffe, in X [2488], reason: Ring hung, action: reset (DRAM failure?)
Summary: [ivb] GPU HANG: ecode 2:0xfffffffe, in X [2488], reason: Ring hung, action: r...
Status: CLOSED INVALID
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: Other All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-11-11 13:49 UTC by Alin M Elena
Modified: 2017-07-06 17:33 UTC (History)
1 user (show)

See Also:
i915 platform:
i915 features:


Attachments
GPU HANG: ecode 2:0xfffffffe, in X [2488], reason: Ring hung, action: reset (2.04 MB, text/plain)
2014-11-11 13:49 UTC, Alin M Elena
no flags Details

Description Alin M Elena 2014-11-11 13:49:10 UTC
Created attachment 109269 [details]
GPU HANG: ecode 2:0xfffffffe, in X [2488], reason: Ring hung, action: reset

GPU HANG: ecode 2:0xfffffffe, in X [2488], reason: Ring hung, action: reset

error code attached
Comment 1 Chris Wilson 2014-11-11 14:02:43 UTC
Last blitter seqno LRI: 0x00000f6f

Value in blitter HWS: 0x00000b6f

Single bit difference, oh dear.
Comment 2 Jesse Barnes 2015-03-25 22:06:03 UTC
Alin, can you run memtest on your machine and see how well it does?  Chris's comment indicates you could be seeing single bit errors (which are more common than they ought to be; why don't we have ECC everywhere yet?).
Comment 3 Alin M Elena 2015-03-26 09:37:22 UTC
Hi Jesse,

Ii have already run memtest on the machine few passes one long night and nothing came out of it... 
I would love to have something like... memory is broken... will save me a lot of time.

Since reporting the bug just to update.. 
I discovered that on kernel 3.12 much less crashes with dri enabled
kernel 3.19 almost unusable, kwin,x and plasma crash... 
with dri disabled things seem to be more stable but still crashes from time to time.

Alin
Comment 4 Jesse Barnes 2015-04-02 21:05:39 UTC
Ugg, well something is amiss here... have you tried disabling rc6?  You can pass 'i915.enable_rc6=0' on the kernel boot line to try that.
Comment 5 Alin M Elena 2015-04-03 08:45:41 UTC
already there.

i915.enable_rc6=0, i mean all the errors are reported with it set to 0.

Alin
Comment 6 Alin M Elena 2015-04-03 08:47:41 UTC
anyhow asked for a memory replacement.. unfortunatelly never trust dell next bussines day service, seem to be next week service. I will report once again when the new memory is in. (it will be a full motherboard)

Alin
Comment 7 Alin M Elena 2015-04-09 22:21:11 UTC
Turned out was the memory... 
So we shall close this.

Alin


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.