Bug 90195 - [SKL] Running torcs for a while hangs the machine
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: DRI git
Hardware: Other All
Importance: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
Reported: 2015-04-27 15:04 UTC by Timo Aaltonen
Modified: 2017-07-24 22:47 UTC
i915 platform: SKL

Description Timo Aaltonen 2015-04-27 15:04:15 UTC
Running torcs on drm-intel-next-2015-04-23 or earlier, with mesa 10.5.2 or current master, either hangs the machine hard or segfaults torcs. Nothing useful can be retrieved from the machine when the hang happens, since it's dead.

Reproduced with SKL-S beta and SKL-Y alpha SDPs.
Comment 1 Chris Wilson 2015-04-27 15:15:01 UTC
The segfault suggests an oops, so netconsole? Also try disabling fbdev to disable the kms panic handler.
Comment 2 Timo Aaltonen 2015-05-12 17:35:48 UTC
netconsole shows nothing

I don't know what the other sentence means.. build a special kernel?
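
For reference, the receiving end of a netconsole setup can be sketched as a plain UDP listener. This is only an illustration, not what was actually run for this bug: the function names `open_listener` and `read_messages` are hypothetical, and port 6666 is assumed to be the netconsole target port (to my knowledge the documented default, but check the kernel's netconsole documentation for your setup).

```python
import socket


def open_listener(host="0.0.0.0", port=6666):
    """Bind a UDP socket on the address netconsole is assumed to send to."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind((host, port))
    return sock


def read_messages(sock, timeout=1.0, max_messages=None):
    """Yield decoded datagrams until `timeout` seconds pass with no data.

    Stops early once `max_messages` datagrams have been yielded, if given.
    """
    sock.settimeout(timeout)
    count = 0
    while max_messages is None or count < max_messages:
        try:
            data, _addr = sock.recvfrom(65535)
        except socket.timeout:
            # Nothing arrived within the timeout: stop iterating.
            return
        count += 1
        # Kernel log text is normally ASCII; replace anything undecodable.
        yield data.decode("utf-8", errors="replace")
```

A caller would do something like `for line in read_messages(open_listener(), timeout=60): print(line, end="")` on a second machine. If the target hangs without printing anything, as reported here, the loop simply times out with no output.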
Comment 3 Ander Conselvan de Oliveira 2015-05-18 10:00:28 UTC
(In reply to Timo Aaltonen from comment #2)
> netconsole shows nothing
> 
> I don't know what the other sentence means.. build a special kernel?

Chris, did you mean to compile with CONFIG_DRM_I915_FBDEV disabled?
Comment 4 Chris Wilson 2015-05-18 10:04:01 UTC
Yes, but even more so to compile without FBCON as we (tjaalton and I) discussed on irc. Basically it just prevents a machine hang in case we panic whilst holding the struct_mutex (because our atomic panic handler is not so atomic).
Comment 5 Timo Aaltonen 2015-05-18 19:13:12 UTC
So I built a kernel without CONFIG_FRAMEBUFFER* and CONFIG_DRM_I915_FBDEV, but it still hangs the same way. The kernel is 3.19 + backported i915 for SKL/CHV from 4.1-rc3 + WA fixes etc., but IIRC a recent-ish nightly hangs too.
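
A quick way to double-check that a built kernel really has those options off is to scan its config file. The sketch below is an assumption-laden illustration, not part of the original report: `enabled_options` is a hypothetical helper, the trailing `*` in `CONFIG_FRAMEBUFFER*` is treated as a wildcard, and the sample text stands in for a real config file (whose location, for example under `/boot`, depends on the distribution).

```python
from fnmatch import fnmatchcase

# Option patterns taken from the comment above; "*" is treated as a wildcard.
PATTERNS = ["CONFIG_FRAMEBUFFER*", "CONFIG_DRM_I915_FBDEV"]


def enabled_options(config_text, patterns=PATTERNS):
    """Return {option: value} for matching options that are still enabled."""
    enabled = {}
    for line in config_text.splitlines():
        line = line.strip()
        # Disabled options appear as "# CONFIG_FOO is not set" comments.
        if not line or line.startswith("#"):
            continue
        name, sep, value = line.partition("=")
        if not sep or value == "n":
            continue
        if any(fnmatchcase(name, pattern) for pattern in patterns):
            enabled[name] = value
    return enabled


# Made-up sample config, for illustration only.
SAMPLE = """\
CONFIG_DRM_I915=m
# CONFIG_DRM_I915_FBDEV is not set
CONFIG_FRAMEBUFFER_CONSOLE=y
# CONFIG_FB is not set
"""

print(enabled_options(SAMPLE))
```

An empty result means none of the listed options are enabled in the config that was scanned. In the made-up sample, `CONFIG_FRAMEBUFFER_CONSOLE` (the option behind FBCON) is still on and would be reported.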
Comment 6 Chris Wilson 2015-05-18 19:43:35 UTC
Ok, that means you have an "it can't possibly happen but it does" GPU hang that kills the machine (as opposed to an easier-to-debug driver deadlock).
Comment 7 Ander Conselvan de Oliveira 2015-06-23 08:08:32 UTC
Does the hard hang still happen with the latest -nightly, which contains the following commit?

commit 7fd2d26921d1dd70732d8765d714ec3a023a3ca9
Author: Mika Kuoppala <mika.kuoppala@linux.intel.com>
Date:   Thu Jun 18 12:51:40 2015 +0300

    drm/i915: Reset request handling for gen8+
Comment 8 Timo Aaltonen 2015-06-23 11:19:19 UTC
I tried drm-intel-next-2015-06-19 which has it too, but it still hung the same way, also with i915.reset=0 like before.
Comment 9 Timo Aaltonen 2015-06-23 12:33:58 UTC
ran that kernel plus mesa 10.6.0 with the patch from 90425 added, and now torcs just segfaults
Comment 10 Timo Aaltonen 2015-08-18 15:25:06 UTC
I believe this got fixed by WaEnableGapsTsvCreditFix (245d96670d2655f7)
Comment 11 Jani Nikula 2015-08-18 15:54:32 UTC
Thanks for the follow-up, Timo!