Summary: | [ivb] IPEHR:0xffffffff upon context restore | ||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Product: | DRI | Reporter: | Ross Lagerwall <rosslagerwall> | ||||||||||||||||||
Component: | DRM/Intel | Assignee: | Ben Widawsky <ben> | ||||||||||||||||||
Status: | CLOSED INVALID | QA Contact: | Intel GFX Bugs mailing list <intel-gfx-bugs> | ||||||||||||||||||
Severity: | normal | ||||||||||||||||||||
Priority: | medium | ||||||||||||||||||||
Version: | unspecified | ||||||||||||||||||||
Hardware: | x86-64 (AMD64) | ||||||||||||||||||||
OS: | Linux (All) | ||||||||||||||||||||
Whiteboard: | |||||||||||||||||||||
i915 platform: | i915 features: | ||||||||||||||||||||
Attachments: |
|
Description
Ross Lagerwall
2013-04-30 07:37:50 UTC
Created attachment 78629 [details]
i915 error state
Created attachment 78630 [details]
System log
Created attachment 78631 [details]
Xorg log
Created attachment 78632 [details]
lspci -nn
Created attachment 78633 [details]
glxinfo
Created attachment 78634 [details]
cat /proc/cpuinfo
This will be interesting to see if commit 4615d4c9e27eda42c3e965f208a4b4065841498c Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Mon Apr 8 14:28:40 2013 +0100 drm/i915: Use MLC (l3$) for context objects has any impact. Can you please try the current drm-intel-nightly kernel from ppa:mainline? (In reply to comment #7) > This will be interesting to see if > > commit 4615d4c9e27eda42c3e965f208a4b4065841498c > Author: Chris Wilson <chris@chris-wilson.co.uk> > Date: Mon Apr 8 14:28:40 2013 +0100 > > drm/i915: Use MLC (l3$) for context objects > > has any impact. Can you please try the current drm-intel-nightly kernel from > ppa:mainline? Yes, it seems to work well with the drm-intel-nightly kernel. Can you please check whether cherry-picking the referenced patch to a stable kernel fixes the issues, too? (In reply to comment #9) > Can you please check whether cherry-picking the referenced patch to a stable > kernel fixes the issues, too? Yes, applying it on top of the Ubuntu 3.8 kernel worked fine. I've just sent out the stable backport request, so this should get fixed in the next stable kernel releases (or one of the next, around the merge window there's a bit a lag usually due to the high patch load). Thanks for reporting this issue and please reopen if it breaks again. On the same machine, I tried running SuperTuxKart on Arch Linux with Linux kernel 3.10-rc2, mesa 9.1.2 and Intel drivers 2.21.6. I seemed to get the same hang, even though 3.10-rc2 contains the above-mentioned commit. I will attach the error state and relevant dmesg log. Created attachment 79626 [details]
i915 error state from v3.10-rc2
Created attachment 79627 [details]
System log from v3.10-rc2
Aye, that appears to be same hang. Note that with IVB and MSAA I see lots of corruption with large swaths of memory being overwritten with pixel values (lots of 0xffffffff especially). That would include the possibility of overwritting context memory. Isolating MSAA in mesa would be tricky... perhaps a hack to disable? I can confirm that I did see strange white corruption when playing the game, but I thought it was unrelated or an application error. Unfortunately, I don't have access to the hardware anymore so I cannot further test anything. However, given that the hangs happened on two different OSes, with the latest kernel versions, it should be easy enough to reproduce. (In reply to comment #17) > I can confirm that I did see strange white corruption when playing the > game, but I thought it was unrelated or an application error. > > Unfortunately, I don't have access to the hardware anymore so I cannot > further test anything. However, given that the hangs happened on two > different OSes, with the latest kernel versions, it should be easy > enough to reproduce. If someone can reproduce this, can they read back register 0x20f4? On Wed, Jun 26, 2013 at 11:04:43PM +0000, bugzilla-daemon@freedesktop.org wrote: > https://bugs.freedesktop.org/show_bug.cgi?id=64073 > > --- Comment #18 from Ben Widawsky <ben@bwidawsk.net> --- > If someone can reproduce this, can they read back register 0x20f4? > Would that not be in the error state dump I attached? Hopefully https://patchwork.kernel.org/patch/2841344/ is the right fix. (In reply to comment #19) > On Wed, Jun 26, 2013 at 11:04:43PM +0000, bugzilla-daemon@freedesktop.org > wrote: > > https://bugs.freedesktop.org/show_bug.cgi?id=64073 > > > > --- Comment #18 from Ben Widawsky <ben@bwidawsk.net> --- > > If someone can reproduce this, can they read back register 0x20f4? > > > > Would that not be in the error state dump I attached? No. But I can no longer remember what I wanted anyway. (In reply to comment #20) > Hopefully https://patchwork.kernel.org/patch/2841344/ is the right fix. Can you please test the above patch? (In reply to comment #22) > (In reply to comment #20) > > Hopefully https://patchwork.kernel.org/patch/2841344/ is the right fix. > > Can you please test the above patch? Unfortunately, as I said in comment #17, I don't have access to the IVB hardware anymore so I can't test the patch. Hw no longer available for testing, so closing. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.