Summary: | [BSW/HD400] Aztec Ruins GL version GPU hangs | ||
---|---|---|---|
Product: | Mesa | Reporter: | Eero Tamminen <eero.t.tamminen> |
Component: | Drivers/DRI/i965 | Assignee: | Jordan Justen <jljusten> |
Status: | VERIFIED FIXED | QA Contact: | Intel 3D Bugs Mailing List <intel-3d-bugs> |
Severity: | normal | ||
Priority: | medium | CC: | clayton.a.craft |
Version: | git | ||
Hardware: | Other | ||
OS: | All | ||
See Also: | https://bugs.freedesktop.org/show_bug.cgi?id=105290 | ||
Whiteboard: | |||
i915 platform: | i915 features: | ||
Attachments: |
error state for GPU hang with aztec ruins on GLK
BSW GPU error state for GL Aztec Ruins |
Description
Eero Tamminen
2018-01-15 12:03:54 UTC
We are seeing GL hang for aztec ruins on GLK Linux. Can you share the error state for BSW hang. Attaching GLK one for reference. Created attachment 137126 [details]
error state for GPU hang with aztec ruins on GLK
Created attachment 137127 [details]
BSW GPU error state for GL Aztec Ruins
On BSW, there's GPU hang in GL Azec Ruins approximately on 4 runs out of 5. Error state attached. Vulkan version works still fine.
(In reply to Sagar Kamble from comment #2) > Created attachment 137126 [details] > error state for GPU hang with aztec ruins on GLK Hi Sagar, Based on your error state, it looks like your Mesa does not have commit 55a97db52347f62111a24715078c6035380d3e19, which ought to fix that hang. We shipped this fix in 18.0-rc1, but it looks like I forgot to get it included in the 17.3.x releases. I just nominated it [ https://lists.freedesktop.org/archives/mesa-stable/2018-March/007941.html ] so hopefully it'll hit 17.3.7. Your bug is different than Eero's. His is hanging after a BLORP operation. Yours was hanging on a compute shader. I can't reproduce Eero's BSW hang. (In reply to Eero Tamminen from comment #3) > Created attachment 137127 [details] > BSW GPU error state for GL Aztec Ruins > > On BSW, there's GPU hang in GL Azec Ruins approximately on 4 runs out of 5. > Error state attached. Vulkan version works still fine. Hey Eero, are you using a HD 400 (12 EU) or HD 405 (16 EU)? Unfortunately, Braswell is not identifiable by PCI ID alone :( *** Bug 105210 has been marked as a duplicate of this bug. *** Both Eero and Clayton's error states look identical, and the hanging BLORP operation has a totally bogus pixel shader. The one and only instruction is: illegal(1) { align1 1N }; So that's clearly not going to work out. Now the question is...how did that happen? It looks like somebody has scribbled over the program cache. It appears to contain: - Zeroes - <0x14, 0x14, 0x14, 0x14> (offset 0x280) - Zeroes - <14.5f, 14.5f, 14.5f, 14.5f> (offset 0x680) - <15.5f, 15.5f, 15.5f, 15.5f> (offset 0x690) - Zeroes - <1.5f, 5.5f, 9.5f, 13.5f>, repeated twice (offset 0x720) - Zeroes - <14.5f, 14.5f, 14.5f, 14.5f> (offset 0x840) - <15.5f, 15.5f, 15.5f, 15.5f> (offset 0x850) - Zeroes - some more floating point numbers and zeroes - wildly different looking data starting at 0xc80 - probably the real data (In reply to Kenneth Graunke from comment #5) > Hey Eero, are you using a HD 400 (12 EU) or HD 405 (16 EU)? Unfortunately, > Braswell is not identifiable by PCI ID alone :( It's same as what Clayton has: "Device: Mesa DRI Intel(R) HD Graphics 400 (Braswell) (0x22b1)" (In reply to Eero Tamminen from comment #9) > It's same as what Clayton has: > "Device: Mesa DRI Intel(R) HD Graphics 400 (Braswell) (0x22b1)" Ah, terrific, thanks! I believe compute shader scratch is scribbling over the program cache, but only for the HD 400 (6 EU) model. The HD 405 (8 EU) model that I've been testing with doesn't suffer from this bug, which is why I couldn't reproduce it. Jordan sent a patch to the mailing list which should fix this bug: https://patchwork.freedesktop.org/patch/208502/ *** Bug 105290 has been marked as a duplicate of this bug. *** Should be fixed by: commit 06e3bd02c01e499332a9c02b40f506df9695bced i965: Hard code CS scratch_ids_per_subslice for Cherryview Verified. I've seen once a GPU hang in the Aztec Ruins Vulkan version after this, but not in the GL/GLES versions. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.