When we run one single dEQP test case(dEQP-EGL.functional.sharing.gles2.multithread.random.textures.gen_delete.15), we encountered GPU hang on Android with APL/KBL. It could be reproduced with 100%. Note: It could be reproduced with 100%, when we run just the one test at a time.
Created attachment 145164 [details] GPU error state log
Created attachment 145165 [details] GPU error decode with aubinator_error_decode
Created attachment 145166 [details] Kernel Batch buffer log
Created attachment 145167 [details] [review] One simple change to help print batch from kernel
From kernel log, we could find NULL batch info: deqp:testercore-25325 [003] .... 3541.743874: i915_gem_do_execbuffer: Android - kernel - bb obj 00000000b23761e1 start addr fffee2014000: deqp:testercore-25325 [003] .... 3541.743876: i915_gem_do_execbuffer: Android - kernel - 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
One thing that bothers is that this bug cannot be reproduced on desktop Linux. However, I've investigated these multithread tests on desktop with helgrind tool and found couple of possible data races, one in Mesa and one in dEQP framework. I've sent patches for both of these, unfortunately these don't seem to help the bug though.
since you can reproduce it, you should be able to bisect it. If you can't bisect it, then perhaps there is a bug elsewhere in your stack? Test passes reliably on debian for all platforms: https://mesa-ci.01.org/mesa_master_daily/builds/5203/group/b22ced51e2e7542022a306128586f887
Thanks Tapani & Mark. I just pull latest mesa and build on Android. But issue is still here, there should be some specific issues on Android. :(
(In reply to Ren Chenglei from comment #8) > Thanks Tapani & Mark. I just pull latest mesa and build on Android. But > issue is still here, there should be some specific issues on Android. :( Did we always have this issue with these tests, also with Mesa 18.x?
(In reply to Tapani Pälli from comment #9) > (In reply to Ren Chenglei from comment #8) > > Thanks Tapani & Mark. I just pull latest mesa and build on Android. But > > issue is still here, there should be some specific issues on Android. :( > > Did we always have this issue with these tests, also with Mesa 18.x? Yes, this issue also can be reproduced on 18.2.
-- GitLab Migration Automatic Message -- This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/mesa/mesa/issues/1830.
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.