Bug 101252

Summary: eglGetDisplay() is not thread safe
Product: Mesa Reporter: Emre Ucan <eucan>
Component: EGLAssignee: mesa-dev
Status: RESOLVED FIXED QA Contact: mesa-dev
Severity: normal    
Priority: medium CC: eucan
Version: git   
Hardware: Other   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:
Attachments: Sample code for reproducing the bug

Description Emre Ucan 2017-05-31 12:41:34 UTC
Created attachment 131599 [details]
Sample code for reproducing the bug

Hello,

We found out this issue in our GPU driver tests. It happens when many threads are created and every thread creates wl_display object and calls eglGetDisplay(). Then, some threads are getting EGL_NO_DISPLAY.

I implemented a sample code for reproducing the bug (main.c). The issue occurs very often when I run the binary with 300 threads. I tried it in mesa 11.2 but I saw the related code for upstream is the same.

I debugged mesa code and I found out that a thread gets EGL_NO_DISPLAY only when _eglGetNativePlatform() returns _EGL_INVALID_PLATFORM.

native_platform variable is defined as static in _eglGetNativePlatform(). Therefore, different threads are sharing the same variable. When I remove the static keyword and recompile mesa binaries, the issue never occurs.

In my platform, EGL_PLATFORM and EGL_DISPLAY env variables are never defined. Therefore, _eglGetNativePlatformFromEnv() function always returns _EGL_INVALID_PLATFORM.

I think issue occurs when a thread is in the end of _eglGetNativePlatform() just before returning found native_platform, and the other thread overwrites this static variable in _eglNativePlatformFromEnv() function.

Most likely issue is introduced by this commit: 7adb9b094894a512c019b3378eb9e3c69d830edc

Best Regards,
Emre Ucan
Comment 1 Emil Velikov 2017-06-16 11:22:13 UTC
Thanks for the report Emre.

Should be fixed in master as of
commit 311c09165881111c4a596ca7e7b4bce89b059e0f
Author: Eric Engestrom <eric@engestrom.ch>
Date:   Thu Jun 15 23:53:55 2017 +0100

    egl/display: make platform detection thread-safe


Please give it a try and reopen if the issue persists.
Meanwhile we'll pick those for the stable release (might be 17.1.4, since the 17.1.3 queue is already out).

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.