  case 1: return __builtin_ptx_read_global_offset_y();
  case 2: return __builtin_ptx_read_global_offset_z();*/
  default: return 0;
  }
}
Why is this commented out? Also the function name is wrong: get_local_offset().
  %z = call i32 @llvm.AMDGPU.read.global.offset.z() nounwind readnone
  ret i32 %z
default:
  ret i32 0
}
This should be implemented in OpenCL C rather than LLVM IR. Now that DataLayouts are mandatory, implementing common code in LLVM IR is much more difficult. You can use the builtins defined in your previous patch for this: __builtin_amdgpu_read_global_offset_x ...
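Something along these lines, as a rough, untested sketch of the shape I have in mind. It is written as plain C so it compiles on a host. The three read_global_offset_* helpers and their return values are hypothetical stand-ins: in the real implementation each call would be replaced by the matching builtin from the previous patch (the x builtin named above and its y and z counterparts, whose exact names I am assuming follow the same pattern).

```c
#include <stddef.h>

/* Hypothetical stand-ins so the sketch builds without an OpenCL toolchain.
 * In the library, each of these calls would be the corresponding target
 * builtin from the previous patch; the return values are arbitrary
 * placeholders. */
static unsigned read_global_offset_x(void) { return 16; }
static unsigned read_global_offset_y(void) { return 32; }
static unsigned read_global_offset_z(void) { return 48; }

/* Select the per-dimension offset; out-of-range dimensions yield 0,
 * matching the default case in the patch under review. */
size_t get_global_offset(unsigned dim) {
  switch (dim) {
  case 0: return read_global_offset_x();
  case 1: return read_global_offset_y();
  case 2: return read_global_offset_z();
  default: return 0;
  }
}
```

Keeping the dispatch in C means no target-specific DataLayout has to be written into a shared .ll file.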