Bug 86326: clEnqueueNDRangeKernel global_work_offset ignored - Luke-Jr <luke-jr+freedesktopbugs@utopios.org> - 11/16/2014
Attachment 114337: libclc get_global_offset() and get_global_id() patch - Ronie Salgado <roniesalg@gmail.com> - 3/16/2015

Previous Reviews
Tom Stellard <tstellar@gmail.com>
3/16/2015
-----------------------------------------------------------------
Hi,
Thanks for the patch. Would you be able to send your updated version to libclc-dev@pcc.me.uk?
ptx-nvidiacl/lib/workitem/get_global_offset.cl
    case 1: return __builtin_ptx_read_global_offset_y();
    case 2: return __builtin_ptx_read_global_offset_z();*/
    default: return 0;
  }
}
Why is this commented out? Also, the function name is wrong: the patch calls it get_local_offset(), but this file should define get_global_offset().
r600/lib/workitem/get_global_offset.ll
  %z = call i32 @llvm.AMDGPU.read.global.offset.z() nounwind readnone
  ret i32 %z
default:
  ret i32 0
}
This should be implemented in OpenCL C rather than LLVM IR. Now that DataLayouts are mandatory, implementing common code in LLVM IR is much more difficult.
You can use the builtins defined in your previous patch for this:
__builtin_amdgpu_read_global_offset_x ...
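As a rough sketch of the shape the OpenCL C version could take, here is the dispatch logic written as plain C so it can be compiled and checked off-device. The read_global_offset_x/y/z functions and the stub_offset array are hypothetical host-side stand-ins for the target builtins; in the real r600 file the calls would go to __builtin_amdgpu_read_global_offset_x and its _y/_z counterparts, whose names are assumed here by analogy with the _x builtin. This is not the actual libclc implementation.

```c
#include <stddef.h>

/* Hypothetical stand-ins for the target builtins, so the switch below
 * can be exercised on the host. On the device these would be
 * __builtin_amdgpu_read_global_offset_x and (assumed) _y / _z. */
static size_t stub_offset[3] = {0, 0, 0};

static size_t read_global_offset_x(void) { return stub_offset[0]; }
static size_t read_global_offset_y(void) { return stub_offset[1]; }
static size_t read_global_offset_z(void) { return stub_offset[2]; }

/* Returns the global_work_offset for the given dimension.
 * Dimensions other than 0, 1 and 2 return 0, matching the default
 * case in the patch under review. */
size_t get_global_offset(unsigned int dim)
{
    switch (dim) {
    case 0: return read_global_offset_x();
    case 1: return read_global_offset_y();
    case 2: return read_global_offset_z();
    default: return 0;
    }
}
```

Keeping the switch in C-like source leaves the choice of DataLayout to the compiler for each target, which is the motivation for moving away from hand-written IR.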