Summary: | Einstein@Home openCL app returns incorect results | ||
---|---|---|---|
Product: | Beignet | Reporter: | Pavel Ondračka <pavel.ondracka> |
Component: | Beignet | Assignee: | Zhigang Gong <zhigang.gong> |
Status: | RESOLVED MOVED | QA Contact: | |
Severity: | normal | ||
Priority: | medium | CC: | elhoir, porcelain_mouse, ruiling.song, xiuli.pan, yejun.guo |
Version: | unspecified | Keywords: | bisected, regression |
Hardware: | Other | ||
OS: | All | ||
Whiteboard: | |||
i915 platform: | i915 features: | ||
Attachments: |
testcase
stderr.txt |
Description
Pavel Ondračka
2016-10-08 12:00:35 UTC
Created attachment 127140 [details]
testcase
Created attachment 127148 [details]
stderr.txt
I tried it on my skylake machine with ubuntu 16.04, and observed the difference when FP_CONTRACT is on/off. I'm also curious why the math accuracy makes such a difference. Could you narrow down the application to a simple one? I'm not sure if it is a driver issue, or if the application algorithm itself is accuracy sensitive. As a workaround, you can try to make FP_CONTRACT off in the opencl kernel. > I'm also curious why the math accuracy makes such a difference. Could you
> narrow down the application to a simple one? I'm not sure if it is a driver
> issue, or if the application algorithm itself is accuracy sensitive.
By narrowing down the application you mean to narrowing it down to specific kernel? To tell the truth I'm mostly a user of the application and don't have much idea what is going on, in my understanding it should be some sort of FFT, but how much is it accuracy sensitive I have no idea. I'll try to get more info from some person who actually understands the code and I'll have a look at the source code again to see if I can narrow this down.
BTW is there some switch or debug variable to dump the compiled kernels in some human readable format so I can actually see what is the difference in produced instructions with and without FP_CONTRACT. Hi Pavel, You can find these in docs/Beignet/Backend.mdwn. In you case, I think OCL_OUTPUT_LLVM_BEFORE_LINK=1 is enough. Thanks Xiuli -- GitLab Migration Automatic Message -- This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/beignet/beignet/issues/75. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.