Summary: | Same kernel but huge performance difference under linux and windows | ||
---|---|---|---|
Product: | Beignet | Reporter: | wangbiaouestc |
Component: | Beignet | Assignee: | Zhigang Gong <zhigang.gong> |
Status: | RESOLVED MOVED | QA Contact: | |
Severity: | normal | ||
Priority: | medium | ||
Version: | unspecified | ||
Hardware: | x86-64 (AMD64) | ||
OS: | All | ||
Whiteboard: | |||
i915 platform: | i915 features: |
Description
wangbiaouestc
2015-02-24 22:50:14 UTC
Functionally the kernel works, but I am curious why there is so much difference under windows and linux Could you share your kernel here? Or you can share the LLVM IR? To get the LLVM IR and GEN IR,you can set the following environment variable before run your application: # export OCL_OUTPUT_LLVM_AFTER_GEN=1 # export OCL_OUTPUT_GEN_IR=1 Then run your application # ./test_app > ir.log Then you can paster the ir.log here. Could you try to add the following code right before the for() loops in the kernel? #pragma unroll 1024 -- GitLab Migration Automatic Message -- This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/beignet/beignet/issues/72. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.