Summary: | Beignet: self-test failed: (3, 7, 5) + (5, 7, 3) returned (6, 7, 5) when running GEGL tests or Darktable | ||
---|---|---|---|
Product: | Beignet | Reporter: | Jan Vesely <jano.vesely> |
Component: | Beignet | Assignee: | Zhigang Gong <zhigang.gong> |
Status: | RESOLVED MOVED | QA Contact: | |
Severity: | normal | ||
Priority: | medium | CC: | alexis.engelke+fdbz, eipiminusone+freedesktop, zhangchi866 |
Version: | unspecified | ||
Hardware: | Other | ||
OS: | All | ||
Whiteboard: | |||
i915 platform: | i915 features: | ||
Attachments: | patch for 1.3.2 |
Description
Jan Vesely
2017-08-10 02:44:12 UTC
Same problem here.

This happens on Debian once the kernel is updated to:

$ uname -a
Linux prime 4.14.0-2-amd64 #1 SMP Debian 4.14.7-1 (2017-12-22) x86_64 GNU/Linux

Previous 4.13 kernel from Debian worked fine (maybe Debian has some patches?).

$ lsb_release -a
No LSB modules are available.
Distributor ID: Debian
Description:    Debian GNU/Linux stable-updates (sid)
Release:        stable-updates
Codename:       sid

BOINC stdoutgpudetect.txt:

27-Dec-2017 16:43:36 [---] cc_config.xml not found - using defaults
Beignet: self-test failed: (3, 7, 5) + (5, 7, 3) returned (6, 7, 5)
This can usually be fixed by upgrading Linux to >= 4.2,
see /usr/share/doc/beignet-dev/Beignet.html or
https://www.freedesktop.org/wiki/Software/Beignet/
Beignet: disabling non-working device
Beignet: disabling non-working device
Beignet: disabling non-working device

$ dmesg | grep i915
[ 2.766105] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
[ 2.768868] [drm] Initialized i915 1.6.0 20170818 for 0000:00:02.0 on minor 0
[ 2.784737] i915 0000:00:02.0: firmware: direct-loading firmware i915/skl_dmc_ver1_26.bin
[ 2.785034] [drm] Finished loading DMC firmware i915/skl_dmc_ver1_26.bin (v1.26)
[ 3.097450] snd_hda_intel 0000:00:1f.3: bound 0000:00:02.0 (ops i915_audio_component_bind_ops [i915])
[ 3.502893] i915 0000:00:02.0: fb0: inteldrmfb frame buffer device

$ dpkg --list | egrep -i '(beignet|ocl|opencl|llvm)'
ii  beignet-dev:amd64            1.3.2-1            amd64  OpenCL for Intel GPUs (development files and documentation)
ii  beignet-opencl-icd:amd64     1.3.2-1            amd64  OpenCL library for Intel GPUs
ii  beignet-opencl-icd:i386      1.3.2-1            i386   OpenCL library for Intel GPUs
ii  clang                        1:4.0-40           amd64  C, C++ and Objective-C compiler (LLVM based)
ii  clang-4.0                    1:4.0.1-8          amd64  C, C++ and Objective-C compiler (LLVM based)
ii  clinfo                       2.1.16.01.12-1+b1  amd64  Query OpenCL system information
ii  geoclue-2.0                  2.4.7-1            amd64  geoinformation service
ii  libgeoclue-2-0:amd64         2.4.7-1            amd64  convenience library to interact with geoinformation service
ii  libllvm4.0:amd64             1:4.0.1-8          amd64  Modular compiler and toolchain technologies, runtime library
ii  libllvm5.0:amd64             1:5.0.1-1          amd64  Modular compiler and toolchain technologies, runtime library
ii  libllvm5.0:i386              1:5.0.1-1          i386   Modular compiler and toolchain technologies, runtime library
ii  libnamespace-autoclean-perl  0.28-1             all    module to remove imported symbols after compilation
ii  libpoclu-dev                 0.14-5             amd64  development files for the pocl utility library
ii  libpoclu1:amd64              0.14-5             amd64  utility library for writing OpenCL programs
ii  llvm                         1:4.0-40           amd64  Low-Level Virtual Machine (LLVM)
ii  llvm-4.0                     1:4.0.1-8          amd64  Modular compiler and toolchain technologies
ii  llvm-4.0-dev                 1:4.0.1-8          amd64  Modular compiler and toolchain technologies, libraries and headers
ii  llvm-4.0-runtime             1:4.0.1-8          amd64  Modular compiler and toolchain technologies, IR interpreter
ii  llvm-runtime                 1:4.0-40           amd64  Low-Level Virtual Machine (LLVM), bytecode interpreter
ii  ocl-icd-dev:amd64            2.2.12-1           amd64  Development files to build a ICD Loader
ii  ocl-icd-libopencl1:amd64     2.2.12-1           amd64  Generic OpenCL ICD Loader
ii  ocl-icd-libopencl1:i386      2.2.12-1           i386   Generic OpenCL ICD Loader
ii  ocl-icd-opencl-dev:amd64     2.2.12-1           amd64  OpenCL development files
ii  opencl-c-headers             2.1-1              all    OpenCL (Open Computing Language) C header files
ii  opencl-clhpp-headers         2.0.10-1           all    C++ headers for OpenCL development
ii  opencl-headers               2.1-1              all    OpenCL (Open Computing Language) header files

lspci:

00:02.0 VGA compatible controller: Intel Corporation HD Graphics 530 (rev 06)

00:02.0 0300: 8086:1912 (rev 06) (prog-if 00 [VGA controller])
        Subsystem: 1849:1912
        Flags: bus master, fast devsel, latency 0, IRQ 129
        Memory at dd000000 (64-bit, non-prefetchable) [size=16M]
        Memory at b0000000 (64-bit, prefetchable) [size=256M]
        I/O ports at f000 [size=64]
        [virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: <access denied>
        Kernel driver in use: i915
        Kernel modules: i915

CPU: Intel(R) Core(TM) i3-6320
CPU @ 3.90GHz

After switching to an older kernel that was working, the problem persists. So I guess an update to Beignet caused the problem. Unfortunately, I cannot recall the last working version.

Also with gegl 0.3.20 only 1 (clones.xml) fails on my machine.

$ uname -a
Linux prime 4.9.0-3-amd64 #1 SMP Debian 4.9.30-2+deb9u5 (2017-09-19) x86_64 GNU/Linux

$ clinfo
Number of platforms                               1
  Platform Name                                   Intel Gen OCL Driver
  Platform Vendor                                 Intel
  Platform Version                                OpenCL 2.0 beignet 1.3
  Platform Profile                                FULL_PROFILE
  Platform Extensions                             cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator cl_intel_subgroups cl_intel_subgroups_short cl_khr_gl_sharing
  Platform Extensions function suffix             Intel

  Platform Name                                   Intel Gen OCL Driver
Number of devices                                 1
  Device Name                                     Intel(R) HD Graphics Skylake Desktop GT2
  Device Vendor                                   Intel
  Device Vendor ID                                0x8086
  Device Version                                  OpenCL 2.0 beignet 1.3
  Driver Version                                  1.3
  Device OpenCL C Version                         OpenCL C 2.0 beignet 1.3
  Device Type                                     GPU
  Device Profile                                  FULL_PROFILE
  Max compute units                               23
  Max clock frequency                             1000MHz
  Device Partition                                (core)
    Max number of sub-devices                     1
    Supported partition types                     None, None, None
  Max work item dimensions                        3
  Max work item sizes                             512x512x512
  Max work group size                             512
  Preferred work group size multiple              16
  Preferred / native vector sizes
    char                                          16 / 8
    short                                         8 / 8
    int                                           4 / 4
    long                                          2 / 2
    half                                          0 / 8 (cl_khr_fp16)
    float                                         4 / 4
    double                                        0 / 2 (n/a)
  Half-precision Floating-point support           (cl_khr_fp16)
    Denormals                                     No
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 No
    Round to infinity                             No
    IEEE754-2008 fused multiply-add               No
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  No
  Single-precision Floating-point support         (core)
    Denormals                                     No
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 No
    Round to infinity                             No
    IEEE754-2008 fused multiply-add               No
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  No
  Double-precision Floating-point support         (n/a)
  Address bits                                    32, Little-Endian
  Global memory size                              4294967296 (4GiB)
  Error Correction support                        No
  Max memory allocation                           3221225472 (3GiB)
  Unified memory for Host and Device              Yes
  Shared Virtual Memory (SVM) capabilities        (core)
    Coarse-grained buffer sharing                 Yes
    Fine-grained buffer sharing                   No
    Fine-grained system sharing                   No
    Atomics                                       No
  Minimum alignment for any data type             128 bytes
  Alignment of base address                       1024 bits (128 bytes)
  Preferred alignment for atomics
    SVM                                           0 bytes
    Global                                        0 bytes
    Local                                         0 bytes
  Max size for global variable                    65536 (64KiB)
  Preferred total size of global vars             65536 (64KiB)
  Global Memory cache type                        Read/Write
  Global Memory cache size                        8192
  Global Memory cache line                        64 bytes
  Image support                                   Yes
    Max number of samplers per kernel             16
    Max size for 1D images from buffer            65536 pixels
    Max 1D or 2D image array size                 2048 images
    Base address alignment for 2D image buffers   4096 bytes
    Pitch alignment for 2D image buffers          1 bytes
    Max 2D image size                             8192x8192 pixels
    Max 3D image size                             8192x8192x2048 pixels
    Max number of read image args                 128
    Max number of write image args                8
    Max number of read/write image args           8
  Max number of pipe args                         16
  Max active pipe reservations                    1
  Max pipe packet size                            1024
  Local memory type                               Local
  Local memory size                               65536 (64KiB)
  Max constant buffer size                        134217728 (128MiB)
  Max number of constant args                     8
  Max size of kernel argument                     1024
  Queue properties (on host)
    Out-of-order execution                        No
    Profiling                                     Yes
  Queue properties (on device)
    Out-of-order execution                        Yes
    Profiling                                     Yes
    Preferred size                                16384 (16KiB)
    Max size                                      262144 (256KiB)
  Max queues on device                            1
  Max events on device                            1024
  Prefer user sync for interop                    Yes
  Profiling timer resolution                      80ns
  Execution capabilities
    Run OpenCL kernels                            Yes
    Run native kernels                            Yes
    SPIR versions                                 1.2
  printf() buffer size                            1048576 (1024KiB)
  Built-in kernels                                __cl_copy_region_align4;__cl_copy_region_align16;__cl_cpy_region_unalign_same_offset;__cl_copy_region_unalign_dst_offset;__cl_copy_region_unalign_src_offset;__cl_copy_buffer_rect;__cl_copy_image_1d_to_1d;__cl_copy_image_2d_to_2d;__cl_copy_image_3d_to_2d;__cl_copy_image_2d_to_3d;__cl_copy_image_3d_to_3d;__cl_copy_image_2d_to_buffer;__cl_copy_image_3d_to_buffer;__cl_copy_buffer_to_image_2d;__cl_copy_buffer_to_image_3d;__cl_fill_region_unalign;__cl_fill_region_align2;__cl_fill_region_align4;__cl_fill_region_align8_2;__cl_fill_region_align8_4;__cl_fill_region_align8_8;__cl_fill_region_align8_16;__cl_fill_region_align128;__cl_fill_image_1d;__cl_fill_image_1d_array;__cl_fill_image_2d;__cl_fill_image_2d_array;__cl_fill_image_3d;
  Device Available                                Yes
  Compiler Available                              Yes
  Linker Available                                Yes
  Device Extensions                               cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator cl_intel_subgroups cl_intel_subgroups_short cl_khr_gl_sharing cl_khr_fp16

NULL platform behavior
  clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...)  Intel Gen OCL Driver
  clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...)   Success [Intel]
  clCreateContext(NULL, ...)
[default]  Success [Intel]
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU)  Success (1)
    Platform Name                                 Intel Gen OCL Driver
    Device Name                                   Intel(R) HD Graphics Skylake Desktop GT2
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL)  Success (1)
    Platform Name                                 Intel Gen OCL Driver
    Device Name                                   Intel(R) HD Graphics Skylake Desktop GT2

ICD loader properties
  ICD loader Name                                 OpenCL ICD Loader
  ICD loader Vendor                               OCL Icd free software
  ICD loader Version                              2.2.12
  ICD loader Profile                              OpenCL 2.2
        NOTE:   your OpenCL library declares to support OpenCL 2.2,
                but it seems to support up to OpenCL 2.1 only.

(In reply to Zhang Chi from comment #1)
> Same problem here.
>
> This happens on Debian once the kernel is updated to:
>
> $ uname -a
> Linux prime 4.14.0-2-amd64 #1 SMP Debian 4.14.7-1 (2017-12-22) x86_64
> GNU/Linux
>
> Previous 4.13 kernel from Debian worked fine (maybe Debian has some
> patches?).
>
> $ lsb_release -a
> No LSB modules are available.
> Distributor ID: Debian
> Description: Debian GNU/Linux stable-updates (sid)
> Release: stable-updates
> Codename: sid
>
> BOINC stdoutgpudetect.txt:
>
> 27-Dec-2017 16:43:36 [---] cc_config.xml not found - using defaults
> Beignet: self-test failed: (3, 7, 5) + (5, 7, 3) returned (6, 7, 5)
> This can usually be fixed by upgrading Linux to >= 4.2,
> see /usr/share/doc/beignet-dev/Beignet.html or
> https://www.freedesktop.org/wiki/Software/Beignet/
> Beignet: disabling non-working device
> Beignet: disabling non-working device
> Beignet: disabling non-working device
>
> $ dmesg | grep i915
> [ 2.766105] i915 0000:00:02.0: vgaarb: changed VGA decodes:
> olddecodes=io+mem,decodes=none:owns=io+mem
> [ 2.768868] [drm] Initialized i915 1.6.0 20170818 for 0000:00:02.0 on
> minor 0
> [ 2.784737] i915 0000:00:02.0: firmware: direct-loading firmware
> i915/skl_dmc_ver1_26.bin
> [ 2.785034] [drm] Finished loading DMC firmware i915/skl_dmc_ver1_26.bin
> (v1.26)
> [ 3.097450] snd_hda_intel 0000:00:1f.3: bound 0000:00:02.0 (ops
> i915_audio_component_bind_ops [i915])
> [ 3.502893] i915 0000:00:02.0: fb0: inteldrmfb frame buffer device
>
> $ dpkg --list | egrep -i '(beignet|ocl|opencl|llvm)'
> ii beignet-dev:amd64 1.3.2-1
> amd64 OpenCL for Intel GPUs (development files and documentation)
> ii beignet-opencl-icd:amd64 1.3.2-1
> amd64 OpenCL library for Intel GPUs
> ii beignet-opencl-icd:i386 1.3.2-1
> i386 OpenCL library for Intel GPUs
> ii clang 1:4.0-40
> amd64 C, C++ and Objective-C compiler (LLVM based)
> ii clang-4.0 1:4.0.1-8
> amd64 C, C++ and Objective-C compiler (LLVM based)
> ii clinfo
> 2.1.16.01.12-1+b1 amd64 Query OpenCL system
> information
> ii geoclue-2.0 2.4.7-1
> amd64 geoinformation service
> ii libgeoclue-2-0:amd64 2.4.7-1
> amd64 convenience library to interact with geoinformation service
> ii libllvm4.0:amd64 1:4.0.1-8
> amd64 Modular compiler and toolchain technologies, runtime library
> ii libllvm5.0:amd64 1:5.0.1-1
> amd64 Modular compiler and toolchain technologies, runtime library
> ii libllvm5.0:i386 1:5.0.1-1
> i386 Modular compiler and toolchain technologies, runtime library
> ii libnamespace-autoclean-perl 0.28-1
> all module to remove imported symbols after compilation
> ii libpoclu-dev 0.14-5
> amd64 development files for the pocl utility library
> ii libpoclu1:amd64 0.14-5
> amd64 utility library for writing OpenCL programs
> ii llvm 1:4.0-40
> amd64 Low-Level Virtual Machine (LLVM)
> ii llvm-4.0 1:4.0.1-8
> amd64 Modular compiler and toolchain technologies
> ii llvm-4.0-dev 1:4.0.1-8
> amd64 Modular compiler and toolchain technologies, libraries and
> headers
> ii llvm-4.0-runtime 1:4.0.1-8
> amd64 Modular compiler and toolchain technologies, IR interpreter
> ii llvm-runtime 1:4.0-40
> amd64 Low-Level Virtual Machine (LLVM), bytecode interpreter
> ii ocl-icd-dev:amd64 2.2.12-1
> amd64 Development files to build a ICD Loader
> ii ocl-icd-libopencl1:amd64 2.2.12-1
> amd64 Generic OpenCL ICD Loader
> ii ocl-icd-libopencl1:i386 2.2.12-1
> i386 Generic OpenCL ICD Loader
> ii ocl-icd-opencl-dev:amd64 2.2.12-1
> amd64 OpenCL development files
> ii opencl-c-headers 2.1-1
> all OpenCL (Open Computing Language) C header files
> ii opencl-clhpp-headers 2.0.10-1
> all C++ headers for OpenCL development
> ii opencl-headers 2.1-1
> all OpenCL (Open Computing Language) header files
>
> lspci:
>
> 00:02.0 VGA compatible controller: Intel Corporation HD Graphics 530 (rev 06)
>
> 00:02.0 0300: 8086:1912 (rev 06) (prog-if 00 [VGA controller])
> Subsystem: 1849:1912
> Flags: bus master, fast devsel, latency 0, IRQ 129
> Memory at dd000000 (64-bit, non-prefetchable) [size=16M]
> Memory at b0000000 (64-bit, prefetchable) [size=256M]
> I/O ports at f000 [size=64]
> [virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
> Capabilities: <access denied>
> Kernel driver in use: i915
> Kernel modules: i915
>
> CPU: Intel(R) Core(TM) i3-6320 CPU @ 3.90GHz

Created attachment 136558
patch for 1.3.2

This error
also occurs on starting Darktable, and is *not* hardware or Linux version specific: I see it on Ivy Bridge [8086:0166]/Linux 4.9, and it's also been reported from Haswell [8086:0412]/Linux 4.13:
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=885423

In 1.3.x, reverting 81755054c4c19d821e58456a1a7d601806e60e92 fixed it; I haven't tried master.

Getting the same issue running darktable on an i7-8550U

(In reply to Rebecca Palmer from comment #4)
> This error also occurs on starting Darktable, and is *not* hardware or Linux
> version specific: I see it on Ivy Bridge [8086:0166]/Linux 4.9, and it's
> also been reported from Haswell [8086:0412]/Linux 4.13:
> https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=885423
>
> In 1.3.x, reverting 81755054c4c19d821e58456a1a7d601806e60e92 fixed it; I
> haven't tried master.

I can confirm that the following change fixes the issue on master:

diff --git a/backend/src/backend/gen_insn_selection_optimize.cpp b/backend/src/backend/gen_insn_selection_optimize.cpp
index e52d562e..c196ff05 100644
--- a/backend/src/backend/gen_insn_selection_optimize.cpp
+++ b/backend/src/backend/gen_insn_selection_optimize.cpp
@@ -292,7 +292,7 @@ namespace gbe
         if (insn.opcode == SEL_OP_MOV)
           addToReplaceInfoMap(insn);
-        doZeroAddedOptimization(insn);
+//        doZeroAddedOptimization(insn);
       }
       cleanReplaceInfoMap();
     }

-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/beignet/beignet/issues/7.
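
For anyone triaging logs that contain this message, here is a minimal sketch that pulls the three vectors out of the self-test line and shows which components are wrong. It assumes the self-test is a plain element-wise integer addition, which is how the message reads but is not confirmed anywhere in this thread. The helper names (`parse_self_test`, `vector_add`, `wrong_components`) are hypothetical and not part of Beignet, and the pattern only matches the exact wording quoted in this report.

```python
import re

# One parenthesised triple of non-negative integers, e.g. "(3, 7, 5)".
_TRIPLE = r"\((\d+), (\d+), (\d+)\)"

# Matches the message as quoted in this report; other Beignet versions
# may word it differently (assumption).
SELF_TEST_RE = re.compile(
    r"Beignet: self-test failed: " + _TRIPLE + r" \+ " + _TRIPLE
    + r" returned " + _TRIPLE
)


def parse_self_test(line):
    """Return (a, b, returned) as integer tuples, or None if no match."""
    match = SELF_TEST_RE.search(line)
    if match is None:
        return None
    nums = [int(group) for group in match.groups()]
    return tuple(nums[0:3]), tuple(nums[3:6]), tuple(nums[6:9])


def vector_add(a, b):
    """Element-wise sum; assumed to be what the self-test computes."""
    return tuple(x + y for x, y in zip(a, b))


def wrong_components(a, b, returned):
    """Indices where the returned vector differs from the expected sum."""
    expected = vector_add(a, b)
    return [i for i, (e, r) in enumerate(zip(expected, returned)) if e != r]


if __name__ == "__main__":
    line = "Beignet: self-test failed: (3, 7, 5) + (5, 7, 3) returned (6, 7, 5)"
    a, b, returned = parse_self_test(line)
    print("expected:", vector_add(a, b))
    print("returned:", returned)
    print("wrong component indices:", wrong_components(a, b, returned))
```

Under that assumption the expected sum is (8, 14, 8), so all three components of the returned (6, 7, 5) are wrong.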