[Beignet] issue with cl

Pan, Xiuli xiuli.pan at intel.com
Mon Apr 10 02:58:39 UTC 2017


Hi Michal,

Could you report a bug on https://bugs.freedesktop.org/ and provide the full log.

Thanks
Xiuli

-----Original Message-----
From: Beignet [mailto:beignet-bounces at lists.freedesktop.org] On Behalf Of Michal B
Sent: Sunday, April 9, 2017 11:41 PM
To: beignet at lists.freedesktop.org
Subject: [Beignet] issue with cl

Hi everybody,

please could you help me ... i'm getting errors even after patch ...
my hardware: intel i5-4250U , intel graphics HD 5000, i'm using debian , installed packages:
libdrm  2.4.74-1
llvm 3.5, 3.8
kernel 4.9.13-1~bpo8+1 (2017-02-27) x86_64

i used patch :
https://www.mail-archive.com/beignet@lists.freedesktop.org/msg07315.html

but i'm still getting error:

drm_intel_gem_bo_context_exec() failed: Device or resource busy
Beignet: "Exec event 0x1d8d5f0 error, type is 4592, error staus is -5"

please could you give some hints how to solve it ?

i ran included benchmark tools :
... skipping successes ...
  Vector size 16:
    Offset 0 :    43.7GB/S
    Offset 1 :    41.1GB/S
    Offset 2 :    41.1GB/S
    Offset 3 :    41.1GB/S
    Offset 4 :    43.8GB/S
    Offset 5 :    41.1GB/S
    Offset 6 :    41.1GB/S
    Offset 7 :    41.2GB/S
    Offset 8 :    43.9GB/S
    Offset 9 :    41.0GB/S
    Offset 10 :    41.0GB/S
    Offset 11 :    41.1GB/S
    Offset 12 :    43.7GB/S
    Offset 13 :    41.0GB/S
    Offset 14 :    41.1GB/S
    Offset 15 :    41.1GB/S
    [Result: 71.372 GB/S]    [SUCCESS]
benchmark_copy_buf()    [Result: 4.157 GB/S]    [SUCCESS]
benchmark_use_host_ptr_buffer()    [Result: 9.369 GB/S]    [SUCCESS]
benchmark_read_buffer()    [Result: 10.984 GB/S]    [SUCCESS]
benchmark_read_image()    [Result: 9.871 GB/S]    [SUCCESS]
benchmark_copy_buffer_to_image()    [Result: 4.802 GB/S]    [SUCCESS]
benchmark_copy_image_to_buffer()    [Result: 4.805 GB/S]    [SUCCESS]
benchmark_copy_buffer_uchar()    [Result: 610.909 FPS]    [SUCCESS]
benchmark_copy_buffer_ushort()    [Result: 287.810 FPS]    [SUCCESS]
benchmark_copy_buffer_uint()    [Result: 144.271 FPS]    [SUCCESS]
benchmark_filter_buffer_uchar()    [Result: 302.743 FPS]    [SUCCESS]
benchmark_filter_buffer_ushort()    [Result: 269.329 FPS]    [SUCCESS]
benchmark_filter_buffer_uint()    [Result: 144.244 FPS]    [SUCCESS]
benchmark_copy_image_uchar()    [Result: 417.807 FPS]    [SUCCESS]
benchmark_copy_image_ushort()    [Result: 290.080 FPS]    [SUCCESS]
benchmark_copy_image_uint()    [Result: 144.758 FPS]    [SUCCESS]
benchmark_filter_image_uchar()    [Result: 301.701 FPS]    [SUCCESS]
benchmark_filter_image_ushort()    [Result: 179.831 FPS]    [SUCCESS]
benchmark_filter_image_uint()    [Result: 89.430 FPS]    [SUCCESS]
benchmark_workgroup_broadcast_1D_int()ASSERTION FAILED: 0
  at file .../Beignet-1.3.1-Source/backend/src/backend/gen_context.cpp,
function virtual void gbe::GenContext::emitUnpackLongInstruction(const
gbe::SelectionInstruction&), line 2313
Trace/breakpoint trap

result of clinfo:
Number of platforms:                 2
  Platform Profile:                 FULL_PROFILE
  Platform Version:                 OpenCL 1.2 beignet 1.3
  Platform Name:                 Intel Gen OCL Driver
  Platform Vendor:                 Intel
  Platform Extensions:
cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics
cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics
cl_khr_byte_addressable_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator cl_intel_subgroups cl_intel_subgroups_short
  Platform Profile:                 FULL_PROFILE
  Platform Version:                 OpenCL 1.2 beignet 1.3
  Platform Name:                 Intel Gen OCL Driver
  Platform Vendor:                 Intel
  Platform Extensions:
cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics
cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics
cl_khr_byte_addressable_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator cl_intel_subgroups cl_intel_subgroups_short cl_khr_gl_sharing


  Platform Name:                 Intel Gen OCL Driver
Number of devices:                 1
  Device Type:                     CL_DEVICE_TYPE_GPU
  Device ID:                     32902
  Max compute units:                 40
  Max work items dimensions:             3
    Max work items[0]:                 512
    Max work items[1]:                 512
    Max work items[2]:                 512
  Max work group size:                 512
  Preferred vector width char:             16
  Preferred vector width short:             8
  Preferred vector width int:             4
  Preferred vector width long:             2
  Preferred vector width float:             4
  Preferred vector width double:         0
  Native vector width char:             8
  Native vector width short:             8
  Native vector width int:             4
  Native vector width long:             2
  Native vector width float:             4
  Native vector width double:             2
  Max clock frequency:                 1000Mhz
  Address bits:                     32
  Max memory allocation:             1513095168
  Image support:                 Yes
  Max number of images read arguments:         128
  Max number of images write arguments:         8
  Max image 2D width:                 8192
  Max image 2D height:                 8192
  Max image 3D width:                 8192
  Max image 3D height:                 8192
  Max image 3D depth:                 2048
  Max samplers within kernel:             16
  Max size of kernel argument:             1024
  Alignment (bits) of base address:         1024
  Minimum alignment (bytes) for any datatype:     128
  Single precision floating point capability
    Denorms:                     No
    Quiet NaNs:                     Yes
    Round to nearest even:             Yes
    Round to zero:                 No
    Round to +ve and infinity:             No
    IEEE754-2008 fused multiply-add:         No
  Cache type:                     Read/Write
  Cache line size:                 64
  Cache size:                     8192
  Global memory size:                 2018508800
  Constant buffer size:                 134217728
  Max number of constant args:             8
  Local memory type:                 Global
  Local memory size:                 65536
  Error correction support:             0
  Unified memory for Host and Device:         1
  Profiling timer resolution:             80
  Device endianness:                 Little
  Available:                     Yes
  Compiler available:                 Yes
  Execution capabilities:
    Execute OpenCL kernels:             Yes
    Execute native function:             Yes
  Queue properties:
    Out-of-Order:                 No
    Profiling :                     Yes
  Platform ID:                     0x7f1728671840
  Name:                         Intel(R) HD Graphics Haswell Ultrabook
GT3 Mobile
  Vendor:                     Intel
  Device OpenCL C version:             OpenCL C 1.2 beignet 1.3
  Driver version:                 1.3
  Profile:                     FULL_PROFILE
  Version:                     OpenCL 1.2 beignet 1.3
  Extensions:                     cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator cl_intel_subgroups cl_intel_subgroups_short


  Platform Name:                 Intel Gen OCL Driver
Number of devices:                 1
  Device Type:                     CL_DEVICE_TYPE_GPU
  Device ID:                     32902
  Max compute units:                 40
  Max work items dimensions:             3
    Max work items[0]:                 512
    Max work items[1]:                 512
    Max work items[2]:                 512
  Max work group size:                 512
  Preferred vector width char:             16
  Preferred vector width short:             8
  Preferred vector width int:             4
  Preferred vector width long:             2
  Preferred vector width float:             4
  Preferred vector width double:         0
  Native vector width char:             8
  Native vector width short:             8
  Native vector width int:             4
  Native vector width long:             2
  Native vector width float:             4
  Native vector width double:             2
  Max clock frequency:                 1000Mhz
  Address bits:                     32
  Max memory allocation:             1513095168
  Image support:                 Yes
  Max number of images read arguments:         128
  Max number of images write arguments:         8
  Max image 2D width:                 8192
  Max image 2D height:                 8192
  Max image 3D width:                 8192
  Max image 3D height:                 8192
  Max image 3D depth:                 2048
  Max samplers within kernel:             16
  Max size of kernel argument:             1024
  Alignment (bits) of base address:         1024
  Minimum alignment (bytes) for any datatype:     128
  Single precision floating point capability
    Denorms:                     No
    Quiet NaNs:                     Yes
    Round to nearest even:             Yes
    Round to zero:                 No
    Round to +ve and infinity:             No
    IEEE754-2008 fused multiply-add:         No
  Cache type:                     Read/Write
  Cache line size:                 64
  Cache size:                     8192
  Global memory size:                 2018508800
  Constant buffer size:                 134217728
  Max number of constant args:             8
  Local memory type:                 Local
  Local memory size:                 65536
  Error correction support:             0
  Unified memory for Host and Device:         1
  Profiling timer resolution:             80
  Device endianness:                 Little
  Available:                     Yes
  Compiler available:                 Yes
  Execution capabilities:
    Execute OpenCL kernels:             Yes
    Execute native function:             Yes
  Queue properties:
    Out-of-Order:                 No
    Profiling :                     Yes
  Platform ID:                     0x7f17245c4600
  Name:                         Intel(R) HD Graphics Haswell Ultrabook
GT3 Mobile
  Vendor:                     Intel
  Device OpenCL C version:             OpenCL C 1.2 beignet 1.3
  Driver version:                 1.3
  Profile:                     FULL_PROFILE
  Version:                     OpenCL 1.2 beignet 1.3
  Extensions:                     cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator cl_intel_subgroups cl_intel_subgroups_short cl_khr_gl_sharing


Thank you,

Kind regards,
Michal
_______________________________________________
Beignet mailing list
Beignet at lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/beignet


More information about the Beignet mailing list