[Rusticl] darktable with radeonsi on Ryzen 5600G

Bernhard Held berny156 at gmx.de
Mon Dec 25 10:29:52 UTC 2023


Hello,

I'm trying to run darktable using Rusticl on a Ryzen 5600G (Vega GCN 5.1 / fx90c:xnack-).

Is this supposed to work already? I get black images only even with simplest workflow (loading JPG). No errors are reported.

Mesa: 23.2.1
darktable 4.4.2

Any hint would be welcome!

Thanks
Berny

###############################################################################################

darktable -d opencl -d perf
      0,0189 [dt_get_sysresource_level] switched to 0 as `small'
      0,0189   total mem:       27947MB
      0,0189   mipmap cache:    1746MB
      0,0189   available mem:   3493MB
      0,0189   singlebuff:      109MB
      0,0189   OpenCL tune mem: OFF
      0,0189   OpenCL pinned:   OFF
[opencl_init] opencl related configuration options:
[opencl_init] opencl: ON
[opencl_init] opencl_scheduling_profile: 'default'
[opencl_init] opencl_library: 'default path'
[opencl_init] opencl_device_priority: '*/!0,*/*/*/!0,*'
[opencl_init] opencl_mandatory_timeout: 200
[opencl_init] opencl library 'libOpenCL' found on your system and loaded
[opencl_init] found 1 platform
[opencl_init] found 1 device

[dt_opencl_device_init]
    DEVICE:                   0: 'AMD Radeon Graphics (renoir, LLVM 17.0.6, DRM 3.54, 6.6.6-1-default)'
    PLATFORM NAME & VENDOR:   rusticl, Mesa/X.org
    CANONICAL NAME:           rusticlamdradeongraphics
    DRIVER VERSION:           23.2.1
    DEVICE VERSION:           OpenCL 3.0
    DEVICE_TYPE:              GPU
    GLOBAL MEM SIZE:          13974 MB
    MAX MEM ALLOC:            2048 MB
    MAX IMAGE SIZE:           16384 x 16384
    MAX WORK GROUP SIZE:      1024
    MAX WORK ITEM DIMENSIONS: 3
    MAX WORK ITEM SIZES:      [ 1024 1024 1024 ]
    ASYNC PIXELPIPE:          NO
    PINNED MEMORY TRANSFER:   NO
    MEMORY TUNING:            NO
    FORCED HEADROOM:          400
    AVOID ATOMICS:            NO
    MICRO NAP:                250
    ROUNDUP WIDTH:            16
    ROUNDUP HEIGHT:           16
    CHECK EVENT HANDLES:      128
    PERFORMANCE:              1.725
    TILING ADVANTAGE:         0.000
    DEFAULT DEVICE:           NO
    KERNEL BUILD DIRECTORY:   /usr/share/darktable/kernels
    KERNEL DIRECTORY:         /home/berny/.cache/darktable/cached_v1_kernels_for_rusticlAMDRadeonGraphics_2321
    CL COMPILER OPTION:       -cl-fast-relaxed-math
    KERNEL LOADING TIME:       0.1892 sec
[opencl_init] OpenCL successfully initialized. Internal numbers and names of available devices:
[opencl_init]           0       'rusticl AMD Radeon Graphics'
[opencl_init] FINALLY: opencl is AVAILABLE and ENABLED.
[dt_opencl_update_priorities] these are your device priorities:
[dt_opencl_update_priorities]           image   preview export  thumbs  preview2
[dt_opencl_update_priorities]           0       -1      0       0       -1
[dt_opencl_update_priorities] show if opencl use is mandatory for a given pixelpipe:
[dt_opencl_update_priorities]           image   preview export  thumbs  preview2
[dt_opencl_update_priorities]           0       0       0       0       0
[opencl_synchronization_timeout] synchronization timeout set to 200
[dt_opencl_update_priorities] these are your device priorities:
[dt_opencl_update_priorities]           image   preview export  thumbs  preview2
[dt_opencl_update_priorities]           0       -1      0       0       -1
[dt_opencl_update_priorities] show if opencl use is mandatory for a given pixelpipe:
[dt_opencl_update_priorities]           image   preview export  thumbs  preview2
[dt_opencl_update_priorities]           0       0       0       0       0
[opencl_synchronization_timeout] synchronization timeout set to 200
      1,1716 [dt_dev_load_raw] loading the image. took 0,040 secs (0,178 CPU)
      1,2811 [export] creating pixelpipe took 0,106 secs (1,007 CPU)
      1,2811 [dt_opencl_check_tuning] use 5302MB (tunemem=OFF, pinning=OFF) on device `rusticl AMD Radeon Graphics' id=0
      1,3144 [resample_plain] plan 0,000 secs (0,000 CPU) resample 0,033 secs (0,342 CPU)
      1,3144 [dev_pixelpipe] took 0,033 secs (0,342 CPU) initing base buffer [thumbnail]
      1,3260 [dev_pixelpipe] took 0,012 secs (0,063 CPU) [thumbnail] processed `colorin' on GPU, blended on GPU
      1,3276 [dev_pixelpipe] took 0,002 secs (0,001 CPU) [thumbnail] processed `colorout' on GPU, blended on GPU
      1,3316 [dev_pixelpipe] took 0,004 secs (0,001 CPU) [thumbnail] processed `gamma' on CPU, blended on CPU
      1,3316 [opencl_profiling] profiling device 0 ('rusticl AMD Radeon Graphics'):
      1,3316 [opencl_profiling] spent  0,0000 seconds in [Write Image (from host to device)]
      1,3316 [opencl_profiling] spent  0,0000 seconds in colorin_unbound
      1,3316 [opencl_profiling] spent  0,0000 seconds in colorout
      1,3317 [opencl_profiling] spent  0,0000 seconds in [Read Image (from device to host)]
      1,3317 [opencl_profiling] spent  0,0000 seconds totally in command queue (with 0 events missing)
      1,3317 [dev_process_thumbnail] pixel pipeline processing took 0,051 secs (0,408 CPU)
      7,1241 [dt_dev_load_raw] loading the image. took 0,000 secs (0,000 CPU)
      7,1737 [export] creating pixelpipe took 0,047 secs (0,100 CPU)
      7,1737 [dev_pixelpipe] took 0,000 secs (0,000 CPU) initing base buffer [export]
      7,2092 [dev_pixelpipe] took 0,035 secs (0,037 CPU) [export] processed `colorin' on GPU, blended on GPU
      7,2144 [dev_pixelpipe] took 0,005 secs (0,003 CPU) [export] processed `colorout' on GPU, blended on GPU
      7,2149 [resample_cl] took 0,000 secs (0,000 CPU) 1:1 copy/crop of 2612x2736 pixels
      7,2221 [dev_pixelpipe] took 0,008 secs (0,001 CPU) [export] processed `finalscale' on GPU, blended on GPU
      7,2663 [opencl_profiling] profiling device 0 ('rusticl AMD Radeon Graphics'):
      7,2663 [opencl_profiling] spent  0,0000 seconds in [Write Image (from host to device)]
      7,2663 [opencl_profiling] spent  0,0000 seconds in colorin_unbound
      7,2663 [opencl_profiling] spent  0,0000 seconds in colorout
      7,2663 [opencl_profiling] spent  0,0000 seconds in [Copy Image (on device)]
      7,2663 [opencl_profiling] spent  0,0000 seconds in [Read Image (from device to host)]
      7,2663 [opencl_profiling] spent  0,0000 seconds totally in command queue (with 0 events missing)
      7,2664 [dev_process_export] pixel pipeline processing took 0,093 secs (0,078 CPU)
      7,3641 [export_job] exported to `darktable.jpg'
  [opencl_summary_statistics] device 'rusticl AMD Radeon Graphics' (0): 9 out of 9 events were successful and 0 events lost. max event=4

###############################################################################################

$ clinfo
Number of platforms                               1
   Platform Name                                   rusticl
   Platform Vendor                                 Mesa/X.org
   Platform Version                                OpenCL 3.0
   Platform Profile                                FULL_PROFILE
   Platform Extensions                             cl_khr_byte_addressable_store cl_khr_create_command_queue cl_khr_extended_versioning cl_khr_icd cl_khr_il_program cl_khr_spirv_no_integer_wrap_decoration
   Platform Extensions with Version                cl_khr_byte_addressable_store                                    0x400000 (1.0.0)
                                                   cl_khr_create_command_queue                                      0x400000 (1.0.0)
                                                   cl_khr_extended_versioning                                       0x400000 (1.0.0)
                                                   cl_khr_icd                                                       0x400000 (1.0.0)
                                                   cl_khr_il_program                                                0x400000 (1.0.0)
                                                   cl_khr_spirv_no_integer_wrap_decoration                          0x400000 (1.0.0)
   Platform Numeric Version                        0xc00000 (3.0.0)
   Platform Extensions function suffix             MESA
   Platform Host timer resolution                  1ns

   Platform Name                                   rusticl
Number of devices                                 1
   Device Name                                     AMD Radeon Graphics (renoir, LLVM 17.0.6, DRM 3.54, 6.6.6-1-default)
   Device Vendor                                   AMD
   Device Vendor ID                                0x1002
   Device Version                                  OpenCL 3.0
   Device UUID                                     00000000-3000-0000-0000-000000000000
   Driver UUID                                     414d442d-4d45-5341-2d44-525600000000
   Valid Device LUID                               No
   Device LUID                                     0000-000000000000
   Device Node Mask                                0
   Device Numeric Version                          0xc00000 (3.0.0)
   Driver Version                                  23.2.1
   Device OpenCL C Version                         OpenCL C 1.2
   Device OpenCL C Numeric Version                 0x402000 (1.2.0)
   Device OpenCL C all versions                    OpenCL C                                                         0xc00000 (3.0.0)
                                                   OpenCL C                                                         0x402000 (1.2.0)
                                                   OpenCL C                                                         0x401000 (1.1.0)
                                                   OpenCL C                                                         0x400000 (1.0.0)
   Device OpenCL C features                        __opencl_c_integer_dot_product_input_4x8bit_packed               0x800000 (2.0.0)
                                                   __opencl_c_integer_dot_product_input_4x8bit                      0x800000 (2.0.0)
                                                   __opencl_c_int64                                                 0x400000 (1.0.0)
                                                   __opencl_c_images                                                0x400000 (1.0.0)
                                                   __opencl_c_3d_image_writes                                       0x400000 (1.0.0)
                                                   __opencl_c_subgroups                                             0x400000 (1.0.0)
   Latest conformance test passed                  v0000-01-01-00
   Device Type                                     GPU
   Device PCI bus info (KHR)                       PCI-E, 0000:30:00.0
   Device Profile                                  EMBEDDED_PROFILE
   Device Available                                Yes
   Compiler Available                              Yes
   Linker Available                                Yes
   Max compute units                               7
   Max clock frequency                             2100MHz
   Device Partition                                (core)
     Max number of sub-devices                     0
     Supported partition types                     None
     Supported affinity domains                    (n/a)
   Max work item dimensions                        3
   Max work item sizes                             1024x1024x1024
   Max work group size                             1024
   Preferred work group size multiple (device)     64
   Preferred work group size multiple (kernel)     64
   Max sub-groups per work group                   16
   Preferred / native vector sizes
     char                                                 1 / 1
     short                                                1 / 1
     int                                                  1 / 1
     long                                                 1 / 1
     half                                                 0 / 0        (n/a)
     float                                                1 / 1
     double                                               0 / 0        (n/a)
   Half-precision Floating-point support           (n/a)
   Single-precision Floating-point support         (core)
     Denormals                                     No
     Infinity and NANs                             Yes
     Round to nearest                              Yes
     Round to zero                                 No
     Round to infinity                             No
     IEEE754-2008 fused multiply-add               No
     Support is emulated in software               No
     Correctly-rounded divide and sqrt operations  No
   Double-precision Floating-point support         (n/a)
   Address bits                                    64, Little-Endian
   Global memory size                              14652411904 (13.65GiB)
   Error Correction support                        No
   Max memory allocation                           2147483648 (2GiB)
   Unified memory for Host and Device              No
   Shared Virtual Memory (SVM) capabilities        (core)
     Coarse-grained buffer sharing                 No
     Fine-grained buffer sharing                   No
     Fine-grained system sharing                   No
     Atomics                                       No
   Minimum alignment for any data type             128 bytes
   Alignment of base address                       4096 bits (512 bytes)
   Preferred alignment for atomics
     SVM                                           0 bytes
     Global                                        0 bytes
     Local                                         0 bytes
   Atomic memory capabilities                      relaxed, work-group scope
   Atomic fence capabilities                       relaxed, acquire/release, work-group scope
   Max size for global variable                    0
   Preferred total size of global vars             0
   Global Memory cache type                        None
   Image support                                   Yes
     Max number of samplers per kernel             32
     Max size for 1D images from buffer            268435455 pixels
     Max 1D or 2D image array size                 2048 images
     Base address alignment for 2D image buffers   0 bytes
     Pitch alignment for 2D image buffers          0 pixels
     Max 2D image size                             16384x16384 pixels
     Max 3D image size                             2048x2048x2048 pixels
     Max number of read image args                 32
     Max number of write image args                16
     Max number of read/write image args           0
   Pipe support                                    No
   Max number of pipe args                         0
   Max active pipe reservations                    0
   Max pipe packet size                            0
   Local memory type                               Global
   Local memory size                               65536 (64KiB)
   Max number of constant args                     16
   Max constant buffer size                        2147483648 (2GiB)
   Generic address space support                   No
   Max size of kernel argument                     32768 (32KiB)
   Queue properties (on host)
     Out-of-order execution                        No
     Profiling                                     Yes
   Device enqueue capabilities                     (n/a)
   Queue properties (on device)
     Out-of-order execution                        No
     Profiling                                     No
     Preferred size                                0
     Max size                                      0
   Max queues on device                            0
   Max events on device                            0
   Prefer user sync for interop                    Yes
   Profiling timer resolution                      10ns
   Execution capabilities
     Run OpenCL kernels                            Yes
     Run native kernels                            No
     Non-uniform work-groups                       No
     Work-group collective functions               No
     Sub-group independent forward progress        No
     IL version                                    SPIR-V_1.0 SPIR-V_1.1 SPIR-V_1.2 SPIR-V_1.3 SPIR-V_1.4
     ILs with version                              SPIR-V                                                           0x400000 (1.0.0)
                                                   SPIR-V                                                           0x401000 (1.1.0)
                                                   SPIR-V                                                           0x402000 (1.2.0)
                                                   SPIR-V                                                           0x403000 (1.3.0)
                                                   SPIR-V                                                           0x404000 (1.4.0)
   printf() buffer size                            1048576 (1024KiB)
   Built-in kernels                                (n/a)
   Built-in kernels with version                   (n/a)
   Device Extensions                               cl_khr_byte_addressable_store cl_khr_create_command_queue cl_khr_extended_versioning cl_khr_icd cl_khr_il_program cl_khr_spirv_no_integer_wrap_decoration cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_integer_dot_product cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cles_khr_int64 cl_khr_3d_image_writes cl_khr_pci_bus_info cl_khr_device_uuid
   Device Extensions with Version                  cl_khr_byte_addressable_store                                    0x400000 (1.0.0)
                                                   cl_khr_create_command_queue                                      0x400000 (1.0.0)
                                                   cl_khr_extended_versioning                                       0x400000 (1.0.0)
                                                   cl_khr_icd                                                       0x400000 (1.0.0)
                                                   cl_khr_il_program                                                0x400000 (1.0.0)
                                                   cl_khr_spirv_no_integer_wrap_decoration                          0x400000 (1.0.0)
                                                   cl_khr_global_int32_base_atomics                                 0x400000 (1.0.0)
                                                   cl_khr_global_int32_extended_atomics                             0x400000 (1.0.0)
                                                   cl_khr_integer_dot_product                                       0x800000 (2.0.0)
                                                   cl_khr_local_int32_base_atomics                                  0x400000 (1.0.0)
                                                   cl_khr_local_int32_extended_atomics                              0x400000 (1.0.0)
                                                   cles_khr_int64                                                   0x400000 (1.0.0)
                                                   cl_khr_3d_image_writes                                           0x400000 (1.0.0)
                                                   cl_khr_pci_bus_info                                              0x400000 (1.0.0)
                                                   cl_khr_device_uuid                                               0x400000 (1.0.0)


More information about the mesa-users mailing list