[Bug 108947] GPU hang when running heavy compute workload

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed Dec 5 07:24:33 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=108947

            Bug ID: 108947
           Summary: GPU hang when running heavy compute workload
           Product: Mesa
           Version: 18.3
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: critical
          Priority: medium
         Component: Drivers/Vulkan/intel
          Assignee: intel-3d-bugs at lists.freedesktop.org
          Reporter: zhiwen.wu at intel.com
        QA Contact: intel-3d-bugs at lists.freedesktop.org
                CC: jason at jlekstrand.net

I wrote an compute shader to do the convolution algorithm and run it on Intel
Apollo Lake GPU by using Vulkan API. When the convolution is a heavy workload,
GPU hang occurred.

==== Test environments:
    Ubuntu 16.04
    Mesa 18.3
    Vulkan SDK: 1.1.85.0
    CPU: Intel Celeron J3455
    GPU: HD Graphics 500 (Apollo Lake, 12 EU)

==== Steps to reproduce:
    git clone https://github.com/wzw-intel/vulkan_minimal_compute.git
    cd vulkan_minimal_compute
    mkdir build
    cd build
    cmake ..
    make
    cd ../
    ./build/vulkan_minimal_compute

==== What does the test program do
    This program will run a convolution shader 10 times serially. Each run will
be synced by a dedicated VkFence object. GPU hang may occur at any iteration
and print log "INTEL-MESA: error: vulkan/anv_device.c:2091: GPU hang on one of
our command buffers (VK_ERROR_DEVICE_LOST)"
    Not every run for program triger the GPU hang. If not hang, try more.

==== Other foundings:
    - Setting "LIGHT_WORKLOAD=1" environement variable (it makes the total
GFLOPS reduced by 50%) make GPU hang disappear. It seems that GPU hang only
occur for heavy workload

    - No GPU hang for high end Intel GPU.
      I tested this program on i7-6770HQ (GPU: Iris Pro Graphics 580, GT4e, 72
EU), no GPU hang. But on Intel Celeron J3455 (GPU: HD Graphics 500).
     and Intel Soc with HD Graphics 530 GPU, GPU hang occurs.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are the QA Contact for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-3d-bugs/attachments/20181205/f99f3b3c/attachment.html>


More information about the intel-3d-bugs mailing list