[Bug 108875] Invalid subgroupSize for Intel GPU

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Nov 27 14:23:29 UTC 2018


Jason Ekstrand <jason at jlekstrand.net> changed:

           What    |Removed                     |Added
         Resolution|---                         |NOTABUG
             Status|NEW                         |RESOLVED

--- Comment #6 from Jason Ekstrand <jason at jlekstrand.net> ---
Just because the advertised subgroupSize is 32 doesn't mean we have to run
with "full" subgroups.  Intel hardware has dispatch widths of 8, 16, and 32.
In the Vulkan subgroup model, dispatch widths of 8 and 16 are advertised as a
subgroup size of 32 in which only the first 8 or 16 invocations are enabled.
This is entirely in line with the spec; nothing guarantees that
local_size_x = subgroupSize will get you a single subgroup or that
gl_NumSubgroups = DIV_ROUND_UP(gl_WorkgroupSize.x * gl_WorkgroupSize.y *
gl_WorkgroupSize.z, gl_SubgroupSize).
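
To make the arithmetic concrete, here is a rough sketch in Python (not driver
code).  It assumes, purely for illustration, that a workgroup is split into
ceil(invocations / dispatch_width) subgroups; the function names are made up
for this example and do not come from Mesa or the Vulkan API.

```python
ADVERTISED_SUBGROUP_SIZE = 32  # what subgroupSize reports


def div_round_up(n, d):
    """Integer division rounding up, like DIV_ROUND_UP in C."""
    return (n + d - 1) // d


def naive_num_subgroups(local_size, subgroup_size=ADVERTISED_SUBGROUP_SIZE):
    """The count an application might (wrongly) assume from subgroupSize."""
    x, y, z = local_size
    return div_round_up(x * y * z, subgroup_size)


def num_subgroups_at_width(local_size, dispatch_width):
    """Illustrative model: one subgroup per dispatch_width invocations."""
    x, y, z = local_size
    return div_round_up(x * y * z, dispatch_width)


if __name__ == "__main__":
    local_size = (32, 1, 1)  # local_size_x = subgroupSize
    print("naive estimate:", naive_num_subgroups(local_size))
    for width in (8, 16, 32):
        print("dispatch width", width, "->",
              num_subgroups_at_width(local_size, width), "subgroup(s)")
```

Under this model the naive estimate only matches when the shader actually
runs 32-wide, so a shader should read gl_NumSubgroups at run time rather
than derive it from the workgroup size.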

Besides that, you really don't want to run 32-wide on Intel.  The performance
trade-offs are almost never worth it.  We default to 16-wide because that
tends to be a nice sweet spot, but sometimes 8-wide is even better.
