[Mesa-dev] [PATCH v2 13/14] radeonsi: Process multiple patches per threadgroup.
Marek Olšák
maraeo at gmail.com
Mon May 16 20:15:31 UTC 2016
On Fri, May 13, 2016 at 3:37 AM, Bas Nieuwenhuizen
<bas at basnieuwenhuizen.nl> wrote:
> Using more than 1 wave per threadgroup does increase performance
> generally. Not using too many patches per threadgroup also
> increases performance. Both catalyst and amdgpu-pro seem to
> use 40 patches as their maximum, but I haven't really seen
> any performance increase from limiting the number of patches
> to 40 instead of 64.
40 may be optimal for existing OpenGL apps on some chips.
Vulkan doesn't set more than 16.
Let's set either 40 or 16 with a comment where the value comes from.
>
> Note that the trick where we overlap the input and output LDS
> does not work anymore as the insertion of the tess factors
> changes the patch stride.
I don't understand this. Can you explain it more?
Marek
More information about the mesa-dev
mailing list