[Mesa-dev] [PATCH 3/4] radeonsi: fix gl_PrimitiveID in tessellation with instanced draws on SI
Marek Olšák
maraeo at gmail.com
Fri Jun 2 13:01:02 UTC 2017
On Wed, May 3, 2017 at 3:58 PM, Nicolai Hähnle <nhaehnle at gmail.com> wrote:
> From: Nicolai Hähnle <nicolai.haehnle at amd.com>
>
> Cc: mesa-stable at lists.freedesktop.org
> ---
> src/gallium/drivers/radeonsi/si_state_draw.c | 14 ++++++++++++++
> 1 file changed, 14 insertions(+)
>
> diff --git a/src/gallium/drivers/radeonsi/si_state_draw.c b/src/gallium/drivers/radeonsi/si_state_draw.c
> index e6a9ee0..3d1d1f8 100644
> --- a/src/gallium/drivers/radeonsi/si_state_draw.c
> +++ b/src/gallium/drivers/radeonsi/si_state_draw.c
> @@ -181,20 +181,34 @@ static void si_emit_derived_tess_state(struct si_context *sctx,
>
> /* Not necessary for correctness, but improves performance. The
> * specific value is taken from the proprietary driver.
> */
> *num_patches = MIN2(*num_patches, 40);
>
> /* SI bug workaround - limit LS-HS threadgroups to only one wave. */
> if (sctx->b.chip_class == SI) {
> unsigned one_wave = 64 / MAX2(num_tcs_input_cp, num_tcs_output_cp);
> *num_patches = MIN2(*num_patches, one_wave);
> +
> + if (sctx->screen->b.info.max_se == 1) {
> + /* The VGT HS block increments the patch ID unconditionally
> + * within a single threadgroup. This results in incorrect
> + * patch IDs when instanced draws are used.
> + *
> + * The intended solution is to restrict threadgroups to
> + * a single instance by setting SWITCH_ON_EOI, which
> + * should cause IA to split instances up. However, this
> + * doesn't work correctly on SI when there is no other
> + * SE to switch to.
> + */
> + *num_patches = 1;
> + }
Hi Nicolai,
This commit massively decreases tessellation performance on SI 1-SE
parts. We need a different solution. Would this work: "Set num_patches
to the greatest divisor of the number of patches per instance."
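A minimal sketch of that idea, for illustration only. It assumes the number of patches per instance is known at draw time and that the result must still respect the existing upper bound on num_patches. The helper name greatest_divisor_le and both of its parameters are hypothetical and are not part of radeonsi. If num_patches evenly divides the number of patches per instance, no threadgroup should straddle an instance boundary.

```c
/* Hypothetical helper (not an existing radeonsi function):
 * return the largest divisor of patches_per_instance that does not
 * exceed max_patches. Returns 1 when no larger divisor fits, or when
 * either argument is 0.
 */
static unsigned greatest_divisor_le(unsigned patches_per_instance,
                                    unsigned max_patches)
{
	unsigned d;

	if (patches_per_instance == 0 || max_patches == 0)
		return 1;

	/* Start from the smaller of the two values and walk downwards. */
	d = max_patches < patches_per_instance ? max_patches
					       : patches_per_instance;

	for (; d > 1; d--) {
		if (patches_per_instance % d == 0)
			return d;
	}
	return 1;
}
```

In the quoted hunk this would stand in for the unconditional "*num_patches = 1;", e.g. "*num_patches = greatest_divisor_le(patches_per_instance, *num_patches);", where patches_per_instance is an assumed value derived from the draw's vertex count. When the number of patches per instance is prime and larger than the limit, this still degrades to 1.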
Thanks,
Marek