[Mesa-dev] [PATCH] radeonsi: don't use fast color clear for small images even on APUs
Dieter Nützel
Dieter at nuetzel-hh.de
Wed Dec 13 09:03:40 UTC 2017
Tested-by: Dieter Nützel <Dieter at nuetzel-hh.de>
Yes, on RX580 it is slightly more than 20%, but GREAT ;-)
Dieter
Am 13.12.2017 00:53, schrieb Marek Olšák:
> From: Marek Olšák <marek.olsak at amd.com>
>
> Increase the limit and handle non-square images better.
>
> This makes glxgears 20% faster on APUs, and a little more on dGPUs.
> We all use and love glxgears.
> ---
> src/gallium/drivers/radeonsi/si_clear.c | 9 ++++-----
> 1 file changed, 4 insertions(+), 5 deletions(-)
>
> diff --git a/src/gallium/drivers/radeonsi/si_clear.c
> b/src/gallium/drivers/radeonsi/si_clear.c
> index 0ac83f4..464b9d7 100644
> --- a/src/gallium/drivers/radeonsi/si_clear.c
> +++ b/src/gallium/drivers/radeonsi/si_clear.c
> @@ -418,26 +418,25 @@ static void si_do_fast_color_clear(struct
> si_context *sctx,
> sctx->b.family == CHIP_STONEY)
> tex->num_slow_clears++;
> }
>
> bool need_decompress_pass = false;
>
> /* Use a slow clear for small surfaces where the cost of
> * the eliminate pass can be higher than the benefit of fast
> * clear. The closed driver does this, but the numbers may differ.
> *
> - * Always use fast clear on APUs.
> + * This helps on both dGPUs and APUs, even small APUs like Mullins.
> */
> - bool too_small = sctx->screen->info.has_dedicated_vram &&
> - tex->resource.b.b.nr_samples <= 1 &&
> - tex->resource.b.b.width0 <= 256 &&
> - tex->resource.b.b.height0 <= 256;
> + bool too_small = tex->resource.b.b.nr_samples <= 1 &&
> + tex->resource.b.b.width0 *
> + tex->resource.b.b.height0 <= 512 * 512;
>
> /* Try to clear DCC first, otherwise try CMASK. */
> if (vi_dcc_enabled(tex, 0)) {
> uint32_t reset_value;
> bool clear_words_needed;
>
> if (sctx->screen->debug_flags & DBG(NO_DCC_CLEAR))
> continue;
>
> /* This can only occur with MSAA. */
More information about the mesa-dev
mailing list