[Mesa-dev] [PATCH 3/5] radeonsi: disable the patch ID workaround on SI when the patch ID isn't used

Nicolai Hähnle nhaehnle at gmail.com
Wed Jun 7 11:43:54 UTC 2017


On 07.06.2017 13:36, Nicolai Hähnle wrote:
> On 06.06.2017 16:30, Marek Olšák wrote:
>> From: Marek Olšák <marek.olsak at amd.com>
>>
>> The workaround causes a massive performance decrease on 1-SE parts.
>> (Cape Verde, Hainan, Oland)
>>
>> The performance regression is already part of 17.0 and 17.1.
>>
>> Cc: 17.0 17.1 <mesa-stable at lists.freedesktop.org>
>> ---
>>   src/gallium/drivers/radeonsi/si_state_draw.c | 2 +-
>>   1 file changed, 1 insertion(+), 1 deletion(-)
>>
>> diff --git a/src/gallium/drivers/radeonsi/si_state_draw.c b/src/gallium/drivers/radeonsi/si_state_draw.c
>> index cd069e3..75e83ff 100644
>> --- a/src/gallium/drivers/radeonsi/si_state_draw.c
>> +++ b/src/gallium/drivers/radeonsi/si_state_draw.c
>> @@ -188,21 +188,21 @@ static void si_emit_derived_tess_state(struct si_context *sctx,
>>        */
>>       *num_patches = MIN2(*num_patches, 40);
>>       if (sctx->b.chip_class == SI) {
>>           /* SI bug workaround, related to power management. Limit LS-HS
>>            * threadgroups to only one wave.
>>            */
>>           unsigned one_wave = 64 / MAX2(num_tcs_input_cp, num_tcs_output_cp);
>>           *num_patches = MIN2(*num_patches, one_wave);
>> -        if (sctx->screen->b.info.max_se == 1) {
>> +        if (sctx->screen->b.info.max_se == 1 && tcs->info.uses_primid) {
> 
> This is insufficient. All downstream shader stages are affected, 
> including TES, GS, and even PS (unless there's an API GS -- in that 
> case, the PS gets the GS primid output).

Basically, I think this should use 
sctx->ia_multi_vgt_param_key.u.tess_uses_prim_id.

Then there's still the issue that state changes need to be taken into 
account.
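
A rough, stand-alone sketch of the condition suggested above. This is not the driver code: the struct layouts and both helper names are hypothetical stand-ins invented for illustration, and only the field names max_se and tess_uses_prim_id are taken from the discussion. The second helper hints at one possible way to notice a state change, assuming the caller remembers what was used for the previous emit.

```c
#include <stdbool.h>

/* Hypothetical, simplified stand-ins for the driver structures. */
struct sketch_key {
	struct {
		unsigned tess_uses_prim_id : 1; /* any stage after TCS reads the patch ID */
	} u;
};

struct sketch_context {
	unsigned max_se; /* stands in for sctx->screen->b.info.max_se */
	struct sketch_key ia_multi_vgt_param_key;
};

/* True when the one-patch-per-threadgroup workaround would be needed:
 * a single-SE part and a patch ID that is actually consumed somewhere. */
static bool needs_patch_id_workaround(const struct sketch_context *sctx)
{
	return sctx->max_se == 1 &&
	       sctx->ia_multi_vgt_param_key.u.tess_uses_prim_id;
}

/* Hypothetical helper: compare against the value used for the last emit,
 * so that a change in patch ID usage forces the derived tess state to be
 * recomputed. */
static bool workaround_state_changed(bool last_emitted,
				     const struct sketch_context *sctx)
{
	return last_emitted != needs_patch_id_workaround(sctx);
}
```

How the real driver would track the previous value and trigger the re-emit is left open here.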


> 
> Cheers,
> Nicolai
> 
> 
>>               /* The VGT HS block increments the patch ID unconditionally
>>                * within a single threadgroup. This results in incorrect
>>                * patch IDs when instanced draws are used.
>>                *
>>                * The intended solution is to restrict threadgroups to
>>                * a single instance by setting SWITCH_ON_EOI, which
>>                * should cause IA to split instances up. However, this
>>                * doesn't work correctly on SI when there is no other
>>                * SE to switch to.
>>                */
>>
> 
> 


-- 
Learn how the world really is,
But never forget how it should be.

