[Mesa-dev] [PATCH v2 2/2] nvir/gm107: consider FILE_FLAGS dependencies in SchedDataCalculatorGM107

Karol Herbst kherbst at redhat.com
Mon Feb 26 11:01:09 UTC 2018


On Mon, Feb 26, 2018 at 11:52 AM, Samuel Pitoiset
<samuel.pitoiset at gmail.com> wrote:
>
>
> On 02/24/2018 11:36 AM, Karol Herbst wrote:
>>
>> currently while insterting barriers, writes and reads to FILE_FLAGS aren't
>> considered. This can lead to WaR hazards in some situations.
>>
>> With the previous commit fixes shaders with intstructions like this:
>>    mad u32 $r2 $r4 $r11 $r2
>>    mad u32 { $r5 $c0 } $r4 $r10 $r6
>>    mad (SUBOP:1) u32 $r3 $r4 $r10 $r2 $c0
>>
>> Affects OpenCL CTS tests on Maxwell+:
>> basic/test_basic intmath_long
>> basic/test_basic intmath_long2
>> basic/test_basic intmath_long4
>>
>> v2: only put barriers on instructions which actually read flags
>>
>> Signed-off-by: Karol Herbst <kherbst at redhat.com>
>> ---
>>   .../drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp        | 15
>> ++++++++++++++-
>>   1 file changed, 14 insertions(+), 1 deletion(-)
>>
>> diff --git a/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp
>> b/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp
>> index 96bd276884..fafece81ad 100644
>> --- a/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp
>> +++ b/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp
>> @@ -3944,6 +3944,7 @@ SchedDataCalculatorGM107::needWrDepBar(const
>> Instruction *insn) const
>>        for (int d = 0; insn->defExists(d); ++d) {
>>         if (insn->def(d).getFile() == FILE_GPR ||
>> +          insn->def(d).getFile() == FILE_FLAGS ||
>>             insn->def(d).getFile() == FILE_PREDICATE)
>>            return true;
>>      }
>> @@ -3983,6 +3984,12 @@ SchedDataCalculatorGM107::findFirstUse(const
>> Instruction *bari) const
>>                     continue;
>>                  return insn;
>>               }
>> +            if (def.getFile() == FILE_FLAGS) {
>> +               if (insn->src(s).getFile() != FILE_FLAGS ||
>> +                   src->reg.data.id != minGPR)
>> +                  continue;
>> +               return insn;
>> +            }
>>            }
>>         }
>>      }
>> @@ -4002,7 +4009,8 @@ SchedDataCalculatorGM107::findFirstDef(const
>> Instruction *bari) const
>>           for (int d = 0; insn->defExists(d); ++d) {
>>            const Value *def = insn->def(d).rep();
>> -         if (insn->def(d).getFile() != FILE_GPR)
>> +         if (insn->def(d).getFile() != FILE_GPR &&
>> +             insn->def(d).getFile() != FILE_FLAGS)
>>               continue;
>>              minGPR = def->reg.data.id;
>> @@ -4010,7 +4018,12 @@ SchedDataCalculatorGM107::findFirstDef(const
>> Instruction *bari) const
>>              for (int s = 0; bari->srcExists(s); ++s) {
>>               const Value *src = bari->src(s).rep();
>> +            if (bari->src(s).getFile() == FILE_FLAGS &&
>> +                insn->def(d).getFile() == FILE_FLAGS &&
>> +                src->reg.data.id == minGPR)
>
>
> Why don't you check the dst GPR id too?
>

minGPR is def.rep()->reg.data.id. I just did what we do for predicates
here, because it should be kind of the same.

>
>> +               return insn;
>>               if (bari->src(s).getFile() != FILE_GPR ||
>> +                insn->def(d).getFile() != FILE_GPR ||
>>                   src->reg.data.id + src->reg.size / 4 - 1 < minGPR ||
>>                   src->reg.data.id > maxGPR)
>>                  continue;
>>
>


More information about the mesa-dev mailing list