[Mesa-dev] [PATCH v2 2/2] nvir/gm107: consider FILE_FLAGS dependencies in SchedDataCalculatorGM107
Samuel Pitoiset
samuel.pitoiset at gmail.com
Mon Feb 26 11:05:34 UTC 2018
On 02/26/2018 12:01 PM, Karol Herbst wrote:
> On Mon, Feb 26, 2018 at 11:52 AM, Samuel Pitoiset
> <samuel.pitoiset at gmail.com> wrote:
>>
>>
>> On 02/24/2018 11:36 AM, Karol Herbst wrote:
>>>
>>> currently while insterting barriers, writes and reads to FILE_FLAGS aren't
>>> considered. This can lead to WaR hazards in some situations.
>>>
>>> With the previous commit fixes shaders with intstructions like this:
>>> mad u32 $r2 $r4 $r11 $r2
>>> mad u32 { $r5 $c0 } $r4 $r10 $r6
>>> mad (SUBOP:1) u32 $r3 $r4 $r10 $r2 $c0
>>>
>>> Affects OpenCL CTS tests on Maxwell+:
>>> basic/test_basic intmath_long
>>> basic/test_basic intmath_long2
>>> basic/test_basic intmath_long4
>>>
>>> v2: only put barriers on instructions which actually read flags
>>>
>>> Signed-off-by: Karol Herbst <kherbst at redhat.com>
>>> ---
>>> .../drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp | 15
>>> ++++++++++++++-
>>> 1 file changed, 14 insertions(+), 1 deletion(-)
>>>
>>> diff --git a/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp
>>> b/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp
>>> index 96bd276884..fafece81ad 100644
>>> --- a/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp
>>> +++ b/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp
>>> @@ -3944,6 +3944,7 @@ SchedDataCalculatorGM107::needWrDepBar(const
>>> Instruction *insn) const
>>> for (int d = 0; insn->defExists(d); ++d) {
>>> if (insn->def(d).getFile() == FILE_GPR ||
>>> + insn->def(d).getFile() == FILE_FLAGS ||
>>> insn->def(d).getFile() == FILE_PREDICATE)
>>> return true;
>>> }
>>> @@ -3983,6 +3984,12 @@ SchedDataCalculatorGM107::findFirstUse(const
>>> Instruction *bari) const
>>> continue;
>>> return insn;
>>> }
>>> + if (def.getFile() == FILE_FLAGS) {
>>> + if (insn->src(s).getFile() != FILE_FLAGS ||
>>> + src->reg.data.id != minGPR)
>>> + continue;
>>> + return insn;
>>> + }
>>> }
>>> }
>>> }
>>> @@ -4002,7 +4009,8 @@ SchedDataCalculatorGM107::findFirstDef(const
>>> Instruction *bari) const
>>> for (int d = 0; insn->defExists(d); ++d) {
>>> const Value *def = insn->def(d).rep();
>>> - if (insn->def(d).getFile() != FILE_GPR)
>>> + if (insn->def(d).getFile() != FILE_GPR &&
>>> + insn->def(d).getFile() != FILE_FLAGS)
>>> continue;
>>> minGPR = def->reg.data.id;
>>> @@ -4010,7 +4018,12 @@ SchedDataCalculatorGM107::findFirstDef(const
>>> Instruction *bari) const
>>> for (int s = 0; bari->srcExists(s); ++s) {
>>> const Value *src = bari->src(s).rep();
>>> + if (bari->src(s).getFile() == FILE_FLAGS &&
>>> + insn->def(d).getFile() == FILE_FLAGS &&
>>> + src->reg.data.id == minGPR)
>>
>>
>> Why don't you check the dst GPR id too?
>>
>
> minGPR is def.rep()->reg.data.id. I just did what we do for predicates
> here, because it should be kind of the same.
Right, missed that.
Reviewed-by: Samuel Pitoiset <samuel.pitoiset at gmail.com>
Thanks!
>
>>
>>> + return insn;
>>> if (bari->src(s).getFile() != FILE_GPR ||
>>> + insn->def(d).getFile() != FILE_GPR ||
>>> src->reg.data.id + src->reg.size / 4 - 1 < minGPR ||
>>> src->reg.data.id > maxGPR)
>>> continue;
>>>
>>
More information about the mesa-dev
mailing list