[Mesa-dev] [PATCH] st/mesa: change SQRT lowering to fix the game Risen

Axel Davy axel.davy at ens.fr
Sun Jun 5 16:34:11 UTC 2016


if abs is not inserted in the source glsl, that's a wine bug, not a mesa 
bug,
Thus I don't think doing this is a good idea.

On 05/06/2016 17:17, Ilia Mirkin wrote:
> On Mon, May 30, 2016 at 7:19 PM, Marek Olšák <maraeo at gmail.com> wrote:
>> From: Marek Olšák <marek.olsak at amd.com>
>>
>> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94627
>> (against nouveau)
>> ---
>>   src/mesa/state_tracker/st_glsl_to_tgsi.cpp | 16 +++++++++-------
>>   1 file changed, 9 insertions(+), 7 deletions(-)
>>
>> diff --git a/src/mesa/state_tracker/st_glsl_to_tgsi.cpp b/src/mesa/state_tracker/st_glsl_to_tgsi.cpp
>> index aa443a5..0f5ee02 100644
>> --- a/src/mesa/state_tracker/st_glsl_to_tgsi.cpp
>> +++ b/src/mesa/state_tracker/st_glsl_to_tgsi.cpp
>> @@ -1901,13 +1901,15 @@ glsl_to_tgsi_visitor::visit_expression(ir_expression* ir, st_src_reg *op)
>>         if (have_sqrt) {
>>            emit_scalar(ir, TGSI_OPCODE_SQRT, result_dst, op[0]);
>>         } else {
>> -         /* sqrt(x) = x * rsq(x). */
>> -         emit_scalar(ir, TGSI_OPCODE_RSQ, result_dst, op[0]);
>> -         emit_asm(ir, TGSI_OPCODE_MUL, result_dst, result_src, op[0]);
>> -         /* For incoming channels <= 0, set the result to 0. */
>> -         op[0].negate = ~op[0].negate;
>> -         emit_asm(ir, TGSI_OPCODE_CMP, result_dst,
>> -              op[0], result_src, st_src_reg_for_float(0.0));
>> +         /* This is the only instruction sequence that makes the game "Risen"
>> +          * render correctly. ABS is not required for the game, but since GLSL
>> +          * declares negative values as "undefined", allowing us to do whatever
>> +          * we want, I choose to use ABS to match DX9 and pre-GLSL RSQ
>> +          * behavior.
>> +          */
>> +         emit_scalar(ir, TGSI_OPCODE_ABS, result_dst, op[0]);
>> +         emit_scalar(ir, TGSI_OPCODE_RSQ, result_dst, result_src);
>> +         emit_scalar(ir, TGSI_OPCODE_RCP, result_dst, result_src);
> I spent a bunch of time trying to come up with alternatives that still
> had the desired behavior for 0 and Infinity (since RCP sucks in terms
> of precision). At the end of the day, it depends on what the hw does
> with RSQ(0) and RCP(0). For nv50/nvc0, this sequence did have the
> desired behavior - perhaps that holds for all hardware?
>
> Either way, I'm fairly ambivalent about this - nv50/nvc0 now claim to
> have SQRT support (and perform the lowering in the backend), and I
> doubt much of anything would care on nv30. I'd just as soon leave the
> abs out of it, but I don't strongly care.
>
> Acked-by: Ilia Mirkin <imirkin at alum.mit.edu>
>
>>         }
>>         break;
>>      case ir_unop_rsq:
>> --
>> 2.7.4
>>
>> _______________________________________________
>> mesa-dev mailing list
>> mesa-dev at lists.freedesktop.org
>> https://lists.freedesktop.org/mailman/listinfo/mesa-dev
> _______________________________________________
> mesa-dev mailing list
> mesa-dev at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/mesa-dev




More information about the mesa-dev mailing list