[Mesa-dev] [PATCH] i965/vec4: fill src_reg type using the constructor type parameter

Matt Turner mattst88 at gmail.com
Tue Sep 1 10:53:52 PDT 2015


On Tue, Sep 1, 2015 at 8:02 AM, Alejandro Piñeiro <apinheiro at igalia.com> wrote:
> The src_reg constructor that receives a glsl_type was using it
> only to build the swizzle, not to fill this->type as the dst_reg
> constructor does.
>
> This caused a type mismatch between movs and alu operations
> on the NIR path, so the copy propagation optimization was not applied
> to remove unneeded movs when a negate modifier was involved. This was
> first detected on minus (negate+add) operations.
>
> Shader DB results (taking into account only vec4):
>
> total instructions in shared programs: 20019 -> 19934 (-0.42%)
> instructions in affected programs:     2918 -> 2833 (-2.91%)
> helped:                                79
> HURT:                                  0
> GAINED:                                0
> LOST:                                  0
> ---

How silly. :)

Thanks for finding that.

Reviewed-by: Matt Turner <mattst88 at gmail.com>
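
For readers following along without the tree handy, here is a minimal
sketch of the failure mode the commit message describes. Everything in it
is a hypothetical stand-in (reg_type, glsl_type_sketch, src_reg_sketch,
make_src_before, make_src_after, can_propagate, and the zero default for
the type field are all assumptions made for illustration); it is not the
actual i965 code, only a toy model of "constructor fills the swizzle but
not the type, so a type check in copy propagation rejects a negated source".

```cpp
#include <cassert>

// Hypothetical register types; TYPE_UD is assumed to be the zero default.
enum reg_type { TYPE_UD = 0, TYPE_D, TYPE_F };

// Hypothetical stand-in for the type information passed to the constructor.
struct glsl_type_sketch {
    reg_type base;        // register type this GLSL type maps to
    unsigned components;  // number of vector components, 1..4
};

// Hypothetical stand-in for a source register.
struct src_reg_sketch {
    reg_type type = TYPE_UD;  // stays at the default unless filled in
    unsigned swizzle = 0;     // 2 bits per channel, X = 0 .. W = 3
    bool negate = false;      // source modifier
};

// Build a swizzle that reads the first n channels and repeats the last one.
unsigned swizzle_for_components(unsigned n) {
    assert(n >= 1 && n <= 4);
    unsigned swz = 0;
    for (unsigned i = 0; i < 4; i++) {
        unsigned chan = i < n ? i : n - 1;
        swz |= chan << (2 * i);
    }
    return swz;
}

// "Before": the type argument is used for the swizzle only.
src_reg_sketch make_src_before(const glsl_type_sketch &t) {
    src_reg_sketch r;
    r.swizzle = swizzle_for_components(t.components);
    return r;
}

// "After": the type argument also fills the register type.
src_reg_sketch make_src_after(const glsl_type_sketch &t) {
    src_reg_sketch r = make_src_before(t);
    r.type = t.base;
    return r;
}

// Toy copy-propagation check: a negated source is only forwarded into an
// instruction that reads it with the same type, since negating the same
// bits as an integer and as a float gives different results.
bool can_propagate(const src_reg_sketch &mov_src, reg_type use_type) {
    if (mov_src.negate && mov_src.type != use_type)
        return false;
    return true;
}
```

In this model, a negated float source built with make_src_before keeps the
default type, so can_propagate refuses it for a float consumer and the mov
stays; the same source built with make_src_after is accepted.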

