[Mesa-dev] [PATCH] intel/fs: Don't emit a des copy for image ops with has_dest == false
Matt Turner
mattst88 at gmail.com
Tue Mar 27 23:33:34 UTC 2018
On Tue, Mar 27, 2018 at 4:29 PM, Jason Ekstrand <jason at jlekstrand.net> wrote:
> This was causing us to walk dest_components times over a thing with no
> destination. This happened to work because all of the image intrinsics
> without a destination also happened to have dest_components == 0. We
> shouldn't be reading dest_components if has_dest == false.
> ---
> src/intel/compiler/brw_fs_nir.cpp | 8 +++++---
> 1 file changed, 5 insertions(+), 3 deletions(-)
>
> diff --git a/src/intel/compiler/brw_fs_nir.cpp b/src/intel/compiler/brw_fs_nir.cpp
> index f5d5399..8d1c387 100644
> --- a/src/intel/compiler/brw_fs_nir.cpp
> +++ b/src/intel/compiler/brw_fs_nir.cpp
> @@ -3848,9 +3848,11 @@ fs_visitor::nir_emit_intrinsic(const fs_builder &bld, nir_intrinsic_instr *instr
> get_image_atomic_op(instr->intrinsic, type));
>
> /* Assign the result. */
> - for (unsigned c = 0; c < info->dest_components; ++c)
> - bld.MOV(offset(retype(dest, base_type), bld, c),
> - offset(tmp, bld, c));
> + if (nir_intrinsic_infos[instr->intrinsic].has_dest) {
> + for (unsigned c = 0; c < info->dest_components; ++c)
> + bld.MOV(offset(retype(dest, base_type), bld, c),
> + offset(tmp, bld, c));
Nested control flow and a multiline statement: braces required. Please
fix while you're here.
Reviewed-by: Matt Turner <mattst88 at gmail.com>
More information about the mesa-dev
mailing list