[Mesa-dev] [PATCH 03/13] i965: use nir_lower_indirect_derefs() for GLSL
Jason Ekstrand
jason at jlekstrand.net
Thu Dec 22 20:36:04 UTC 2016
On Wed, Dec 21, 2016 at 6:26 PM, Timothy Arceri <
timothy.arceri at collabora.com> wrote:
> This moves the nir_lower_indirect_derefs() call into
> brw_preprocess_nir() so thats is called by both OpenGL and Vulkan
> and removes that call to the old GLSL IR pass
> lower_variable_index_to_cond_assign()
>
> We want to do this pass in nir to be able to move loop unrolling
> to nir.
>
> There is a increase of 1-3 instructions in a small number of shaders,
> and 2 Kerbal Space program shaders that increase by 32 instructions.
> The changes seem to be caused be the difference in the GLSL IR vs
> NIR variable index lowering passes. The GLSL IR pass creates a
> simple if ladder for arrays of size 4 or less, while the NIR pass
> implements a binary search for all arrays regardless of size.
>
> Shader-db results BDW:
>
> total instructions in shared programs: 13021176 -> 13021819 (0.00%)
> instructions in affected programs: 57693 -> 58336 (1.11%)
> helped: 20
> HURT: 190
>
> total cycles in shared programs: 299805580 -> 299750826 (-0.02%)
> cycles in affected programs: 2290024 -> 2235270 (-2.39%)
> helped: 337
> HURT: 442
>
> total fills in shared programs: 19984 -> 19984 (0.00%)
> fills in affected programs: 0 -> 0
> helped: 0
> HURT: 0
>
> LOST: 4
> GAINED: 0
>
> V2: remove the do_copy_propagation() call from the i965 GLSL IR
> linking code. This call was added in f7741c52111 but since we are
> moving the variable index lowering to NIR we no longer need it and
> can just rely on the nir copy propagation pass.
>
> Reviewed-by: Kenneth Graunke <kenneth at whitecape.org>
> ---
> src/intel/vulkan/anv_pipeline.c | 10 ----------
> src/mesa/drivers/dri/i965/brw_link.cpp | 15 ---------------
> src/mesa/drivers/dri/i965/brw_nir.c | 10 ++++++++++
> 3 files changed, 10 insertions(+), 25 deletions(-)
>
> diff --git a/src/intel/vulkan/anv_pipeline.c b/src/intel/vulkan/anv_
> pipeline.c
> index 9104267..e2fbcab 100644
> --- a/src/intel/vulkan/anv_pipeline.c
> +++ b/src/intel/vulkan/anv_pipeline.c
> @@ -190,16 +190,6 @@ anv_shader_compile_to_nir(struct anv_device *device,
>
> nir_shader_gather_info(nir, entry_point->impl);
>
> - nir_variable_mode indirect_mask = 0;
> - if (compiler->glsl_compiler_options[stage].EmitNoIndirectInput)
> - indirect_mask |= nir_var_shader_in;
> - if (compiler->glsl_compiler_options[stage].EmitNoIndirectOutput)
> - indirect_mask |= nir_var_shader_out;
> - if (compiler->glsl_compiler_options[stage].EmitNoIndirectTemp)
> - indirect_mask |= nir_var_local;
> -
> - nir_lower_indirect_derefs(nir, indirect_mask);
> -
> return nir;
> }
>
> diff --git a/src/mesa/drivers/dri/i965/brw_link.cpp
> b/src/mesa/drivers/dri/i965/brw_link.cpp
> index 6f37428..0d8a626 100644
> --- a/src/mesa/drivers/dri/i965/brw_link.cpp
> +++ b/src/mesa/drivers/dri/i965/brw_link.cpp
> @@ -134,21 +134,6 @@ process_glsl_ir(struct brw_context *brw,
> lower_noise(shader->ir);
> lower_quadop_vector(shader->ir, false);
>
> - do_copy_propagation(shader->ir);
> -
> - bool lowered_variable_indexing =
> - lower_variable_index_to_cond_assign(shader->Stage, shader->ir,
> - options->EmitNoIndirectInput,
> - options->EmitNoIndirectOutput,
> - options->EmitNoIndirectTemp,
> - options->
> EmitNoIndirectUniform);
> -
> - if (unlikely(brw->perf_debug && lowered_variable_indexing)) {
> - perf_debug("Unsupported form of variable indexing in %s; falling "
> - "back to very inefficient code generation\n",
> - _mesa_shader_stage_to_abbrev(shader->Stage));
> - }
> -
> bool progress;
> do {
> progress = false;
> diff --git a/src/mesa/drivers/dri/i965/brw_nir.c
> b/src/mesa/drivers/dri/i965/brw_nir.c
> index 55b16cf..7624126 100644
> --- a/src/mesa/drivers/dri/i965/brw_nir.c
> +++ b/src/mesa/drivers/dri/i965/brw_nir.c
> @@ -486,6 +486,16 @@ brw_preprocess_nir(const struct brw_compiler
> *compiler, nir_shader *nir)
> /* Lower a bunch of stuff */
> OPT_V(nir_lower_var_copies);
>
> + nir_variable_mode indirect_mask = 0;
> + if (compiler->glsl_compiler_options[nir->stage].EmitNoIndirectInput)
> + indirect_mask |= nir_var_shader_in;
> + if (compiler->glsl_compiler_options[nir->stage].EmitNoIndirectOutput)
> + indirect_mask |= nir_var_shader_out;
> + if (compiler->glsl_compiler_options[nir->stage].EmitNoIndirectTemp)
> + indirect_mask |= nir_var_local;
> +
> + nir_lower_indirect_derefs(nir, indirect_mask);
>
This needs to happen *after* lower_clip_cull_distance_arrays otherwise the
"compact" value won't get set on the clip and cull variables and
nir_lower_io will throw a fit. Probably best to just move
lower_clip_cull_distance_arrays up. With that change,
Reviewed-by: Jason Ekstrand <jason at jlekstrand.net>
> +
> /* Get rid of split copies */
> nir = nir_optimize(nir, is_scalar);
>
> --
> 2.9.3
>
> _______________________________________________
> mesa-dev mailing list
> mesa-dev at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/mesa-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/mesa-dev/attachments/20161222/fd85371b/attachment.html>
More information about the mesa-dev
mailing list