<p dir="ltr">I don't think anyone is trying to throw away phi's... I think what Connor is proposing is more a better interface to this cloning operation than anything else. Instead of having a separate list of phi's, you can just pre-populate the remap table before doing the clone. For the first loop iteration, you add remaps that remap all of the "header phi's" (those in the first block in the loop) to their values coming from before the loop. For each subsequent copy operation, you populate the remap table such that the header phis map to their values from the previous iteration which you can get from the final state of the remap table after the clone is complete.</p>
<p dir="ltr">This should be completely safe as the only way for a value to propagate from one iteration to the next is through a header phi and, assuming no continues, each header phi will have exactly two sources: one from before the loop and one from the previous iteration.</p>
<p dir="ltr">This would substantially simplify the interface as well as remove the linear walk over the phi remap table every time we do a remap.</p>
<p dir="ltr">--Jason</p>
<div class="gmail_extra"><br><div class="gmail_quote">On Aug 29, 2016 9:40 PM, "Timothy Arceri" <<a href="mailto:timothy.arceri@collabora.com">timothy.arceri@collabora.com</a>> wrote:<br type="attribution"><blockquote class="quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="elided-text">On Mon, 2016-08-29 at 21:06 -0400, Connor Abbott wrote:<br>
> So, you've noticed that your method of handling phi's while cloning<br>
> doesn't handle phi's that point to other phi's in the same block.<br>
> Particularly, something like:<br>
><br>
> a = phi(b, ...)<br>
> b = phi(a, ...)<br>
><br>
> which is supposed to swap a and b each iteration. Here's a better<br>
> strategy which should be simpler than this and fixes that problem.<br>
> Create a new remap_table for each iteration of the loop. On the first<br>
> iteration, make all the phi nodes remap to their pre-loop sources<br>
> before cloning the body. On each successive iteration, make each phi<br>
> node map to what the *old* remap table (from the previous iteration)<br>
> says that it's source from the previous iteration remaps to. In<br>
> pseudocode:<br>
><br>
> nir_cf_list loop_body;<br>
> extract loop body into loop_body<br>
> move loop header instructions (minus phi nodes) into loop_body<br>
><br>
> struct hash_table *old_remap_table = NULL, *remap_table;<br>
> for each iteration:<br>
> remap_table = new dictionary;<br>
> for each phi in original loop:<br>
> if old_remap_table:<br>
> remap_table[phi] = old_remap_table[phi source pointing to<br>
> end of loop]<br>
> else:<br>
> remap_table[phi] = phi source pointing to block before<br>
> loop<br>
> nir_cf_list cloned_body;<br>
> clone loop_body to cloned_body using remap_table<br>
> insert cloned_body before loop<br>
><br>
> delete loop_body<br>
> delete loop<br>
<br>
</div>I see what you are getting at but what about something like this?<br>
<br>
int j = 0;<br>
for (int i=0 ; i < 5; i++) {<br>
...<br>
j++;<br>
if (uniform)<br>
j += 2;<br>
}<br>
<br>
I'm not sure we can throw away phis so easily.<br>
<div class="elided-text"><br>
><br>
> This should eliminate the need to expose nir_cf_list_cleanup(),<br>
> stitch_blocks(), etc. publicly as well as the previous patch, and it<br>
> should handle the phi swapping case correctly.<br>
><br>
><br>
> On Mon, Aug 29, 2016 at 12:59 AM, Timothy Arceri<br>
> <<a href="mailto:timothy.arceri@collabora.com">timothy.arceri@collabora.com</a>> wrote:<br>
> ><br>
> > ---<br>
> > src/compiler/nir/nir.h <wbr>| 3 +++<br>
> > src/compiler/nir/nir_clone.c | 64<br>
> > ++++++++++++++++++++++++++++++<wbr>+++++++++-----<br>
> > 2 files changed, 60 insertions(+), 7 deletions(-)<br>
> ><br>
> > diff --git a/src/compiler/nir/nir.h b/src/compiler/nir/nir.h<br>
> > index 0ab3ebc..9083bd0 100644<br>
> > --- a/src/compiler/nir/nir.h<br>
> > +++ b/src/compiler/nir/nir.h<br>
> > @@ -2372,6 +2372,9 @@ void nir_print_instr(const nir_instr *instr,<br>
> > FILE *fp);<br>
> ><br>
> > nir_shader *nir_shader_clone(void *mem_ctx, const nir_shader *s);<br>
> > nir_function_impl *nir_function_impl_clone(const nir_function_impl<br>
> > *fi);<br>
> > +void nir_clone_loop_list(struct exec_list *dst, const struct<br>
> > exec_list *list,<br>
> > + <wbr>struct hash_table *remap_table,<br>
> > + <wbr>struct hash_table *phi_remap, nir_shader<br>
> > *ns);<br>
> > nir_constant *nir_constant_clone(const nir_constant *c,<br>
> > nir_variable *var);<br>
> > nir_variable *nir_variable_clone(const nir_variable *c, nir_shader<br>
> > *shader);<br>
> ><br>
> > diff --git a/src/compiler/nir/nir_clone.c<br>
> > b/src/compiler/nir/nir_clone.c<br>
> > index 8808333..071afc9 100644<br>
> > --- a/src/compiler/nir/nir_clone.c<br>
> > +++ b/src/compiler/nir/nir_clone.c<br>
> > @@ -35,9 +35,17 @@ typedef struct {<br>
> > /* True if we are cloning an entire shader. */<br>
> > bool global_clone;<br>
> ><br>
> > + /* This allows us to clone a loop body without having to add<br>
> > srcs from<br>
> > + * outside the loop to the remap table. This is useful for loop<br>
> > unrolling.<br>
> > + */<br>
> > + bool allow_remap_fallback;<br>
> > +<br>
> > /* maps orig ptr -> cloned ptr: */<br>
> > struct hash_table *remap_table;<br>
> ><br>
> > + /* used for remaping when cloning loop body for loop unrolling<br>
> > */<br>
> > + struct hash_table *phi_remap_table;<br>
> > +<br>
> > /* List of phi sources. */<br>
> > struct list_head phi_srcs;<br>
> ><br>
> > @@ -46,11 +54,20 @@ typedef struct {<br>
> > } clone_state;<br>
> ><br>
> > static void<br>
> > -init_clone_state(clone_state *state, bool global)<br>
> > +init_clone_state(clone_state *state, struct hash_table<br>
> > *remap_table,<br>
> > + bool global, bool allow_remap_fallback)<br>
> > {<br>
> > state->global_clone = global;<br>
> > - state->remap_table = _mesa_hash_table_create(NULL,<br>
> > _mesa_hash_pointer,<br>
> > - <wbr> _mesa_key_<wbr>pointer_<br>
> > equal);<br>
> > + state->allow_remap_<wbr>fallback = allow_remap_fallback;<br>
> > +<br>
> > + state->phi_remap_table = NULL;<br>
> > + if (remap_table) {<br>
> > + state->remap_table = remap_table;<br>
> > + } else {<br>
> > + state->remap_table = _mesa_hash_table_create(NULL,<br>
> > _mesa_hash_pointer,<br>
> > + <wbr> _mesa_<wbr>key_point<br>
> > er_equal);<br>
> > + }<br>
> > +<br>
> > list_inithead(&state->phi_<wbr>srcs);<br>
> > }<br>
> ><br>
> > @@ -72,16 +89,32 @@ _lookup_ptr(clone_state *state, const void<br>
> > *ptr, bool global)<br>
> > return (void *)ptr;<br>
> ><br>
> > entry = _mesa_hash_table_search(state-<wbr>>remap_table, ptr);<br>
> > - assert(entry && "Failed to find pointer!");<br>
> > if (!entry)<br>
> > - return NULL;<br>
> > + return state->allow_remap_fallback ? (void *)ptr : NULL;<br>
> ><br>
> > return entry->data;<br>
> > }<br>
> ><br>
> > +/**<br>
> > + * Updates a phi remap table used for unrolling loops.<br>
> > + */<br>
> > +static void<br>
> > +update_phi_remap_table(clone_<wbr>state *state, const void *ptr, void<br>
> > *nptr)<br>
> > +{<br>
> > + if (state->phi_remap_table == NULL)<br>
> > + return;<br>
> > +<br>
> > + struct hash_entry *hte;<br>
> > + hash_table_foreach(state-><wbr>phi_remap_table, hte) {<br>
> > + if (hte->data == ptr)<br>
> > + hte->data = nptr;<br>
> > + }<br>
> > +}<br>
> > +<br>
> > static void<br>
> > add_remap(clone_state *state, void *nptr, const void *ptr)<br>
> > {<br>
> > + update_phi_remap_table(<wbr>state, ptr, nptr);<br>
> > _mesa_hash_table_insert(<wbr>state->remap_table, ptr, nptr);<br>
> > }<br>
> ><br>
> > @@ -613,6 +646,23 @@ fixup_phi_srcs(clone_state *state)<br>
> > assert(list_empty(&state-><wbr>phi_srcs));<br>
> > }<br>
> ><br>
> > +void<br>
> > +nir_clone_loop_list(struct exec_list *dst, const struct exec_list<br>
> > *list,<br>
> > + struct hash_table *remap_table,<br>
> > + struct hash_table *phi_remap, nir_shader *ns)<br>
> > +{<br>
> > + clone_state state;<br>
> > + init_clone_state(&state, remap_table, false, true);<br>
> > +<br>
> > + /* We use the same shader */<br>
> > + state.ns = ns;<br>
> > + state.phi_remap_table = phi_remap;<br>
> > +<br>
> > + clone_cf_list(&state, dst, list);<br>
> > +<br>
> > + fixup_phi_srcs(&state);<br>
> > +}<br>
> > +<br>
> > static nir_function_impl *<br>
> > clone_function_impl(clone_<wbr>state *state, const nir_function_impl<br>
> > *fi)<br>
> > {<br>
> > @@ -646,7 +696,7 @@ nir_function_impl *<br>
> > nir_function_impl_clone(const nir_function_impl *fi)<br>
> > {<br>
> > clone_state state;<br>
> > - init_clone_state(&state, false);<br>
> > + init_clone_state(&state, NULL, false, false);<br>
> ><br>
> > /* We use the same shader */<br>
> > state.ns = fi->function->shader;<br>
> > @@ -686,7 +736,7 @@ nir_shader *<br>
> > nir_shader_clone(void *mem_ctx, const nir_shader *s)<br>
> > {<br>
> > clone_state state;<br>
> > - init_clone_state(&state, true);<br>
> > + init_clone_state(&state, NULL, true, false);<br>
> ><br>
> > nir_shader *ns = nir_shader_create(mem_ctx, s->stage, s-<br>
> > >options);<br>
> > state.ns = ns;<br>
> > --<br>
> > 2.7.4<br>
> ><br>
> > ______________________________<wbr>_________________<br>
> > mesa-dev mailing list<br>
> > <a href="mailto:mesa-dev@lists.freedesktop.org">mesa-dev@lists.freedesktop.org</a><br>
> > <a href="https://lists.freedesktop.org/mailman/listinfo/mesa-dev" rel="noreferrer" target="_blank">https://lists.freedesktop.org/<wbr>mailman/listinfo/mesa-dev</a><br>
> ______________________________<wbr>_________________<br>
> mesa-dev mailing list<br>
> <a href="mailto:mesa-dev@lists.freedesktop.org">mesa-dev@lists.freedesktop.org</a><br>
> <a href="https://lists.freedesktop.org/mailman/listinfo/mesa-dev" rel="noreferrer" target="_blank">https://lists.freedesktop.org/<wbr>mailman/listinfo/mesa-dev</a><br>
______________________________<wbr>_________________<br>
mesa-dev mailing list<br>
<a href="mailto:mesa-dev@lists.freedesktop.org">mesa-dev@lists.freedesktop.org</a><br>
<a href="https://lists.freedesktop.org/mailman/listinfo/mesa-dev" rel="noreferrer" target="_blank">https://lists.freedesktop.org/<wbr>mailman/listinfo/mesa-dev</a><br>
</div></blockquote></div><br></div>