<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">On Wed, May 10, 2017 at 1:35 PM, Nanley Chery <span dir="ltr"><<a href="mailto:nanleychery@gmail.com" target="_blank">nanleychery@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class="gmail-HOEnZb"><div class="gmail-h5">On Wed, May 10, 2017 at 12:38:50PM -0700, Jason Ekstrand wrote:<br>
> On Wed, May 10, 2017 at 11:30 AM, Nanley Chery <<a href="mailto:nanleychery@gmail.com">nanleychery@gmail.com</a>><br>
> wrote:<br>
><br>
> > On Wed, May 03, 2017 at 02:24:19PM -0700, Jason Ekstrand wrote:<br>
> > > On Thu, Apr 27, 2017 at 11:32 AM, Nanley Chery <<a href="mailto:nanleychery@gmail.com">nanleychery@gmail.com</a>><br>
> > > wrote:<br>
> > ><br>
> > > > Signed-off-by: Nanley Chery <<a href="mailto:nanley.g.chery@intel.com">nanley.g.chery@intel.com</a>><br>
> > > > ---<br>
> > > > src/intel/vulkan/genX_cmd_<wbr>buffer.c | 93<br>
> > ++++++++++++++++++++++++++++++<br>
> > > > ++++++++<br>
> > > > 1 file changed, 93 insertions(+)<br>
> > > ><br>
> > > > diff --git a/src/intel/vulkan/genX_cmd_<wbr>buffer.c<br>
> > > > b/src/intel/vulkan/genX_cmd_<wbr>buffer.c<br>
> > > > index d5cc358aec..1ae0c3256e 100644<br>
> > > > --- a/src/intel/vulkan/genX_cmd_<wbr>buffer.c<br>
> > > > +++ b/src/intel/vulkan/genX_cmd_<wbr>buffer.c<br>
> > > > @@ -454,6 +454,99 @@ genX(transfer_clear_value)(<wbr>struct anv_cmd_buffer<br>
> > *<br>
> > > > const cmd_buffer,<br>
> > > > }<br>
> > > > }<br>
> > > ><br>
> > > > +/* Transitions a color buffer from one layout to another. */<br>
> > > > +static void<br>
> > > > +transition_color_buffer(<wbr>struct anv_cmd_buffer * const cmd_buffer,<br>
> > > > + const struct anv_image * const image,<br>
> > > > + const uint32_t base_level, uint32_t<br>
> > level_count,<br>
> > > > + const uint32_t base_layer, uint32_t<br>
> > layer_count,<br>
> > > > + const VkImageLayout initial_layout,<br>
> > > > + const VkImageLayout final_layout)<br>
> > > > +{<br>
> > > > + assert(cmd_buffer && image);<br>
> > > > +<br>
> > > > + /* This must be a color image. */<br>
> > > > + assert(image->aspects == VK_IMAGE_ASPECT_COLOR_BIT);<br>
> > > > +<br>
> > > > + /* Only color buffers with CCS need resolving. */<br>
> > > > + if (image->aux_surface.isl.size == 0 || image->samples > 1)<br>
> > > > + return;<br>
> > > > +<br>
> > > > + /* Don't transition this subresource range if it lacks auxiliary<br>
> > data.<br>
> > > > */<br>
> > > > + if (base_level >= anv_color_aux_levels(image) ||<br>
> > > > + base_layer >= anv_color_aux_layers(image, base_level))<br>
> > > > + return;<br>
> > > > +<br>
> > > > + /* The undefined layout indicates that the user doesn't care about<br>
> > the<br>
> > > > + * data that's currently in the buffer. The pre-initialized layout<br>
> > is<br>
> > > > + * equivalent to the undefined layout for optimally-tiled images.<br>
> > > > + *<br>
> > > > + * We can only skip the resolve for CCS_E images in this layout<br>
> > > > because it<br>
> > > > + * is enabled outside of render passes. This allows previously<br>
> > > > fast-cleared<br>
> > > > + * and undefined buffers to be defined with transfer operations.<br>
> > > > + */<br>
> > > > + const bool is_ccs_e = image->aux_usage == ISL_AUX_USAGE_CCS_E;<br>
> > > > + const bool undef_layout = initial_layout ==<br>
> > VK_IMAGE_LAYOUT_UNDEFINED<br>
> > > > ||<br>
> > > > + initial_layout == VK_IMAGE_LAYOUT_<br>
> > > > PREINITIALIZED;<br>
> > > > + if (is_ccs_e && undef_layout)<br>
> > > > + return;<br>
> > > > +<br>
> > > > + /* A resolve isn't necessary when transitioning from a layout that<br>
> > > > doesn't<br>
> > > > + * have fast-clear data or to a layout that will be aware of the<br>
> > > > fast-clear<br>
> > > > + * value.<br>
> > > > + */<br>
> > > > + const bool maybe_fast_cleared = undef_layout || initial_layout ==<br>
> > > > + VK_IMAGE_LAYOUT_COLOR_<br>
> > > > ATTACHMENT_OPTIMAL;<br>
> > > > + if (!maybe_fast_cleared || final_layout ==<br>
> > > > + VK_IMAGE_LAYOUT_COLOR_<wbr>ATTACHMENT_OPTIMAL)<br>
> > > > + return;<br>
> > > > +<br>
> > > > + /* Determine the optimal resolve operation. For now we only need to<br>
> > > > resolve<br>
> > > > + * the clear color.<br>
> > > > + */<br>
> > > > + const enum blorp_fast_clear_op op = is_ccs_e ?<br>
> > > > + BLORP_FAST_CLEAR_OP_RESOLVE_<br>
> > PARTIAL<br>
> > > > :<br>
> > > > + BLORP_FAST_CLEAR_OP_RESOLVE_<br>
> > FULL;<br>
> > > > +<br>
> > > > + /* The actual range that will be transitioned is limited by the<br>
> > number<br>
> > > > of<br>
> > > > + * subresources that have auxiliary data.<br>
> > > > + */<br>
> > > > + level_count = MIN2(level_count, anv_color_aux_levels(image));<br>
> > > > +<br>
> > > > + /* A write cache flush with an end-of-pipe sync is required between<br>
> > > > + * rendering, clearing, and resolving operations. Perform a flush<br>
> > of<br>
> > > > the<br>
> > > > + * write cache before and after the resolve operation to meet this<br>
> > > > + * requirement.<br>
> > > > + */<br>
> > > ><br>
> > ><br>
> > > anv_blorp.c has a more detailed comment about this. I think I'd rather<br>
> > we<br>
> > > copy it here so that it doesn't get lost when we delete<br>
> > > ccs_resolve_attachments.<br>
> > ><br>
> > > Also, I have no idea whether we need to do it per-fast-clear-op or once<br>
> > > before we do fast-clear ops and once afterwards. I think this is<br>
> > probably<br>
> > > fine.<br>
> > ><br>
> > ><br>
> ><br>
> > Contrary to the comment in anv_blorp, I found documentation that<br>
> > provides us more information as to what's needed. How about I rewrite my<br>
> > comment to this instead?<br>
> ><br>
> > From the Sky Lake PRM Vol. 7, "MCS Buffer for Render Target(s)":<br>
> ><br>
> > Any transition from any value in {Clear, Render, Resolve} to a<br>
> > different value in {Clear, Render, Resolve} requires end of pipe<br>
> > synchronization.<br>
> ><br>
> > Perform a flush of the write cache before and after the resolve<br>
> > operation to meet this requirement.<br>
> ><br>
><br>
> That PRM citation is fine and maybe better than the one I used (though they<br>
> say about the same thing). The part I was hoping to preserve is<br>
><br>
> * This comment is a bit cryptic and doesn't really tell you what's going<br>
> * or what's really needed. It appears that fast clear ops are not<br>
> * properly synchronized with other drawing. We need to use a<br>
> PIPE_CONTROL<br>
> * to ensure that the contents of the previous draw hit the render target<br>
> * before we resolve and then use a second PIPE_CONTROL after the resolve<br>
> * to ensure that it is completed before any additional drawing occurs.<br>
><br>
> Yes, the PRM text tells you that an end of pipe sync is needed, but it<br>
> doesn't tell you where. It's empirical evidence from hunting down bugs<br>
> that tells us we need it on both sides.<br>
><br>
<br>
</div></div>How does the PRM text not tell you where the end of pipe sync is needed?<br>
It states that we must perform the sync between any value in {Clear,<br>
Render, Resolve} to a different value in {Clear, Render, Resolve}. This<br>
means that we're left with two options:<br>
<br>
1. Track every clear, render, resolve operation and perform the sync<br>
in between as described (should be easy in GL).<br>
2. Track none of the operations and perform the sync around 2 of the<br>
three operations (only feasible option in Vulkan).<br>
</blockquote></div><br></div><div class="gmail_extra">Maybe it does contain all of the data. However, even reading the PRM bit in context, it's still seems cryptic. What does it mean by "different value"? Is it talking about surface values or values in the bits in 3DSTATE_PS? How about:<br><br><br>
From the Sky Lake PRM Vol. 7, "MCS Buffer for Render Target(s)":<br>
<br>
Any transition from any value in {Clear, Render, Resolve} to a<br>
different value in {Clear, Render, Resolve} requires end of pipe<br>
synchronization.<br><br></div><div class="gmail_extra"> In other words, fast clear ops are not properly synchronized with other<br></div><div class="gmail_extra"> drawing. We need to use a PIPE_CONTROL to ensure that the contents of<br></div><div class="gmail_extra"> the previous draw hit the render target before we resolve and again afterwards<br></div><div class="gmail_extra"> to ensure that the resolve is complete before we do any more regular drawing.<br><br></div><div class="gmail_extra">In general, I'd rather the comment describe what's going on in the hardware and why we need to do the flush rather than just "the PRM says so"<br><br></div><div class="gmail_extra">--Jason<br></div></div>