<div dir="ltr"><div>Looking through your code, I believe it is correct. I'm a bit concerned, however, because we're now copying back to the main surface on every draw call. If an app ever happens to hit this path, it will be correct, but it's performance will tank. Unfortunately, a better implementation would be a pain because we would have to track which slice we pulled out and do something clever when someone goes to texture from it. The resolve framework could handle this but it would be painful. Ken? Thoughts?<br><br></div>Reviewed-by: Jason Ekstrand <<a href="mailto:jason@jlekstrand.net">jason@jlekstrand.net</a>><br></div><div class="gmail_extra"><br><div class="gmail_quote">On Fri, Jun 9, 2017 at 7:04 AM, Topi Pohjolainen <span dir="ltr"><<a href="mailto:topi.pohjolainen@gmail.com" target="_blank">topi.pohjolainen@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On gen < 6 one doesn't have level or layer specifiers available<br>
for render and depth targets. In order to support rendering to<br>
specific level/layer, driver needs to manually offset the surface<br>
to the desired slice.<br>
There are, however, alignment restrictions to respect as well and<br>
in come cases the only option is to use temporary single slice<br>
surface which driver copies after rendering to the full miptree.<br>
<br>
Current alignment workaround introduces new texture images which<br>
are added to the parent texture object. Texture validation later<br>
on copies the additional levels back to the surface that contains<br>
the full mipmap.<br>
This only works for non-arrayed surfaces and driver currently<br>
creates new arrayed images in vain - individual layers within the<br>
newly created are still unaligned the same as before.<br>
<br>
This patch drops this mechanism and instead attaches single<br>
temporary slice into the render buffer. This gets immediately<br>
copied back to the mipmapped and/or arrayed surface just after<br>
the render is done.<br>
<br>
Sitting on top of earlier series cleaning up the depth buffer<br>
state, this patch additionally fixes the following piglit tests:<br>
<br>
ext_texture_array.copyteximage 2d_array.g45m64<br>
ext_texture_array.copyteximage 1d_array.g45m64<br>
arb_framebuffer_object.fbo-<wbr>blit-stretch.g33m64<br>
ext_framebuffer_object.fbo-<wbr>cubemap.g965m64<br>
arb_framebuffer_object.fbo-<wbr>generatemipmap-cubemap.g965m64<br>
arb_texture_cube_map.<wbr>copyteximage cube.g965m64<br>
ext_texture_array.copyteximage 1d_array.g965m64<br>
ext_texture_array.copyteximage 2d_array.g965m64<br>
ext_texture_array.fbo-array.<wbr>g965m64<br>
ext_texture_array.gen-mipmap.<wbr>g965m64<br>
ext_texture_array.fbo-<wbr>generatemipmap-array.g965m64<br>
arb_pixel_buffer_object.<wbr>texsubimage array pbo.g965m64<br>
ext_texture_array.copyteximage 2d_array.ilkm64<br>
ext_texture_array.copyteximage 1d_array.ilkm64<br>
arb_texture_cube_map.<wbr>copyteximage cube.ilkm64<br>
<br>
CC: Kenneth Graunke <<a href="mailto:kenneth@whitecape.org">kenneth@whitecape.org</a>><br>
CC: Jason Ekstrand <<a href="mailto:jason@jlekstrand.net">jason@jlekstrand.net</a>><br>
CC: Ian Romanick <<a href="mailto:ian.d.romanick@intel.com">ian.d.romanick@intel.com</a>><br>
Signed-off-by: Topi Pohjolainen <<a href="mailto:topi.pohjolainen@intel.com">topi.pohjolainen@intel.com</a>><br>
---<br>
src/mesa/drivers/dri/i965/brw_<wbr>draw.c | 51 ++++++++++++++++++++++++<br>
src/mesa/drivers/dri/i965/brw_<wbr>misc_state.c | 4 +-<br>
src/mesa/drivers/dri/i965/brw_<wbr>wm_surface_state.c | 3 +-<br>
src/mesa/drivers/dri/i965/<wbr>intel_fbo.c | 19 +++++----<br>
src/mesa/drivers/dri/i965/<wbr>intel_fbo.h | 24 +++++++++++<br>
src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.c | 2 +-<br>
src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.h | 7 ++++<br>
7 files changed, 99 insertions(+), 11 deletions(-)<br>
<br>
diff --git a/src/mesa/drivers/dri/i965/<wbr>brw_draw.c b/src/mesa/drivers/dri/i965/<wbr>brw_draw.c<br>
index 611cb86..cb441c3 100644<br>
--- a/src/mesa/drivers/dri/i965/<wbr>brw_draw.c<br>
+++ b/src/mesa/drivers/dri/i965/<wbr>brw_draw.c<br>
@@ -396,6 +396,56 @@ brw_postdraw_set_buffers_need_<wbr>resolve(struct brw_context *brw)<br>
}<br>
<br>
static void<br>
+intel_renderbuffer_move_temp_<wbr>back(struct brw_context *brw,<br>
+ struct intel_renderbuffer *irb)<br>
+{<br>
+ if (irb->align_wa_mt == NULL)<br>
+ return;<br>
+<br>
+ brw_render_cache_set_check_<wbr>flush(brw, irb->align_wa_mt->bo);<br>
+<br>
+ intel_miptree_copy_slice(brw, irb->align_wa_mt, 0, 0,<br>
+ irb->mt,<br>
+ irb->Base.Base.TexImage-><wbr>Level, irb->mt_layer);<br>
+<br>
+ intel_miptree_reference(&irb-><wbr>align_wa_mt, NULL);<br>
+<br>
+ /* Finally restore the x,y to correspond to full miptree. */<br>
+ intel_renderbuffer_set_draw_<wbr>offset(irb);<br>
+<br>
+ /* Make sure render surface state gets re-emitted with updated miptree. */<br>
+ brw->NewGLState |= _NEW_BUFFERS;<br>
+}<br>
+<br>
+static void<br>
+brw_postdraw_reconcile_align_<wbr>wa_slices(struct brw_context *brw)<br>
+{<br>
+ struct gl_context *ctx = &brw->ctx;<br>
+ struct gl_framebuffer *fb = ctx->DrawBuffer;<br>
+<br>
+ struct intel_renderbuffer *depth_irb =<br>
+ intel_get_renderbuffer(fb, BUFFER_DEPTH);<br>
+ struct intel_renderbuffer *stencil_irb =<br>
+ intel_get_renderbuffer(fb, BUFFER_STENCIL);<br>
+<br>
+ if (depth_irb && depth_irb->align_wa_mt)<br>
+ intel_renderbuffer_move_temp_<wbr>back(brw, depth_irb);<br>
+<br>
+ if (stencil_irb && stencil_irb->align_wa_mt)<br>
+ intel_renderbuffer_move_temp_<wbr>back(brw, stencil_irb);<br>
+<br>
+ for (unsigned i = 0; i < fb->_NumColorDrawBuffers; i++) {<br>
+ struct intel_renderbuffer *irb =<br>
+ intel_renderbuffer(fb->_<wbr>ColorDrawBuffers[i]);<br>
+<br>
+ if (!irb || irb->align_wa_mt == NULL)<br>
+ continue;<br>
+<br>
+ intel_renderbuffer_move_temp_<wbr>back(brw, irb);<br>
+ }<br>
+}<br>
+<br>
+static void<br>
brw_predraw_set_aux_buffers(<wbr>struct brw_context *brw)<br>
{<br>
if (brw->gen < 9)<br>
@@ -626,6 +676,7 @@ retry:<br>
intel_batchbuffer_flush(brw);<br>
<br>
brw_program_cache_check_size(<wbr>brw);<br>
+ brw_postdraw_reconcile_align_<wbr>wa_slices(brw);<br>
brw_postdraw_set_buffers_need_<wbr>resolve(brw);<br>
<br>
return;<br>
diff --git a/src/mesa/drivers/dri/i965/<wbr>brw_misc_state.c b/src/mesa/drivers/dri/i965/<wbr>brw_misc_state.c<br>
index fe021b0..0c25261 100644<br>
--- a/src/mesa/drivers/dri/i965/<wbr>brw_misc_state.c<br>
+++ b/src/mesa/drivers/dri/i965/<wbr>brw_misc_state.c<br>
@@ -132,7 +132,7 @@ get_stencil_miptree(struct intel_renderbuffer *irb)<br>
return NULL;<br>
if (irb->mt->stencil_mt)<br>
return irb->mt->stencil_mt;<br>
- return irb->mt;<br>
+ return intel_renderbuffer_get_mt(irb)<wbr>;<br>
}<br>
<br>
static bool<br>
@@ -268,7 +268,7 @@ brw_emit_depthbuffer(struct brw_context *brw)<br>
/* _NEW_BUFFERS */<br>
struct intel_renderbuffer *depth_irb = intel_get_renderbuffer(fb, BUFFER_DEPTH);<br>
struct intel_renderbuffer *stencil_irb = intel_get_renderbuffer(fb, BUFFER_STENCIL);<br>
- struct intel_mipmap_tree *depth_mt = depth_irb ? depth_irb->mt : NULL;<br>
+ struct intel_mipmap_tree *depth_mt = intel_renderbuffer_get_mt(<wbr>depth_irb);<br>
struct intel_mipmap_tree *stencil_mt = get_stencil_miptree(stencil_<wbr>irb);<br>
uint32_t tile_x = brw->depthstencil.tile_x;<br>
uint32_t tile_y = brw->depthstencil.tile_y;<br>
diff --git a/src/mesa/drivers/dri/i965/<wbr>brw_wm_surface_state.c b/src/mesa/drivers/dri/i965/<wbr>brw_wm_surface_state.c<br>
index 3278ef3..8f818ab 100644<br>
--- a/src/mesa/drivers/dri/i965/<wbr>brw_wm_surface_state.c<br>
+++ b/src/mesa/drivers/dri/i965/<wbr>brw_wm_surface_state.c<br>
@@ -1008,7 +1008,8 @@ gen4_update_renderbuffer_<wbr>surface(struct brw_context *brw,<br>
* miptree and render into that.<br>
*/<br>
intel_renderbuffer_move_to_<wbr>temp(brw, irb, false);<br>
- mt = irb->mt;<br>
+ assert(irb->align_wa_mt);<br>
+ mt = irb->align_wa_mt;<br>
}<br>
}<br>
<br>
diff --git a/src/mesa/drivers/dri/i965/<wbr>intel_fbo.c b/src/mesa/drivers/dri/i965/<wbr>intel_fbo.c<br>
index a24ddd0..efbaa82 100644<br>
--- a/src/mesa/drivers/dri/i965/<wbr>intel_fbo.c<br>
+++ b/src/mesa/drivers/dri/i965/<wbr>intel_fbo.c<br>
@@ -993,11 +993,11 @@ intel_renderbuffer_move_to_<wbr>temp(struct brw_context *brw,<br>
<br>
intel_get_image_dims(rb-><wbr>TexImage, &width, &height, &depth);<br>
<br>
- new_mt = intel_miptree_create(brw, rb->TexImage->TexObject-><wbr>Target,<br>
+ assert(irb->align_wa_mt == NULL);<br>
+ new_mt = intel_miptree_create(brw, GL_TEXTURE_2D,<br>
intel_image->base.Base.<wbr>TexFormat,<br>
- intel_image->base.Base.Level,<br>
- intel_image->base.Base.Level,<br>
- width, height, depth,<br>
+ 0, 0,<br>
+ width, height, 1,<br>
irb->mt->num_samples,<br>
layout_flags);<br>
<br>
@@ -1005,11 +1005,16 @@ intel_renderbuffer_move_to_<wbr>temp(struct brw_context *brw,<br>
intel_miptree_alloc_hiz(brw, new_mt);<br>
}<br>
<br>
- intel_miptree_copy_teximage(<wbr>brw, intel_image, new_mt, invalidate);<br>
+ if (!invalidate)<br>
+ intel_miptree_copy_slice(brw, intel_image->mt,<br>
+ intel_image->base.Base.Level, irb->mt_layer,<br>
+ new_mt, 0, 0);<br>
<br>
- intel_miptree_reference(&irb-><wbr>mt, intel_image->mt);<br>
- intel_renderbuffer_set_draw_<wbr>offset(irb);<br>
+ intel_miptree_reference(&irb-><wbr>align_wa_mt, new_mt);<br>
intel_miptree_release(&new_mt)<wbr>;<br>
+<br>
+ irb->draw_x = 0;<br>
+ irb->draw_y = 0;<br>
}<br>
<br>
void<br>
diff --git a/src/mesa/drivers/dri/i965/<wbr>intel_fbo.h b/src/mesa/drivers/dri/i965/<wbr>intel_fbo.h<br>
index 2d2ef1e..3c2be82 100644<br>
--- a/src/mesa/drivers/dri/i965/<wbr>intel_fbo.h<br>
+++ b/src/mesa/drivers/dri/i965/<wbr>intel_fbo.h<br>
@@ -67,6 +67,16 @@ struct intel_renderbuffer<br>
*/<br>
struct intel_mipmap_tree *singlesample_mt;<br>
<br>
+ /* Gen < 6 doesn't have layer specifier for render targets or depth. Driver<br>
+ * needs to manually offset surfaces to correct level/layer. There are,<br>
+ * however, alignment restrictions to respect as well and in come cases<br>
+ * the only option is to use temporary single slice surface which driver<br>
+ * copies after rendering to the full miptree.<br>
+ *<br>
+ * See intel_renderbuffer_move_to_<wbr>temp().<br>
+ */<br>
+ struct intel_mipmap_tree *align_wa_mt;<br>
+<br>
/**<br>
* \name Miptree view<br>
* \{<br>
@@ -136,6 +146,14 @@ intel_renderbuffer(struct gl_renderbuffer *rb)<br>
return NULL;<br>
}<br>
<br>
+static inline struct intel_mipmap_tree *<br>
+intel_renderbuffer_get_mt(<wbr>struct intel_renderbuffer *irb)<br>
+{<br>
+ if (!irb)<br>
+ return NULL;<br>
+<br>
+ return (irb->align_wa_mt) ? irb->align_wa_mt : irb->mt;<br>
+}<br>
<br>
/**<br>
* \brief Return the framebuffer attachment specified by attIndex.<br>
@@ -188,6 +206,12 @@ intel_renderbuffer_get_tile_<wbr>offsets(struct intel_renderbuffer *irb,<br>
uint32_t *tile_x,<br>
uint32_t *tile_y)<br>
{<br>
+ if (irb->align_wa_mt) {<br>
+ *tile_x = 0;<br>
+ *tile_y = 0;<br>
+ return 0;<br>
+ }<br>
+<br>
return intel_miptree_get_tile_<wbr>offsets(irb->mt, irb->mt_level, irb->mt_layer,<br>
tile_x, tile_y);<br>
}<br>
diff --git a/src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.c b/src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.c<br>
index a4b2aeb..01b68cd 100644<br>
--- a/src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.c<br>
+++ b/src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.c<br>
@@ -1266,7 +1266,7 @@ intel_miptree_copy_slice_sw(<wbr>struct brw_context *brw,<br>
}<br>
}<br>
<br>
-static void<br>
+void<br>
intel_miptree_copy_slice(<wbr>struct brw_context *brw,<br>
struct intel_mipmap_tree *src_mt,<br>
unsigned src_level, unsigned src_layer,<br>
diff --git a/src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.h b/src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.h<br>
index 41d3d25..7979f3b 100644<br>
--- a/src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.h<br>
+++ b/src/mesa/drivers/dri/i965/<wbr>intel_mipmap_tree.h<br>
@@ -853,6 +853,13 @@ void intel_miptree_set_image_<wbr>offset(struct intel_mipmap_tree *mt,<br>
GLuint img, GLuint x, GLuint y);<br>
<br>
void<br>
+intel_miptree_copy_slice(<wbr>struct brw_context *brw,<br>
+ struct intel_mipmap_tree *src_mt,<br>
+ unsigned src_level, unsigned src_layer,<br>
+ struct intel_mipmap_tree *dst_mt,<br>
+ unsigned dst_level, unsigned dst_layer);<br>
+<br>
+void<br>
intel_miptree_copy_teximage(<wbr>struct brw_context *brw,<br>
struct intel_texture_image *intelImage,<br>
struct intel_mipmap_tree *dst_mt, bool invalidate);<br>
<span class="HOEnZb"><font color="#888888">--<br>
2.9.3<br>
<br>
</font></span></blockquote></div><br></div>