<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Jul 14, 2016 at 2:26 PM, Chad Versace <span dir="ltr"><<a href="mailto:chad.versace@intel.com" target="_blank">chad.versace@intel.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="HOEnZb"><div class="h5">On Wed 13 Jul 2016, Jason Ekstrand wrote:<br>
> Reviewed-by: Topi Pohjolainen <<a href="mailto:topi.pohjolainen@intel.com">topi.pohjolainen@intel.com</a>><br>
> ---<br>
> src/mesa/drivers/dri/i965/brw_blorp.c | 157 ++++++++++++++++++++++++++++++++++<br>
> src/mesa/drivers/dri/i965/brw_blorp.h | 6 ++<br>
> 2 files changed, 163 insertions(+)<br>
><br>
> diff --git a/src/mesa/drivers/dri/i965/brw_blorp.c b/src/mesa/drivers/dri/i965/brw_blorp.c<br>
> index 04c10b6..282a5b2 100644<br>
> --- a/src/mesa/drivers/dri/i965/brw_blorp.c<br>
> +++ b/src/mesa/drivers/dri/i965/brw_blorp.c<br>
> @@ -231,6 +231,163 @@ brw_blorp_compile_nir_shader(struct brw_context *brw, struct nir_shader *nir,<br>
> return program;<br>
> }<br>
><br>
> +static enum isl_msaa_layout<br>
> +get_isl_msaa_layout(enum intel_msaa_layout layout)<br>
> +{<br>
> + switch (layout) {<br>
> + case INTEL_MSAA_LAYOUT_NONE:<br>
> + return ISL_MSAA_LAYOUT_NONE;<br>
> + case INTEL_MSAA_LAYOUT_IMS:<br>
> + return ISL_MSAA_LAYOUT_INTERLEAVED;<br>
> + case INTEL_MSAA_LAYOUT_UMS:<br>
> + case INTEL_MSAA_LAYOUT_CMS:<br>
> + return ISL_MSAA_LAYOUT_ARRAY;<br>
> + default:<br>
> + unreachable("Invalid MSAA layout");<br>
> + }<br>
> +}<br>
<br>
</div></div>Lookup table? With nchery-style validation? Just a suggestion.<br>
<div><div class="h5"><br>
> +<br>
> +struct surface_state_info {<br>
> + unsigned num_dwords;<br>
> + unsigned ss_align; /* Required alignment of RENDER_SURFACE_STATE in bytes */<br>
> + unsigned reloc_dw;<br>
> + unsigned aux_reloc_dw;<br>
> + unsigned tex_mocs;<br>
> + unsigned rb_mocs;<br>
> +};<br>
> +<br>
> +static const struct surface_state_info surface_state_infos[] = {<br>
> + [6] = {6, 32, 1, 0},<br>
> + [7] = {8, 32, 1, 6, GEN7_MOCS_L3, GEN7_MOCS_L3},<br>
> + [8] = {13, 64, 8, 10, BDW_MOCS_WB, BDW_MOCS_PTE},<br>
> + [9] = {16, 64, 8, 10, SKL_MOCS_WB, SKL_MOCS_PTE},<br>
> +};<br>
> +<br>
> +uint32_t<br>
> +brw_blorp_emit_surface_state(struct brw_context *brw,<br>
> + const struct brw_blorp_surface_info *surface,<br>
> + uint32_t read_domains, uint32_t write_domain,<br>
> + bool is_render_target)<br>
> +{<br>
> + const struct surface_state_info ss_info = surface_state_infos[brw->gen];<br>
> +<br>
> + struct isl_surf surf;<br>
> + intel_miptree_get_isl_surf(brw, surface->mt, &surf);<br>
> +<br>
> + /* Stomp surface dimensions and tiling (if needed) with info from blorp */<br>
> + surf.dim = ISL_SURF_DIM_2D;<br>
> + surf.dim_layout = ISL_DIM_LAYOUT_GEN4_2D;<br>
> + surf.msaa_layout = get_isl_msaa_layout(surface->msaa_layout);<br>
> + surf.logical_level0_px.width = surface->width;<br>
> + surf.logical_level0_px.height = surface->height;<br>
> + surf.logical_level0_px.depth = 1;<br>
> + surf.logical_level0_px.array_len = 1;<br>
> + surf.levels = 1;<br>
> + surf.samples = MAX2(surface->num_samples, 1);<br>
<br>
</div></div>This scares me. I'm really really scared.<br>
/me closes eyes and continues<br>
<div><div class="h5"><br>
> +<br>
> + /* Alignment doesn't matter since we have 1 miplevel and 1 array slice so<br>
> + * just pick something that works for everybody.<br>
> + */<br>
> + surf.image_alignment_el = isl_extent3d(4, 4, 1);<br>
> +<br>
> + if (brw->gen == 6 && surface->num_samples > 1) {<br>
> + /* Since gen6 uses INTEL_MSAA_LAYOUT_IMS, width and height are measured<br>
> + * in samples. But SURFACE_STATE wants them in pixels, so we need to<br>
> + * divide them each by 2.<br>
> + */<br>
> + surf.logical_level0_px.width /= 2;<br>
> + surf.logical_level0_px.height /= 2;<br>
> + }<br>
> +<br>
> + if (brw->gen == 6 && surf.image_alignment_el.height > 4) {<br>
> + /* This can happen on stencil buffers on Sandy Bridge due to the<br>
> + * single-LOD work-around. It's fairly harmless as long as we don't<br>
> + * pass a bogus value into isl_surf_fill_state().<br>
> + */<br>
> + surf.image_alignment_el = isl_extent3d(4, 2, 1);<br>
> + }<br>
> +<br>
> + /* We need to fake W-tiling with Y-tiling */<br>
> + if (surface->map_stencil_as_y_tiled)<br>
> + surf.tiling = ISL_TILING_Y0;<br>
> +<br>
> + union isl_color_value clear_color = { .u32 = { 0, 0, 0, 0 } };<br>
> +<br>
> + struct isl_surf *aux_surf = NULL, aux_surf_s;<br>
> + uint64_t aux_offset = 0;<br>
> + enum isl_aux_usage aux_usage = ISL_AUX_USAGE_NONE;<br>
> + if (surface->mt->mcs_mt) {<br>
> + /* We should probably to similar stomping to above but most of the aux<br>
<br>
</div></div>Correction:<br>
We should probably do similar stomping as above but most of the aux<br>
or<br>
We should probably do stomping similar to the above but most of the aux<br>
<span class=""><br>
> + * surf gets ignored when we fill out the surface state anyway so<br>
> + * there's no point.<br>
> + */<br>
<br>
</span>I'm scared again. Lovecraft wrote that comment.<br>
/me shuts eyes tighter<br></blockquote><div><br></div><div>Actually, we should never stomp the aux surface. The comment is bogus. I believe it also goes away eventually so meh.<br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<span class=""><br>
> + intel_miptree_get_aux_isl_surf(brw, surface->mt, &aux_surf_s, &aux_usage);<br>
> + aux_surf = &aux_surf_s;<br>
> + assert(surface->mt->mcs_mt->offset == 0);<br>
</span>> + aux_offset = surface->mt->mcs_mt->bo->offset64/<br>
<span class="">> +<br>
> + /* We only really need a clear color if we also have an auxiliary<br>
> + * surface. Without one, it does nothing.<br>
> + */<br>
> + clear_color = intel_miptree_get_isl_clear_color(brw, surface->mt);<br>
> + }<br>
> +<br>
> + struct isl_view view = {<br>
> + .format = surface->brw_surfaceformat,<br>
> + .base_level = 0,<br>
> + .levels = 1,<br>
> + .base_array_layer = 0,<br>
> + .array_len = 1,<br>
> + .channel_select = {<br>
> + ISL_CHANNEL_SELECT_RED,<br>
> + ISL_CHANNEL_SELECT_GREEN,<br>
> + ISL_CHANNEL_SELECT_BLUE,<br>
> + ISL_CHANNEL_SELECT_ALPHA,<br>
> + },<br>
> + .usage = is_render_target ? ISL_SURF_USAGE_RENDER_TARGET_BIT :<br>
> + ISL_SURF_USAGE_TEXTURE_BIT,<br>
> + };<br>
<br>
</span>Are you confident that blorp never requires ISL_SURF_USAGE_CUBE_MAP_BIT?<br></blockquote><div><br></div><div>Yes, very confident. That's only required if you're going to use a cube sampler. Blorp has no need for cube samplers.<br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<span class=""><br>
> +<br>
> + uint32_t offset, tile_x, tile_y;<br>
> + offset = brw_blorp_compute_tile_offsets(surface, &tile_x, &tile_y);<br>
> +<br>
> + uint32_t surf_offset;<br>
> + uint32_t *dw = brw_state_batch(brw, AUB_TRACE_SURFACE_STATE,<br>
> + ss_info.num_dwords * 4, ss_info.ss_align,<br>
> + &surf_offset);<br>
> +<br>
> + const uint32_t mocs = is_render_target ? ss_info.rb_mocs : ss_info.tex_mocs;<br>
> +<br>
> + isl_surf_fill_state(&brw->isl_dev, dw, .surf = &surf, .view = &view,<br>
> + .address = surface->mt->bo->offset64 + offset,<br>
> + .aux_surf = aux_surf, .aux_usage = aux_usage,<br>
> + .aux_address = aux_offset,<br>
> + .mocs = mocs, .clear_color = clear_color,<br>
> + .x_offset_sa = tile_x, .y_offset_sa = tile_y);<br>
<br>
</span>Whoa! Where did x_offset_sa and y_offset_sa come from? It's not on<br>
master, and I didn't see it in any earlier patches.<br></blockquote><div><br></div><div>patch 8<br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
And now I skip ahead to patch 21, as you recommended.<br>
<div class="HOEnZb"><div class="h5"><br>
> +<br>
> + /* Emit relocation to surface contents */<br>
> + drm_intel_bo_emit_reloc(brw-><a href="http://batch.bo" rel="noreferrer" target="_blank">batch.bo</a>,<br>
> + surf_offset + ss_info.reloc_dw * 4,<br>
> + surface->mt->bo,<br>
> + dw[ss_info.reloc_dw] - surface->mt->bo->offset64,<br>
> + read_domains, write_domain);<br>
> +<br>
> + if (aux_surf) {<br>
> + /* On gen7 and prior, the bottom 12 bits of the MCS base address are<br>
> + * used to store other information. This should be ok, however, because<br>
> + * surface buffer addresses are always 4K page alinged.<br>
> + */<br>
> + assert((aux_offset & 0xfff) == 0);<br>
> + drm_intel_bo_emit_reloc(brw-><a href="http://batch.bo" rel="noreferrer" target="_blank">batch.bo</a>,<br>
> + surf_offset + ss_info.aux_reloc_dw * 4,<br>
> + surface->mt->mcs_mt->bo,<br>
> + dw[ss_info.aux_reloc_dw] & 0xfff,<br>
> + read_domains, write_domain);<br>
> + }<br>
> +<br>
> + return surf_offset;<br>
> +}<br>
> +<br>
> /**<br>
> * Perform a HiZ or depth resolve operation.<br>
> *<br>
> diff --git a/src/mesa/drivers/dri/i965/brw_blorp.h b/src/mesa/drivers/dri/i965/brw_blorp.h<br>
> index c8f6219..beef90e 100644<br>
> --- a/src/mesa/drivers/dri/i965/brw_blorp.h<br>
> +++ b/src/mesa/drivers/dri/i965/brw_blorp.h<br>
> @@ -400,6 +400,12 @@ brw_blorp_compile_nir_shader(struct brw_context *brw, struct nir_shader *nir,<br>
> struct brw_blorp_prog_data *prog_data,<br>
> unsigned *program_size);<br>
><br>
> +uint32_t<br>
> +brw_blorp_emit_surface_state(struct brw_context *brw,<br>
> + const struct brw_blorp_surface_info *surface,<br>
> + uint32_t read_domains, uint32_t write_domain,<br>
> + bool is_render_target);<br>
> +<br>
> void<br>
> gen6_blorp_init(struct brw_context *brw);<br>
</div></div></blockquote></div><br></div></div>