[Mesa-dev] [PATCH 4/6] i965/gen8: Correct HALIGN for AUX surfaces
Ben Widawsky
ben at bwidawsk.net
Thu Jun 4 15:15:50 PDT 2015
On Fri, May 29, 2015 at 05:21:14PM -0700, Chad Versace wrote:
> On Fri 29 May 2015, Chad Versace wrote:
> > On Thu 28 May 2015, Ben Widawsky wrote:
> > > This restriction was attempted in this commit:
> > > commit 47053464630888f819ef8cc44278f1a1220159b9
> > > Author: Anuj Phogat <anuj.phogat at gmail.com>
> > > Date: Fri Feb 13 11:21:21 2015 -0800
> > >
> > > i965/gen8: Use HALIGN_16 if MCS is enabled for non-MSRT
> > >
> > > However, the commit itself doesn't achieve the desired goal as determined by the
> > > asserts which the next patch adds. mcs_mt is never a value because we're in the
> > > process of allocating the mcs_mt miptree when we get to this function. I didn't
> > > check, but perhaps this would work with blorp, however, meta clears allocate the
> > > miptree structure (which AFAICT needs the alignment also) way before it
> > > allocates using meta clears where the renderbuffer is allocated way before the
> > > aux buffer.
> > >
> > > The restriction is referenced in a few places, but the most concise one [IMO]
> > > from the spec is for Gen9. Gen8 8 loosens the restriction in that it only
> > > requires this for non-msrt surface.
> > >
> > > When Auxiliary Surface Mode is set to AUX_CCS_D or AUX_CCS_E, HALIGN 16 must
> > > be used.
> > >
> > > With the code before the miptree layout flag rework (patches preceding this),
> > > accomplishing this workaround is very difficult.
> > >
> > > Cc: Anuj Phogat <anuj.phogat at gmail.com>
> > > Cc: Neil Roberts <neil at linux.intel.com>
> > > Signed-off-by: Ben Widawsky <ben at bwidawsk.net>
> > > ---
> > > src/mesa/drivers/dri/i965/brw_tex_layout.c | 16 ++++++++++------
> > > src/mesa/drivers/dri/i965/intel_mipmap_tree.c | 15 ++++++++++++---
> > > src/mesa/drivers/dri/i965/intel_mipmap_tree.h | 4 +++-
> > > 3 files changed, 25 insertions(+), 10 deletions(-)
> > >
> > > diff --git a/src/mesa/drivers/dri/i965/brw_tex_layout.c b/src/mesa/drivers/dri/i965/brw_tex_layout.c
> > > index 72b02a2..b51c7c7 100644
> > > --- a/src/mesa/drivers/dri/i965/brw_tex_layout.c
> > > +++ b/src/mesa/drivers/dri/i965/brw_tex_layout.c
> > > @@ -41,8 +41,13 @@
> > >
> > > static unsigned int
> > > intel_horizontal_texture_alignment_unit(struct brw_context *brw,
> > > - struct intel_mipmap_tree *mt)
> > > + struct intel_mipmap_tree *mt,
> > > + uint32_t layout_flags)
> > > {
> > > +
> > > + if (layout_flags & MIPTREE_LAYOUT_FORCE_HALIGN16)
> > > + return 16;
> > > +
> >
> > This snippet makes sense. If the force flag is set, then don't bother
> > inspecting any other state.
> >
> > > /**
> > > * From the "Alignment Unit Size" section of various specs, namely:
> > > * - Gen3 Spec: "Memory Data Formats" Volume, Section 1.20.1.4
> > > @@ -91,9 +96,6 @@ intel_horizontal_texture_alignment_unit(struct brw_context *brw,
> > > if (brw->gen >= 7 && mt->format == MESA_FORMAT_Z_UNORM16)
> > > return 8;
> > >
> > > - if (brw->gen == 8 && mt->mcs_mt && mt->num_samples <= 1)
> > > - return 16;
> > > -
> > > return 4;
> > > }
> > >
> > > @@ -459,7 +461,8 @@ brw_miptree_layout_texture_3d(struct brw_context *brw,
> > > }
> > >
> > > void
> > > -brw_miptree_layout(struct brw_context *brw, struct intel_mipmap_tree *mt)
> > > +brw_miptree_layout(struct brw_context *brw, struct intel_mipmap_tree *mt,
> > > + uint32_t layout_flags)
> > > {
> > > bool multisampled = mt->num_samples > 1;
> > > bool gen6_hiz_or_stencil = false;
> > > @@ -492,8 +495,9 @@ brw_miptree_layout(struct brw_context *brw, struct intel_mipmap_tree *mt)
> > > mt->align_w = 128 / mt->cpp;
> > > mt->align_h = 32;
> > > }
> > > + assert((layout_flags & MIPTREE_LAYOUT_FORCE_HALIGN16) == 0);
> >
> > This assertion makes sense.
> >
> > > } else {
> > > - mt->align_w = intel_horizontal_texture_alignment_unit(brw, mt);
> > > + mt->align_w = intel_horizontal_texture_alignment_unit(brw, mt, layout_flags);
> > > mt->align_h =
> > > intel_vertical_texture_alignment_unit(brw, mt->format, multisampled);
> > > }
> > > diff --git a/src/mesa/drivers/dri/i965/intel_mipmap_tree.c b/src/mesa/drivers/dri/i965/intel_mipmap_tree.c
> > > index 75ee19a..a1ac0cf 100644
> > > --- a/src/mesa/drivers/dri/i965/intel_mipmap_tree.c
> > > +++ b/src/mesa/drivers/dri/i965/intel_mipmap_tree.c
> > > @@ -475,7 +475,10 @@ intel_miptree_create_layout(struct brw_context *brw,
> > > if (layout_flags & MIPTREE_LAYOUT_FORCE_ALL_SLICE_AT_LOD)
> > > mt->array_layout = ALL_SLICES_AT_EACH_LOD;
> > >
> > > - brw_miptree_layout(brw, mt);
> > > + if (intel_miptree_is_fast_clear_capable(brw, mt))
> > > + layout_flags |= MIPTREE_LAYOUT_FORCE_HALIGN16;
> > > +
> > > + brw_miptree_layout(brw, mt, layout_flags);
> >
> > This does not make sense to me. I though HALIGN16 didn't exist before
> > Skylake, but here you're setting it on Ivybridge through Broadwell too.
> > Am I misunderstanding something?
>
> I did some homewwork. HALIGN16 was introduced in Broadwell and does not
> exist in Haswell. So this code still looks wrong to me.
Yeah, you are right. It's accidentally correct because the only buffers which
need HALIGN8 are depth/stencil, which aren't fast clear capable. I'll fix this.
(I was trying to figure out why I had no regressions with this bug).
More information about the mesa-dev
mailing list