[Mesa-dev] [PATCH 32/59] intel/compiler: fix 16-bit float ddx and ddy for SIMD8
Iago Toral
itoral at igalia.com
Tue Dec 11 11:06:11 UTC 2018
I used byte_offset() in the previous patch like you suggested and with
that we no longer need this one.
On Fri, 2018-12-07 at 13:09 -0600, Jason Ekstrand wrote:
> And here we are.... I think I'd still like byte_offset better but,
> either way patches 31 and 32 are
>
> Reviewed-by: Jason Ekstrand <jason at jlekstrand.net>
>
>
> On Tue, Dec 4, 2018 at 1:18 AM Iago Toral Quiroga <itoral at igalia.com>
> wrote:
> > In SIMD8 we pack 2 vector components in a single SIMD register, so
> >
> > for example, component Y of a 16-bit vec2 starts is at byte offset
> >
> > 16B. This means that when we compute the offset of the elements to
> >
> > be differentiated we should not stomp whatever base offset we have,
> >
> > but instead add to it.
> >
> > ---
> >
> > src/intel/compiler/brw_fs_generator.cpp | 6 +++---
> >
> > 1 file changed, 3 insertions(+), 3 deletions(-)
> >
> >
> >
> > diff --git a/src/intel/compiler/brw_fs_generator.cpp
> > b/src/intel/compiler/brw_fs_generator.cpp
> >
> > index bffd9bc4787..d8e4bae17e0 100644
> >
> > --- a/src/intel/compiler/brw_fs_generator.cpp
> >
> > +++ b/src/intel/compiler/brw_fs_generator.cpp
> >
> > @@ -1259,7 +1259,7 @@ fs_generator::generate_ddx(const fs_inst
> > *inst,
> >
> > struct brw_reg src0 = src;
> >
> > struct brw_reg src1 = src;
> >
> >
> >
> > - src0.subnr = type_sz(src.type);
> >
> > + src0.subnr += type_sz(src.type);
> >
> > src0.vstride = vstride;
> >
> > src0.width = width;
> >
> > src0.hstride = BRW_HORIZONTAL_STRIDE_0;
> >
> > @@ -1325,8 +1325,8 @@ fs_generator::generate_ddy(const fs_inst
> > *inst,
> >
> > /* replicate the derivative at the top-left pixel to other
> > pixels */
> >
> > struct brw_reg src0 = stride(src, 4, 4, 0);
> >
> > struct brw_reg src1 = stride(src, 4, 4, 0);
> >
> > - src0.subnr = 0 * type_size;
> >
> > - src1.subnr = 2 * type_size;
> >
> > + src0.subnr += 0 * type_size;
> >
> > + src1.subnr += 2 * type_size;
> >
> >
> >
> > brw_ADD(p, dst, negate(src0), src1);
> >
> > }
> >
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/mesa-dev/attachments/20181211/5af7e70c/attachment.html>
More information about the mesa-dev
mailing list