[Mesa-dev] [PATCH v2 06/18] intel/compiler: fix brw_imm_w for negative 16-bit integers
Chema Casanova
jmcasanova at igalia.com
Wed May 2 15:19:09 UTC 2018
El 01/05/18 a las 01:22, Jason Ekstrand escribió:
> On Mon, Apr 30, 2018 at 3:53 PM, Chema Casanova <jmcasanova at igalia.com
> <mailto:jmcasanova at igalia.com>> wrote:
>
>
>
> On 30/04/18 23:12, Jason Ekstrand wrote:
> > On Mon, Apr 30, 2018 at 7:18 AM, Iago Toral Quiroga <itoral at igalia.com <mailto:itoral at igalia.com>
> > <mailto:itoral at igalia.com <mailto:itoral at igalia.com>>> wrote:
> >
> > From: Jose Maria Casanova Crespo <jmcasanova at igalia.com <mailto:jmcasanova at igalia.com>
> > <mailto:jmcasanova at igalia.com <mailto:jmcasanova at igalia.com>>>
> >
> > 16-bit immediates need to replicate the 16-bit immediate value
> > in both words of the 32-bit value. This needs to be careful
> > to avoid sign-extension, which the previous implementation was
> > not handling properly.
> >
> > For example, with the previous implementation, storing the value
> > -3 would generate imm.d = 0xfffffffd due to signed integer sign
> > extension, which is not correct. Instead, we should cast to
> > unsigned, which gives us the correct result: imm.ud = 0xfffdfffd.
> >
> > We only had a couple of cases hitting this path in the driver
> > until now, one with value -1, which would work since all bits are
> > one in this case, and another with value -2 in brw_clip_tri(),
> > which would hit the aforementioned issue (this case only affects
> > gen4 although we are not aware of whether this was causing an
> > actual bug somewhere).
> > ---
> > src/intel/compiler/brw_reg.h | 2 +-
> > 1 file changed, 1 insertion(+), 1 deletion(-)
> >
> > diff --git a/src/intel/compiler/brw_reg.h
> b/src/intel/compiler/brw_reg.h
> > index dff9b970b2..0084a78af6 100644
> > --- a/src/intel/compiler/brw_reg.h
> > +++ b/src/intel/compiler/brw_reg.h
> > @@ -705,7 +705,7 @@ static inline struct brw_reg
> > brw_imm_w(int16_t w)
> > {
> > struct brw_reg imm = brw_imm_reg(BRW_REGISTER_TYPE_W);
> > - imm.d = w | (w << 16);
> > + imm.ud = (uint16_t)w | ((uint16_t)w << 16);
>
> > Uh... Is this cast right? Doing a << 16 on a 16-bit data type should
> > yield undefined results. I think you want a (uint32_t) cast.
>
> In my test code it was working at least with GCC, I think it is because
> at the end we have an integer promotion for any type with lower rank
> than int.
>
> "Formally, the rule says (C11 6.3.1.1):
>
> If an int can represent all values of the original type (as
> restricted by the width, for a bit-field), the value is converted to an
> int; otherwise, it is converted to an unsigned int. These are called the
> integer promotions."
>
> But I agree that is clearer if we just use (uint32_t).
> I can change also the brw_imm_uw case that has the same issue.
>
>
> Yeah, best to make it clear. :-)
I was wrong, we can't just replace (uint16_t) cast by (uint32_t) because
the cast from signed short to uint32_t implies sign extension, because
it seems that sign extensions is done if source is signed and not in
destination type.
So for example, being w = -2 (0xfffe).
imm.ud = (uint32_t)w | (uint32_t)w << 16;
becomes: 0xfffffffe
So the alternatives I figure out with the correct result are.
imm.ud = (uint32_t) w & 0xffff | (uint32_t)w << 16;
Or:
uint16_t value = w;
imm.ud = (uint32_t)value | (uint32_t)value << 16;
Or something like:
imm.ud = (uint32_t)(uint16_t)w | ((uint32_t)(uint16_t)w << 16);
Any preference?
Chema
More information about the mesa-dev
mailing list