[Mesa-dev] [PATCH 5/8] PowerPC: clear Altivec NJ bit
Roland Scheidegger
sroland at vmware.com
Thu Nov 22 13:33:07 PST 2012
Am 22.11.2012 21:34, schrieb Adhemerval Zanella:
> Mostly PowerPC system sets the Altivec NJ bit to 1 so denormal number
> are handled as 0. Initially it was a performance configuration, since
> denormal handling tended to be costly. However it is not the case on
> more recent PowerPC chips (POWER6 and onwards).
>
> This patch enforces the clear of NJ bit in VSCR Altivec register so
> denormal numbers are handles as expected by IEEE standards
> (more information on PowerISA 2.06 - Section 6.3). This make the
> half-float to float transformation and some rounding work correctly
> on an Altivec enabled machine.
>
> Any tips, advices, comments?
Looks good to me, though I think ultimately we should be able to avoid
denorms - even for x86 we probably want to switch on the DAZ and/or FTZ
flags (I guess there's also the question if that interferes with the
callers of the jited functions). Denorms ARE slow, and they typically
shouldn't be required for shaders (dx10 for instance even though it
mostly conforms to ieee754 seems to mandate they are flushed to zero).
Roland
More information about the mesa-dev
mailing list