<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">On Thu, Oct 5, 2017 at 7:39 PM, Dave Airlie <span dir="ltr"><<a href="mailto:airlied@gmail.com" target="_blank">airlied@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On 6 October 2017 at 12:31, Marek Olšák <<a href="mailto:maraeo@gmail.com">maraeo@gmail.com</a>> wrote:<br>
> On Fri, Oct 6, 2017 at 4:10 AM, Connor Abbott <<a href="mailto:cwabbott0@gmail.com">cwabbott0@gmail.com</a>> wrote:<br>
>> On Thu, Oct 5, 2017 at 10:08 PM, Marek Olšák <<a href="mailto:maraeo@gmail.com">maraeo@gmail.com</a>> wrote:<br>
>>> On Fri, Oct 6, 2017 at 3:50 AM, Connor Abbott <<a href="mailto:cwabbott0@gmail.com">cwabbott0@gmail.com</a>> wrote:<br>
>>>> Why? While it might technically be legal, always generating an unfused<br>
>>>> mul+add when the user explicitly requested fma() seems harsh...<br>
>>><br>
>>> It's slow on some chips. It doesn't need any other reason.<br>
>>><br>
>>> Marek<br>
>><br>
>> Presumably, if the developer asked for fma, then they don't care how<br>
>> fast or slow it is...<br>
><br>
> Feral asked for fma. They care. This debate is pointless. We just<br>
> won't use fma by default. Period.<br>
<br>
</span>They didn't ask for it with precise precision. I'm assuming if someone wants<br>
fma with precise precision we should give it to them. Like at least<br>
the fma manpage states.<br></blockquote><div><br></div><div>Eh, fma() doesn't guarantee additional precision so anyone who's counting on that is in for some trouble. If someone uses fma() explicitly its because they either care about speed (in which case fma on radeon doesn't make sense from what Marek says) or because they want to explicitly control the order of operations. Giving them the slow thing just because the GPU has that instruction is *not* what they want.</div><div><br></div><div>--Jason<br></div></div></div></div>