<p dir="ltr">On Mar 25, 2016 4:43 AM, <<a href="mailto:eocallaghan@alterapraxis.com">eocallaghan@alterapraxis.com</a>> wrote:<br>
><br>
> On 2016-03-25 14:02, Ilia Mirkin wrote:<br>
>><br>
>> On Thu, Mar 24, 2016 at 8:11 PM, Edward O'Callaghan<br>
>> <<a href="mailto:eocallaghan@alterapraxis.com">eocallaghan@alterapraxis.com</a>> wrote:<br>
>>><br>
>>> Using PIPE_FORMAT_NONE to indicate what MSAA modes are supported<br>
>>> with a framebuffer using no attachment.<br>
>>><br>
>>> Signed-off-by: Edward O'Callaghan <<a href="mailto:eocallaghan@alterapraxis.com">eocallaghan@alterapraxis.com</a>><br>
>>> ---<br>
>>> src/mesa/state_tracker/st_atom_framebuffer.c | 51 ++++++++++++++++++++++++++++<br>
>>> 1 file changed, 51 insertions(+)<br>
>>><br>
>>> diff --git a/src/mesa/state_tracker/st_atom_framebuffer.c b/src/mesa/state_tracker/st_atom_framebuffer.c<br>
>>> index ae883a2..07854ca 100644<br>
>>> --- a/src/mesa/state_tracker/st_atom_framebuffer.c<br>
>>> +++ b/src/mesa/state_tracker/st_atom_framebuffer.c<br>
>>> @@ -64,6 +64,44 @@ update_framebuffer_size(struct pipe_framebuffer_state *framebuffer,<br>
>>> framebuffer->height = MIN2(framebuffer->height, surface->height);<br>
>>> }<br>
>>><br>
>>> +/**<br>
>>> + * Round up the requested multisample count to the next supported sample size.<br>
>>> + */<br>
>>> +static unsigned<br>
>>> +framebuffer_quantize_num_samples(struct pipe_screen *screen, unsigned num_samples)<br>
>>> +{<br>
>>> + int quantized_samples = 0;<br>
>>> + bool msaa_mode_supported;<br>
>>> +<br>
>>> + if (!num_samples)<br>
>>> + return 0;<br>
>>> +<br>
>>> + assert(screen);<br>
>>> +<br>
>>> + /* Assumes the highest supported MSAA is x32 on any hardware */<br>
>>> + for (unsigned msaa_mode = 32; msaa_mode >= 1; msaa_mode = msaa_mode/2) {<br>
>><br>
>><br>
>> This should probably start at MaxFramebufferSamples right? Also<br>
>> msaa_mode >= num_samples? [then you can get rid of the if below]<br>
><br>
><br>
> I did it in this manner because I don't trust all C compilers to warn sufficiently<br>
> on `num_samples' overflows turning this into a infinite loop even though it is a<br>
> unsigned type. The micro-optimization serves no purpose because the optimizer will<br>
> trivially reduce the loop down, not that it has that many iterations any way. The<br>
> loop as-is is both well bounded and deterministic, nice qualities to have.<br>
><br>
> "Premature optimization is the root of all evil" ~ Donald Knuth's.<br>
><br>
></p>
<p dir="ltr">I was going for clarity and simplicity, not runtime efficiency. Fewer lines of code to read, fewer conditions. For loop semantics are fairly well defined, compilers tend to get those things right.</p>
<p dir="ltr">>><br>
>> And lastly I don't know if it's a valid assumption that we can always<br>
>> just divide by 2. That said, I don't know of any hw that actually<br>
>> supports non-power-of-two MSAA levels, so perhaps it's OK.<br>
><br>
><br>
> You are way over engineering here; it is a totally reasonable assumption and if such<br>
> hardware does exist which we would support (I can`t see any in the tree currently as<br>
> far as I am aware) then they can provide follow up fixes.</p>
<p dir="ltr">I'm flexible on this one... If no one else cares, I don't care either.</p>
<p dir="ltr">><br>
><br>
>><br>
>>> + assert(!(msaa_mode > 32 || msaa_mode == 0)); /* be safe from int overflows */<br>
>>> + if (msaa_mode >= num_samples) {<br>
>>> + /**<br>
>>> + * For ARB_framebuffer_no_attachment, A format of<br>
>>> + * PIPE_FORMAT_NONE implies what number of samples is<br>
>>> + * supported for a framebuffer with no attachment. Thus the<br>
>>> + * drivers callback must be adjusted for this.<br>
>>> + */<br>
>>> + msaa_mode_supported = screen->is_format_supported(screen,<br>
>>> + PIPE_FORMAT_NONE, PIPE_TEXTURE_2D,<br>
>>> + msaa_mode, PIPE_BIND_RENDER_TARGET);<br>
>>> + /**<br>
>>> + * Check if the MSAA mode that is higher than the requested<br>
>>> + * num_samples is supported, and if so returning it.<br>
>>> + */<br>
>>> + if (msaa_mode_supported)<br>
>>> + quantized_samples = msaa_mode;<br>
>>> + }<br>
>>> + }<br>
>>> +<br>
>>> + return quantized_samples;<br>
>>> +}<br>
>>><br>
>>> /**<br>
>>> * Update framebuffer state (color, depth, stencil, etc. buffers)<br>
>>> @@ -72,6 +110,8 @@ static void<br>
>>> update_framebuffer_state( struct st_context *st )<br>
>>> {<br>
>>> struct pipe_framebuffer_state *framebuffer = &st->state.framebuffer;<br>
>>> + struct pipe_context *pipe = st->pipe;<br>
>>> + struct pipe_screen *screen = pipe->screen;<br>
>>> struct gl_framebuffer *fb = st->ctx->DrawBuffer;<br>
>>> struct st_renderbuffer *strb;<br>
>>> GLuint i;<br>
>>> @@ -82,6 +122,17 @@ update_framebuffer_state( struct st_context *st )<br>
>>> framebuffer->width = UINT_MAX;<br>
>>> framebuffer->height = UINT_MAX;<br>
>>><br>
>>> + /**<br>
>>> + * Quantize the derived default number of samples:<br>
>>> + *<br>
>>> + * A query to the driver of supported MSAA values the<br>
>>> + * hardware supports is done as to legalize the number<br>
>>> + * of application requested samples, NumSamples.<br>
>>> + * See commit eb9cf3c for more information.<br>
>>> + */<br>
>>> + fb->DefaultGeometry._NumSamples =<br>
>>> + framebuffer_quantize_num_samples(screen, fb->DefaultGeometry.NumSamples);<br>
>>> +<br>
>>> /*printf("------ fb size %d x %d\n", fb->Width, fb->Height);*/<br>
>>><br>
>>> /* Examine Mesa's ctx->DrawBuffer->_ColorDrawBuffers state<br>
>>> --<br>
>>> 2.5.5<br>
>>><br>
>>> _______________________________________________<br>
>>> mesa-dev mailing list<br>
>>> <a href="mailto:mesa-dev@lists.freedesktop.org">mesa-dev@lists.freedesktop.org</a><br>
>>> <a href="https://lists.freedesktop.org/mailman/listinfo/mesa-dev">https://lists.freedesktop.org/mailman/listinfo/mesa-dev</a><br>
><br>
><br>
</p>