[Mesa-dev] [PATCH] nir: add lowering for gl_HelperInvocation

Rob Clark robdclark at gmail.com
Mon Jun 25 18:38:46 UTC 2018


On Mon, Jun 25, 2018 at 2:29 PM, Ilia Mirkin <imirkin at alum.mit.edu> wrote:
> On Mon, Jun 25, 2018 at 2:23 PM, Ian Romanick <idr at freedesktop.org> wrote:
>> On 06/19/2018 08:24 AM, Rob Clark wrote:
>>> v2: reword comment about lower_helper_invocations to be more clear
>>>     that it might not work on all hardware
>>> v3: add special variant of load_sample_id which does not imply per-
>>>     sample shading
>>>
>>> Signed-off-by: Rob Clark <robdclark at gmail.com>
>>> ---
>>> Yeah, new intrinsic name is a bit funny sounding, but I couldn't
>>> think of anything better.
>>>
>>>  src/compiler/nir/nir.h                          | 12 ++++++++++++
>>>  src/compiler/nir/nir_intrinsics.py              |  3 +++
>>>  src/compiler/nir/nir_lower_system_values.c      | 17 +++++++++++++++++
>>>  .../drivers/freedreno/ir3/ir3_compiler_nir.c    |  1 +
>>>  src/gallium/drivers/freedreno/ir3/ir3_nir.c     |  1 +
>>>  5 files changed, 34 insertions(+)
>>>
>>> diff --git a/src/compiler/nir/nir.h b/src/compiler/nir/nir.h
>>> index 2d620454796..cb9a4af64cf 100644
>>> --- a/src/compiler/nir/nir.h
>>> +++ b/src/compiler/nir/nir.h
>>> @@ -1981,6 +1981,18 @@ typedef struct nir_shader_compiler_options {
>>>      */
>>>     bool lower_base_vertex;
>>>
>>> +   /**
>>> +    * If enabled, gl_HelperInvocation will be lowered as:
>>> +    *
>>> +    *   !((1 << sample_id) & sample_mask_in))
>>
>> I recently encountered something related to this.  Issue #22 of the
>> GL_ARB_shader_image_load_store spec suggests using (gl_SampleMaskIn[0]
>> == 0) to detect a helper invocation.  There's a note that
>> implementations that support > 32 samples will also need to check
>> gl_SampleMaskIn[1].

note that for adreno I just have up to 4x samples.. in fact, do any
mesa drivers support > 32 samples?

>  Did you try that?  It should generate less code.
>>
>> See also piglit commit c89439fcaba7259b25d090df43a6bf2aac6483d7.
>
> Based on what I've seen of adreno, that sysval is likely actually not
> gl_SampleMaskIn, but rather a full pixel coverage mask. This only
> matters with sample-rate shading which I don't think Rob has
> investigated yet.
>
> I think gl_SampleMaskIn[0] == 0 should work in the general case, but I
> suspect that freedreno will have to implement gl_SampleMaskIn as
> covmask & (1 << sampleid) for sample-rate shading.
>

I didn't try that but I think gl_SampleMaskIn[0] will work as long as
there isn't sample-rate shading.  Blob always does 'covmask & (1 <<
sampleid)'.  But I guess as long as there isn't any GL state that can
also trigger sample-rate shading (?), I could switch lowering based on
whether shader reads sample_id, and use the simpler
gl_SampleMaskIn[0]==0 in the non-sample-rate case...

BR,
-R

> Cheers,
>
>   -ilia


More information about the mesa-dev mailing list