[Mesa-dev] [PATCH v4 13/14] nvc0: expose ARB_compute_variable_group_size

Ilia Mirkin imirkin at alum.mit.edu
Wed Oct 5 18:57:19 UTC 2016


On Wed, Oct 5, 2016 at 2:48 PM, Samuel Pitoiset
<samuel.pitoiset at gmail.com> wrote:
> Only expose 512 threads/block on Fermi to not be limited by
> 32 GPRs/thread.
>
> v4: - use 512 threads on Fermi, 2014 on Kepler+

Dyslexics... untie!

>
> Signed-off-by: Samuel Pitoiset <samuel.pitoiset at gmail.com>
> ---
>  src/gallium/drivers/nouveau/nvc0/nvc0_screen.c | 8 ++++++--
>  1 file changed, 6 insertions(+), 2 deletions(-)
>
> diff --git a/src/gallium/drivers/nouveau/nvc0/nvc0_screen.c b/src/gallium/drivers/nouveau/nvc0/nvc0_screen.c
> index df6c6af..afcb08b 100644
> --- a/src/gallium/drivers/nouveau/nvc0/nvc0_screen.c
> +++ b/src/gallium/drivers/nouveau/nvc0/nvc0_screen.c
> @@ -448,6 +448,12 @@ nvc0_screen_get_compute_param(struct pipe_screen *pscreen,
>        RET(((uint64_t []) { 1024, 1024, 64 }));
>     case PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK:
>        RET((uint64_t []) { 1024 });
> +   case PIPE_COMPUTE_CAP_MAX_VARIABLE_THREADS_PER_BLOCK:
> +      if (obj_class >= NVE4_COMPUTE_CLASS) {
> +         RET((uint64_t []) { 1024 });
> +      } else {
> +         RET((uint64_t []) { 512 });
> +      }
>     case PIPE_COMPUTE_CAP_MAX_GLOBAL_SIZE: /* g[] */
>        RET((uint64_t []) { 1ULL << 40 });
>     case PIPE_COMPUTE_CAP_MAX_LOCAL_SIZE: /* s[] */
> @@ -478,8 +484,6 @@ nvc0_screen_get_compute_param(struct pipe_screen *pscreen,
>        RET((uint32_t []) { 512 }); /* FIXME: arbitrary limit */
>     case PIPE_COMPUTE_CAP_ADDRESS_BITS:
>        RET((uint32_t []) { 64 });
> -   case PIPE_COMPUTE_CAP_MAX_VARIABLE_THREADS_PER_BLOCK:
> -      RET((uint64_t []) { 0 });
>     default:
>        return 0;
>     }
> --
> 2.10.0
>
> _______________________________________________
> mesa-dev mailing list
> mesa-dev at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/mesa-dev


More information about the mesa-dev mailing list