[Mesa-dev] [PATCH] radeonsi/compute: Fix LDS size calculation

Michel Dänzer michel at daenzer.net
Thu Aug 22 08:42:34 PDT 2013


On Don, 2013-08-22 at 11:25 -0400, Tom Stellard wrote:
> From: Tom Stellard <thomas.stellard at amd.com>
> 
> We need to include the number of LDS bytes allocated by the state tracker.
> ---
>  src/gallium/drivers/radeonsi/radeonsi_compute.c | 8 +++++++-
>  1 file changed, 7 insertions(+), 1 deletion(-)
> 
> diff --git a/src/gallium/drivers/radeonsi/radeonsi_compute.c b/src/gallium/drivers/radeonsi/radeonsi_compute.c
> index 10309ba..1adb32f 100644
> --- a/src/gallium/drivers/radeonsi/radeonsi_compute.c
> +++ b/src/gallium/drivers/radeonsi/radeonsi_compute.c
> @@ -191,7 +191,13 @@ static void radeonsi_launch_grid(
>  		| S_00B84C_TGID_Z_EN(1)
>  		| S_00B84C_TG_SIZE_EN(1)
>  		| S_00B84C_TIDIG_COMP_CNT(2)
> -		| S_00B84C_LDS_SIZE(shader->lds_size)
> +		/* XXX: We are over allocating LDS.  The shader reports LDS in
> +		 * blocks of 256 bytes, so if there are 4 bytes lds allocated in
> +		 * the shader and 4 bytes allocated by the state tracker, then
> +		 * we will set LDS_SIZE to 512 bytes rather than 256.
> +		 */
> +		| S_00B84C_LDS_SIZE(shader->lds_size +
> +				(align(program->local_size, 256) >> 8) + 1)

What's the + 1 at the end for?


-- 
Earthling Michel Dänzer           |                   http://www.amd.com
Libre software enthusiast         |          Debian, X and DRI developer



More information about the mesa-dev mailing list