[PATCH 1/2] drm/vc4: Correct lbm size and calculation

Dave Stevenson dave.stevenson at raspberrypi.com
Fri Jan 22 10:12:23 UTC 2021


Hi Maxime

On Thu, 21 Jan 2021 at 10:58, Maxime Ripard <maxime at cerno.tech> wrote:
>
> From: Dom Cobley <popcornmix at gmail.com>
>
> LBM base address is measured in units of pixels per cycle.
> That is 4 for 2711 (hvs5) and 2 for 2708.
>
> We are wasting 75% of lbm by indexing without the scaling.
> But we were also using too high a size for the lbm resulting
> in partial corruption (right hand side) of vertically
> scaled images, usually at 4K or lower resolutions with more layers.
>
> The physical RAM of LBM on 2711 is 8 * 1920 * 16 * 12-bit
> (pixels are stored 12-bits per component regardless of format).
>
> The LBM adress indexes work in units of pixels per clock,
> so for 4 pixels per clock that means we have 32 * 1920 = 60K
>
> Fixes: c54619b0bfb3 ("drm/vc4: Add support for the BCM2711 HVS5")
> Signed-off-by: Dom Cobley <popcornmix at gmail.com>
> Signed-off-by: Maxime Ripard <maxime at cerno.tech>

Reviewed-by: Dave Stevenson <dave.stevenson at raspberrypi.com>

> ---
>  drivers/gpu/drm/vc4/vc4_hvs.c   | 8 ++++----
>  drivers/gpu/drm/vc4/vc4_plane.c | 7 ++++++-
>  2 files changed, 10 insertions(+), 5 deletions(-)
>
> diff --git a/drivers/gpu/drm/vc4/vc4_hvs.c b/drivers/gpu/drm/vc4/vc4_hvs.c
> index 2b3a597fa65f..c239045e05d6 100644
> --- a/drivers/gpu/drm/vc4/vc4_hvs.c
> +++ b/drivers/gpu/drm/vc4/vc4_hvs.c
> @@ -622,11 +622,11 @@ static int vc4_hvs_bind(struct device *dev, struct device *master, void *data)
>          * for now we just allocate globally.
>          */
>         if (!hvs->hvs5)
> -               /* 96kB */
> -               drm_mm_init(&hvs->lbm_mm, 0, 96 * 1024);
> +               /* 48k words of 2x12-bit pixels */
> +               drm_mm_init(&hvs->lbm_mm, 0, 48 * 1024);
>         else
> -               /* 70k words */
> -               drm_mm_init(&hvs->lbm_mm, 0, 70 * 2 * 1024);
> +               /* 60k words of 4x12-bit pixels */
> +               drm_mm_init(&hvs->lbm_mm, 0, 60 * 1024);
>
>         /* Upload filter kernels.  We only have the one for now, so we
>          * keep it around for the lifetime of the driver.
> diff --git a/drivers/gpu/drm/vc4/vc4_plane.c b/drivers/gpu/drm/vc4/vc4_plane.c
> index 6bd8260aa9f2..b98eabb52920 100644
> --- a/drivers/gpu/drm/vc4/vc4_plane.c
> +++ b/drivers/gpu/drm/vc4/vc4_plane.c
> @@ -437,6 +437,7 @@ static void vc4_write_ppf(struct vc4_plane_state *vc4_state, u32 src, u32 dst)
>  static u32 vc4_lbm_size(struct drm_plane_state *state)
>  {
>         struct vc4_plane_state *vc4_state = to_vc4_plane_state(state);
> +       struct vc4_dev *vc4 = to_vc4_dev(state->plane->dev);
>         u32 pix_per_line;
>         u32 lbm;
>
> @@ -472,7 +473,11 @@ static u32 vc4_lbm_size(struct drm_plane_state *state)
>                 lbm = pix_per_line * 16;
>         }
>
> -       lbm = roundup(lbm, 32);
> +       /* Align it to 64 or 128 (hvs5) bytes */
> +       lbm = roundup(lbm, vc4->hvs->hvs5 ? 128 : 64);
> +
> +       /* Each "word" of the LBM memory contains 2 or 4 (hvs5) pixels */
> +       lbm /= vc4->hvs->hvs5 ? 4 : 2;
>
>         return lbm;
>  }
> --
> 2.29.2
>


More information about the dri-devel mailing list