[PATCH v4 08/13] drm/msm/dpu: add support for virtual planes

Fri Jun 7 21:39:29 UTC 2024

On 6/7/2024 2:10 PM, Dmitry Baryshkov wrote:
> On Fri, Jun 07, 2024 at 12:22:16PM -0700, Abhinav Kumar wrote:
>>
>>
>> On 6/7/2024 12:16 AM, Dmitry Baryshkov wrote:
>>> On Thu, Jun 06, 2024 at 03:21:11PM -0700, Abhinav Kumar wrote:
>>>> On 3/13/2024 5:02 PM, Dmitry Baryshkov wrote:
>>>>> Only several SSPP blocks support such features as YUV output or scaling,
>>>>> thus different DRM planes have different features.  Properly utilizing
>>>>> all planes requires the attention of the compositor, who should
>>>>> prefer simpler planes to YUV-supporting ones. Otherwise it is very easy
>>>>> to end up in a situation when all featureful planes are already
>>>>> allocated for simple windows, leaving no spare plane for YUV playback.
>>>>>
>>>>> To solve this problem make all planes virtual. Each plane is registered
>>>>> as if it supports all possible features, but then at the runtime during
>>>>> the atomic_check phase the driver selects backing SSPP block for each
>>>>> plane.
>>>>>
>>>>> Note, this does not provide support for using two different SSPP blocks
>>>>> for a single plane or using two rectangles of an SSPP to drive two
>>>>> planes. Each plane still gets its own SSPP and can utilize either a solo
>>>>> rectangle or both multirect rectangles depending on the resolution.
>>>>>
>>>>> Note #2: By default support for virtual planes is turned off and the
>>>>> driver still uses old code path with preallocated SSPP block for each
>>>>> plane. To enable virtual planes, pass 'msm.dpu_use_virtual_planes=1'
>>>>> kernel parameter.
>>>>>
>>>>
>>>> I like the overall approach in this patch. Some comments below.
>>>>
>>>>> Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov at linaro.org>
>>>>> ---
>>>>>     drivers/gpu/drm/msm/disp/dpu1/dpu_crtc.c  |  50 +++++
>>>>>     drivers/gpu/drm/msm/disp/dpu1/dpu_kms.c   |  10 +-
>>>>>     drivers/gpu/drm/msm/disp/dpu1/dpu_kms.h   |   4 +
>>>>>     drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c | 230 +++++++++++++++++++---
>>>>>     drivers/gpu/drm/msm/disp/dpu1/dpu_plane.h |  19 ++
>>>>>     drivers/gpu/drm/msm/disp/dpu1/dpu_rm.c    |  77 ++++++++
>>>>>     drivers/gpu/drm/msm/disp/dpu1/dpu_rm.h    |  28 +++
>>>>>     7 files changed, 390 insertions(+), 28 deletions(-)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_crtc.c b/drivers/gpu/drm/msm/disp/dpu1/dpu_crtc.c
>>>>> index 88c2e51ab166..794c5643584f 100644
>>>>> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_crtc.c
>>>>> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_crtc.c
>>>>> @@ -1168,6 +1168,49 @@ static bool dpu_crtc_needs_dirtyfb(struct drm_crtc_state *cstate)
>>>>>     	return false;
>>>>>     }
>>>>> +static int dpu_crtc_reassign_planes(struct drm_crtc *crtc, struct drm_crtc_state *crtc_state)
>>>>> +{
>>>>> +	int total_planes = crtc->dev->mode_config.num_total_plane;
>>>>> +	struct drm_atomic_state *state = crtc_state->state;
>>>>> +	struct dpu_global_state *global_state;
>>>>> +	struct drm_plane_state **states;
>>>>> +	struct drm_plane *plane;
>>>>> +	int ret;
>>>>> +
>>>>> +	global_state = dpu_kms_get_global_state(crtc_state->state);
>>>>> +	if (IS_ERR(global_state))
>>>>> +		return PTR_ERR(global_state);
>>>>> +
>>>>> +	dpu_rm_release_all_sspp(global_state, crtc);
>>>>> +
>>>>
>>>> Do we need to call dpu_rm_release_all_sspp() even in the
>>>> _dpu_plane_atomic_disable()?
>>>
>>> It allows the driver to optimize the usage of the SSPP rectangles.
>>>
>>
>> No, what I meant was that we should call dpu_rm_release_all_sspp() in
>> dpu_plane_atomic_update() as well because in the atomic_check() path where
>> its called today, its being called only for zpos_changed and planes_changed
>> but during disable we must call this for sure.
> 
> No. the dpu_rm_release_all_sspp() should only be called during check.
> When dpu_plane_atomic_update() is called, the state should already be
> finalised. The atomic_check() callback is called when a plane is going
> to be disabled.
> 

atomic_check() will be called when plane is disabled but 
dpu_rm_release_all_sspp() may not be called as it is protected by 
zpos_changed and planes_changed. OR you need to add a !visible check 
here to call dpu_rm_release_all_sspp() that time. Thats whay I wrote 
previously.

>>
>>>>
>>>>> +	if (!crtc_state->enable)
>>>>> +		return 0;
>>>>> +
>>>>> +	states = kcalloc(total_planes, sizeof(*states), GFP_KERNEL);
>>>>> +	if (!states)
>>>>> +		return -ENOMEM;
>>>>> +
>>>>> +	drm_atomic_crtc_state_for_each_plane(plane, crtc_state) {
>>>>> +		struct drm_plane_state *plane_state =
>>>>> +			drm_atomic_get_plane_state(state, plane);
>>>>> +
>>>>> +		if (IS_ERR(plane_state)) {
>>>>> +			ret = PTR_ERR(plane_state);
>>>>> +			goto done;
>>>>> +		}
>>>>> +
>>>>> +		states[plane_state->normalized_zpos] = plane_state;
>>>>> +	}
>>>>> +
>>>>> +	ret = dpu_assign_plane_resources(global_state, state, crtc, states, total_planes);
>>>>> +
>>>>> +done:
>>>>> +	kfree(states);
>>>>> +	return ret;
>>>>> +
>>>>> +	return 0;
>>>>> +}
>>>>> +
>>>>>     static int dpu_crtc_atomic_check(struct drm_crtc *crtc,
>>>>>     		struct drm_atomic_state *state)
>>>>>     {
>>>>> @@ -1183,6 +1226,13 @@ static int dpu_crtc_atomic_check(struct drm_crtc *crtc,
>>>>>     	bool needs_dirtyfb = dpu_crtc_needs_dirtyfb(crtc_state);
>>>>> +	if (dpu_use_virtual_planes &&
>>>>> +	    (crtc_state->planes_changed || crtc_state->zpos_changed)) {
>>>>
>>>> Here, I assume you are relying on DRM to set zpos_changed. But can you
>>>> please elaborate why we have to reassign planes when zpos_changes?
>>>
>>> Because the SSPP might be split between two planes. If zpos has changed
>>> we might have to break this split and use two different SSPPs for those
>>> planes.
>>>
>>
>> Got it. But that support has not been added yet so belongs to a later
>> patchset?
> 
> Yes and no. This patch still fills the plane states array following the
> zpos order. If the z-order changes, the result of
> dpu_assign_plane_resources() might change too. The actual algorithm that
> assigns SSPPs is a different code unit. So, I do not want to get back to
> this code while changing the dpu_assign_plane_resources()
> implementation.
> 

Not entirely convinced but lets see.

>>
>>>>
>>>>> +		rc = dpu_crtc_reassign_planes(crtc, crtc_state);
>>>>> +		if (rc < 0)
>>>>> +			return rc;
>>>>> +	}
>>>>> +
>>>>>     	if (!crtc_state->enable || !drm_atomic_crtc_effectively_active(crtc_state)) {
>>>>>     		DRM_DEBUG_ATOMIC("crtc%d -> enable %d, active %d, skip atomic_check\n",
>>>>>     				crtc->base.id, crtc_state->enable,
>>>>> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_kms.c b/drivers/gpu/drm/msm/disp/dpu1/dpu_kms.c
>>>>> index 9a1fe6868979..becdd98f3c40 100644
>>>>> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_kms.c
>>>>> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_kms.c
>>>>> @@ -51,6 +51,9 @@
>>>>>     #define DPU_DEBUGFS_DIR "msm_dpu"
>>>>>     #define DPU_DEBUGFS_HWMASKNAME "hw_log_mask"
>>>>> +bool dpu_use_virtual_planes = false;
>>>>> +module_param(dpu_use_virtual_planes, bool, 0);
>>>>> +
>>>>>     static int dpu_kms_hw_init(struct msm_kms *kms);
>>>>>     static void _dpu_kms_mmu_destroy(struct dpu_kms *dpu_kms);
>>>>> @@ -770,8 +773,11 @@ static int _dpu_kms_drm_obj_init(struct dpu_kms *dpu_kms)
>>>>>     			  type, catalog->sspp[i].features,
>>>>>     			  catalog->sspp[i].features & BIT(DPU_SSPP_CURSOR));
>>>>> -		plane = dpu_plane_init(dev, catalog->sspp[i].id, type,
>>>>> -				       (1UL << max_crtc_count) - 1);
>>>>> +		if (dpu_use_virtual_planes)
>>>>> +			plane = dpu_plane_init_virtual(dev, type, (1UL << max_crtc_count) - 1);
>>>>> +		else
>>>>> +			plane = dpu_plane_init(dev, catalog->sspp[i].id, type,
>>>>> +					       (1UL << max_crtc_count) - 1);
>>>>>     		if (IS_ERR(plane)) {
>>>>>     			DPU_ERROR("dpu_plane_init failed\n");
>>>>>     			ret = PTR_ERR(plane);
>>>>> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_kms.h b/drivers/gpu/drm/msm/disp/dpu1/dpu_kms.h
>>>>> index e2adc937ea63..195257660057 100644
>>>>> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_kms.h
>>>>> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_kms.h
>>>>> @@ -64,6 +64,8 @@
>>>>>     #define ktime_compare_safe(A, B) \
>>>>>     	ktime_compare(ktime_sub((A), (B)), ktime_set(0, 0))
>>>>> +extern bool dpu_use_virtual_planes;
>>>>> +
>>>>>     struct dpu_kms {
>>>>>     	struct msm_kms base;
>>>>>     	struct drm_device *dev;
>>>>> @@ -138,6 +140,8 @@ struct dpu_global_state {
>>>>>     	uint32_t dspp_to_enc_id[DSPP_MAX - DSPP_0];
>>>>>     	uint32_t dsc_to_enc_id[DSC_MAX - DSC_0];
>>>>>     	uint32_t cdm_to_enc_id;
>>>>> +
>>>>> +	uint32_t sspp_to_crtc_id[SSPP_MAX - SSPP_NONE];
>>>>
>>>> I will re-visit this sspp_to_crtc_id mapping after checking the rest of the
>>>> patches.
>>>>
>>>>>     };
>>>>>     struct dpu_global_state
>>>>> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c b/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
>>>>> index a41ffa2d774b..2961b809ccf3 100644
>>>>> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
>>>>> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.c
>>>>> @@ -876,7 +876,7 @@ static int dpu_plane_atomic_check_nopipe(struct drm_plane *plane,
>>>>>     	drm_rect_rotate_inv(&pipe_cfg->src_rect,
>>>>>     			    new_plane_state->fb->width, new_plane_state->fb->height,
>>>>>     			    new_plane_state->rotation);
>>>>> -	if (r_pipe_cfg->src_rect.x1 != 0)
>>>>> +	if (drm_rect_width(&r_pipe_cfg->src_rect) != 0)
>>>>>     		drm_rect_rotate_inv(&r_pipe_cfg->src_rect,
>>>>>     				    new_plane_state->fb->width, new_plane_state->fb->height,
>>>>>     				    new_plane_state->rotation);
>>>>> @@ -942,7 +942,7 @@ static int dpu_plane_atomic_check_pipes(struct drm_plane *plane,
>>>>>     	if (ret)
>>>>>     		return ret;
>>>>> -	if (r_pipe_cfg->src_rect.x1 != 0) {
>>>>> +	if (drm_rect_width(&r_pipe_cfg->src_rect) != 0) {
>>>>>     		/*
>>>>>     		 * In parallel multirect case only the half of the usual width
>>>>>     		 * is supported for tiled formats. If we are here, we know that
>>>>> @@ -1022,6 +1022,113 @@ static int dpu_plane_atomic_check(struct drm_plane *plane,
>>>>>     	return dpu_plane_atomic_check_pipes(plane, state, crtc_state);
>>>>>     }
>>>>
>>>> This part should goto patch 6 right?
>>>
>>> Why? This is only relevant for the virtual planes.
>>>
>>
>> Patch 6 introducted the if (r_pipe_cfg->src_rect.x1 != 0) checks from
>> afaict. Was that to ensure there was a valid src_rect for the r_pipe before
>> invoking drm_rect_rotate_inv()?
>>
>> This patch changes it from using x1!=0 to using drm_rect_width().
>>
>> Why cant this check be moved to that patch itself is my question.
> 
> I now understand your concern. Yes, this should (and in fact must) be
> moved to that patch.
> 
>>
>>>>
>>>>> +static int dpu_plane_virtual_atomic_check(struct drm_plane *plane,
>>>>> +					  struct drm_atomic_state *state)
>>>>> +{
>>>>> +	struct drm_plane_state *plane_state =
>>>>> +		drm_atomic_get_plane_state(state, plane);
>>>>> +	struct dpu_plane_state *pstate = to_dpu_plane_state(plane_state);
>>>>> +	const struct dpu_format *format;
>>>>> +	struct drm_crtc_state *crtc_state;
>>>>> +	int ret;
>>>>> +
>>>>> +	if (plane_state->crtc)
>>>>> +		crtc_state = drm_atomic_get_new_crtc_state(state,
>>>>> +							   plane_state->crtc);
>>>>> +
>>>>> +	ret = dpu_plane_atomic_check_nopipe(plane, plane_state, crtc_state);
>>>>> +	if (ret)
>>>>> +		return ret;
>>>>> +
>>>>> +	if (!plane_state->visible) {
>>>>> +		/*
>>>>> +		 * resources are freed by dpu_crtc_assign_plane_resources(),
>>>>> +		 * but clean them here.
>>>>> +		 */
>>>>> +		pstate->pipe.sspp = NULL;
>>>>> +		pstate->r_pipe.sspp = NULL;
>>>>> +
>>>>> +		return 0;
>>>>> +	}
>>>>> +
>>>>> +	format = to_dpu_format(msm_framebuffer_format(plane_state->fb));
>>>>> +
>>>>> +	/* force resource reallocation if the format of FB has changed */
>>>>> +	if (pstate->saved_fmt != format) {
>>>>> +		crtc_state->planes_changed = true;
>>>>
>>>> planes_changed means planes on this CRTC are updated. We are using this to
>>>> track that the underlying SSPP of the plane needs to be changed?
>>>>
>>>> Is this still correct because this might conflict with the DRM's expectation
>>>> of planes_changed.
>>>
>>> No, it still means that the planes were changed. DRM doesn't seem to
>>> care about the format changes. We do.
>>>
>>
>> Yes, I am aware that we need to detect format changes and re-assign SSPP if
>> needed if the SSPP does not support the format which was requested.
>>
>> My question was whether planes_changed is the right way to track that
>> because that is being used by DRM to track whether the plane's CRTC changed
>> so I dont know whether its appropriate to overload that meaning.
> 
> planes_changed is a hint that planes were updated. It is not limited to
> just toggling the planes.
> 
>> Also from the other comment below, since we are dropping saved_fmt, do we
>> need to retain planes_changed setting?
> 
> Yes. We still need to track the YUV <-> RGB-related changes.
> 

Ok, got it.

>>
>>>>
>>>>> +		pstate->saved_fmt = format;
>>>>> +	}
>>>>> +
>>>>> +	return 0;
>>>>> +}
>>>>> +
>>>>> +static int dpu_plane_virtual_assign_resources(struct drm_crtc *crtc,
>>>>> +					      struct dpu_global_state *global_state,
>>>>> +					      struct drm_atomic_state *state,
>>>>> +					      struct drm_plane_state *plane_state)
>>>>> +{
>>>>> +	const struct drm_crtc_state *crtc_state = NULL;
>>>>> +	struct drm_plane *plane = plane_state->plane;
>>>>> +	struct dpu_kms *dpu_kms = _dpu_plane_get_kms(plane);
>>>>> +	struct dpu_rm_sspp_requirements reqs;
>>>>> +	struct dpu_plane_state *pstate;
>>>>> +	struct dpu_sw_pipe *pipe;
>>>>> +	struct dpu_sw_pipe *r_pipe;
>>>>> +	const struct dpu_format *fmt;
>>>>> +
>>>>> +	if (plane_state->crtc)
>>>>> +		crtc_state = drm_atomic_get_new_crtc_state(state,
>>>>> +							   plane_state->crtc);
>>>>> +
>>>>> +	pstate = to_dpu_plane_state(plane_state);
>>>>> +	pipe = &pstate->pipe;
>>>>> +	r_pipe = &pstate->r_pipe;
>>>>> +
>>>>> +	pipe->sspp = NULL;
>>>>> +	r_pipe->sspp = NULL;
>>>>> +
>>>>> +	if (!plane_state->fb)
>>>>> +		return -EINVAL;
>>>>> +
>>>>> +	fmt = to_dpu_format(msm_framebuffer_format(plane_state->fb));
>>>>> +	reqs.yuv = DPU_FORMAT_IS_YUV(fmt);
>>>>> +	reqs.scale = (plane_state->src_w >> 16 != plane_state->crtc_w) ||
>>>>> +		(plane_state->src_h >> 16 != plane_state->crtc_h);
>>>>> +
>>>>> +	reqs.rot90 = drm_rotation_90_or_270(plane_state->rotation);
>>>>> +
>>>>> +	pipe->sspp = dpu_rm_reserve_sspp(&dpu_kms->rm, global_state, crtc, &reqs);
>>>>> +	if (!pipe->sspp)
>>>>> +		return -ENODEV;
>>>>> +
>>>>> +	return dpu_plane_atomic_check_pipes(plane, state, crtc_state);
>>>>> +}
>>>>> +
>>>>> +int dpu_assign_plane_resources(struct dpu_global_state *global_state,
>>>>> +			       struct drm_atomic_state *state,
>>>>> +			       struct drm_crtc *crtc,
>>>>> +			       struct drm_plane_state **states,
>>>>> +			       unsigned int num_planes)
>>>>> +{
>>>>> +	unsigned int i;
>>>>> +	int ret;
>>>>> +
>>>>> +	for (i = 0; i < num_planes; i++) {
>>>>> +		struct drm_plane_state *plane_state = states[i];
>>>>> +
>>>>> +		if (!plane_state ||
>>>>> +		    !plane_state->visible)
>>>>> +			continue;
>>>>> +
>>>>> +		ret = dpu_plane_virtual_assign_resources(crtc, global_state,
>>>>> +							 state, plane_state);
>>>>> +		if (ret)
>>>>> +			break;
>>>>> +	}
>>>>> +
>>>>> +	return ret;
>>>>> +}
>>>>> +
>>>>>     static void dpu_plane_flush_csc(struct dpu_plane *pdpu, struct dpu_sw_pipe *pipe)
>>>>>     {
>>>>>     	const struct dpu_format *format =
>>>>> @@ -1342,12 +1449,14 @@ static void dpu_plane_atomic_print_state(struct drm_printer *p,
>>>>>     	drm_printf(p, "\tstage=%d\n", pstate->stage);
>>>>> -	drm_printf(p, "\tsspp[0]=%s\n", pipe->sspp->cap->name);
>>>>> -	drm_printf(p, "\tmultirect_mode[0]=%s\n", dpu_get_multirect_mode(pipe->multirect_mode));
>>>>> -	drm_printf(p, "\tmultirect_index[0]=%s\n",
>>>>> -		   dpu_get_multirect_index(pipe->multirect_index));
>>>>> -	drm_printf(p, "\tsrc[0]=" DRM_RECT_FMT "\n", DRM_RECT_ARG(&pipe_cfg->src_rect));
>>>>> -	drm_printf(p, "\tdst[0]=" DRM_RECT_FMT "\n", DRM_RECT_ARG(&pipe_cfg->dst_rect));
>>>>> +	if (pipe->sspp) {
>>>>> +		drm_printf(p, "\tsspp[0]=%s\n", pipe->sspp->cap->name);
>>>>> +		drm_printf(p, "\tmultirect_mode[0]=%s\n", dpu_get_multirect_mode(pipe->multirect_mode));
>>>>> +		drm_printf(p, "\tmultirect_index[0]=%s\n",
>>>>> +			   dpu_get_multirect_index(pipe->multirect_index));
>>>>> +		drm_printf(p, "\tsrc[0]=" DRM_RECT_FMT "\n", DRM_RECT_ARG(&pipe_cfg->src_rect));
>>>>> +		drm_printf(p, "\tdst[0]=" DRM_RECT_FMT "\n", DRM_RECT_ARG(&pipe_cfg->dst_rect));
>>>>> +	}
>>>>
>>>> I dont mind this being pushed out as a separate patch to protect pipe->sspp.
>>>> Even though it is static assignment today, there is no harm against adding
>>>> this protection even today IMO.
>>>
>>> No, it doesn't make sense. Currently pipe->sspp is always valid.
>>> Unnecessary checks do harm, they make you think that the condition they
>>> are checking might be invalid.
>>>
>>
>> Ack.
>>
>>>>
>>>>>     	if (r_pipe->sspp) {
>>>>>     		drm_printf(p, "\tsspp[1]=%s\n", r_pipe->sspp->cap->name);
>>>>> @@ -1436,31 +1545,29 @@ static const struct drm_plane_helper_funcs dpu_plane_helper_funcs = {
>>>>>     		.atomic_update = dpu_plane_atomic_update,
>>>>>     };
>>>>> +static const struct drm_plane_helper_funcs dpu_plane_virtual_helper_funcs = {
>>>>> +	.prepare_fb = dpu_plane_prepare_fb,
>>>>> +	.cleanup_fb = dpu_plane_cleanup_fb,
>>>>> +	.atomic_check = dpu_plane_virtual_atomic_check,
>>>>> +	.atomic_update = dpu_plane_atomic_update,
>>>>> +};
>>>>> +
>>>>>     /* initialize plane */
>>>>> -struct drm_plane *dpu_plane_init(struct drm_device *dev,
>>>>> -		uint32_t pipe, enum drm_plane_type type,
>>>>> -		unsigned long possible_crtcs)
>>>>> +static struct drm_plane *dpu_plane_init_common(struct drm_device *dev,
>>>>> +					       enum drm_plane_type type,
>>>>> +					       unsigned long possible_crtcs,
>>>>> +					       bool inline_rotation,
>>>>> +					       const uint32_t *format_list,
>>>>> +					       uint32_t num_formats,
>>>>> +					       enum dpu_sspp pipe)
>>>>>     {
>>>>>     	struct drm_plane *plane = NULL;
>>>>> -	const uint32_t *format_list;
>>>>>     	struct dpu_plane *pdpu;
>>>>>     	struct msm_drm_private *priv = dev->dev_private;
>>>>>     	struct dpu_kms *kms = to_dpu_kms(priv->kms);
>>>>> -	struct dpu_hw_sspp *pipe_hw;
>>>>> -	uint32_t num_formats;
>>>>>     	uint32_t supported_rotations;
>>>>>     	int ret;
>>>>> -	/* initialize underlying h/w driver */
>>>>> -	pipe_hw = dpu_rm_get_sspp(&kms->rm, pipe);
>>>>> -	if (!pipe_hw || !pipe_hw->cap || !pipe_hw->cap->sblk) {
>>>>> -		DPU_ERROR("[%u]SSPP is invalid\n", pipe);
>>>>> -		return ERR_PTR(-EINVAL);
>>>>> -	}
>>>>> -
>>>>> -	format_list = pipe_hw->cap->sblk->format_list;
>>>>> -	num_formats = pipe_hw->cap->sblk->num_formats;
>>>>> -
>>>>>     	pdpu = drmm_universal_plane_alloc(dev, struct dpu_plane, base,
>>>>>     				0xff, &dpu_plane_funcs,
>>>>>     				format_list, num_formats,
>>>>> @@ -1486,7 +1593,7 @@ struct drm_plane *dpu_plane_init(struct drm_device *dev,
>>>>>     	supported_rotations = DRM_MODE_REFLECT_MASK | DRM_MODE_ROTATE_0 | DRM_MODE_ROTATE_180;
>>>>> -	if (pipe_hw->cap->features & BIT(DPU_SSPP_INLINE_ROTATION))
>>>>> +	if (inline_rotation)
>>>>>     		supported_rotations |= DRM_MODE_ROTATE_MASK;
>>>>>     	drm_plane_create_rotation_property(plane,
>>>>> @@ -1494,10 +1601,81 @@ struct drm_plane *dpu_plane_init(struct drm_device *dev,
>>>>>     	drm_plane_enable_fb_damage_clips(plane);
>>>>> -	/* success! finalize initialization */
>>>>> +	DPU_DEBUG("%s created for pipe:%u id:%u\n", plane->name,
>>>>> +					pipe, plane->base.id);
>>>>> +	return plane;
>>>>> +}
>>>>> +
>>>>> +struct drm_plane *dpu_plane_init(struct drm_device *dev,
>>>>> +				 uint32_t pipe, enum drm_plane_type type,
>>>>> +				 unsigned long possible_crtcs)
>>>>> +{
>>>>> +	struct drm_plane *plane = NULL;
>>>>> +	struct msm_drm_private *priv = dev->dev_private;
>>>>> +	struct dpu_kms *kms = to_dpu_kms(priv->kms);
>>>>> +	struct dpu_hw_sspp *pipe_hw;
>>>>> +
>>>>> +	/* initialize underlying h/w driver */
>>>>> +	pipe_hw = dpu_rm_get_sspp(&kms->rm, pipe);
>>>>> +	if (!pipe_hw || !pipe_hw->cap || !pipe_hw->cap->sblk) {
>>>>> +		DPU_ERROR("[%u]SSPP is invalid\n", pipe);
>>>>> +		return ERR_PTR(-EINVAL);
>>>>> +	}
>>>>> +
>>>>> +
>>>>> +	plane = dpu_plane_init_common(dev, type, possible_crtcs,
>>>>> +				      pipe_hw->cap->features & BIT(DPU_SSPP_INLINE_ROTATION),
>>>>> +				      pipe_hw->cap->sblk->format_list,
>>>>> +				      pipe_hw->cap->sblk->num_formats,
>>>>> +				      pipe);
>>>>> +	if (IS_ERR(plane))
>>>>> +		return plane;
>>>>> +
>>>>>     	drm_plane_helper_add(plane, &dpu_plane_helper_funcs);
>>>>>     	DPU_DEBUG("%s created for pipe:%u id:%u\n", plane->name,
>>>>>     					pipe, plane->base.id);
>>>>> +
>>>>> +	return plane;
>>>>> +}
>>>>> +
>>>>> +struct drm_plane *dpu_plane_init_virtual(struct drm_device *dev,
>>>>> +					 enum drm_plane_type type,
>>>>> +					 unsigned long possible_crtcs)
>>>>> +{
>>>>> +	struct drm_plane *plane = NULL;
>>>>> +	struct msm_drm_private *priv = dev->dev_private;
>>>>> +	struct dpu_kms *kms = to_dpu_kms(priv->kms);
>>>>> +	bool has_inline_rotation = false;
>>>>> +	const u32 *format_list = NULL;
>>>>> +	u32 num_formats = 0;
>>>>> +	int i;
>>>>> +
>>>>> +	/* Determine the largest configuration that we can implement */
>>>>> +	for (i = 0; i < kms->catalog->sspp_count; i++) {
>>>>> +		const struct dpu_sspp_cfg *cfg = &kms->catalog->sspp[i];
>>>>> +
>>>>> +		if (test_bit(DPU_SSPP_INLINE_ROTATION, &cfg->features))
>>>>> +			has_inline_rotation = true;
>>>>> +
>>>>> +		if (!format_list ||
>>>>> +		    cfg->sblk->csc_blk.len) {
>>>>
>>>> But format_list is being assigned to NULL just a few lines above. Why is
>>>> this check needed?
>>>
>>> It was assigned before the loop.
>>>
>>
>> Yes, I got this part but missed on why we needed the loop at all.
> 
> Which set of formats should the virtual plane use?
> 
>>>>
>>>> I dont get why this part can also goto dpu_plane_init_common() as it looks
>>>> identical to me.
>>>
>>> And it is not. For the non-virtual case there is no loop around formats
>>> list assignment.
>>>
>>
>> Ah okay, I misunderstood the logic. After reading the comment above the loop
>> I get what you are trying to do here.
>>
>> But I dont get why you really need to do that?
>>
>> 1) In this patch the relationship between virtual plane and SSPP is still
>> 1:1 so what is wrong to retain the sspp's actual format for the plane rather
>> than picking the best format (you are targetting Vig SSPP)
> 
> No. With this patch there is no 1:1 relationship. The RM will select the
> SSPP that suits the requirements (YUV, scaling, rotation, etc).
> 

Yes but there is always only one SSPP for one plane is what I meant. 
That does not change till the next patch.

In that sense, I dont see why you need to expose the superset of formats.

>> In fact, that will reduce atomic_check() failures with this patch because
>> compositor will still work the same way as it used to work before by not
>> trying an unsupported format on a plane.
> 
> The virtual plane should support any of the formats that the backing
> hardware can support. If (for example) we only had RGB-only and YUV-only
> hardware blocks, the driver would have to construct a _superset_ of
> those formats. Fortunately this is not the case and VIG supports a
> strict superset of what DMA (or RGB) SSPP supports.
> 

Yes, thats why I said plane_formats_yuv is enough in my next point below 
because Vig is super set of DMA or IOW Vig is the most feature rich plane.

>> If one plane maps to two SSPPs, then yes we can go with the superset of
>> formats but that comes in a later patch right?
>>
>> 2) So even if we want to do it this way from this patch itself, I think all
>> you are looking for is whether there is a Vig SSPP and if so use
>> plane_formats_yuv. There is no need for this loop IMO.
>>
>> 3) I noticed that virt_format_list is still present in the driver. If you
>> are planning to not use that perhaps drop it with this series.
> 
> Ack
> 
>>
>>>>
>>>>
>>>>> +			format_list = cfg->sblk->format_list;
>>>>> +			num_formats = cfg->sblk->num_formats;
>>>>> +		}
>>>>> +	}
>>>>> +
>>>>> +	plane = dpu_plane_init_common(dev, type, possible_crtcs,
>>>>> +				      has_inline_rotation,
>>>>> +				      format_list,
>>>>> +				      num_formats,
>>>>> +				      SSPP_NONE);
>>>>
>>>> Ok, here is the part which we were discussing in
>>>>
>>>> https://patchwork.freedesktop.org/patch/582820/?series=131109&rev=1#comment_1087370
>>>>
>>>> So yes, that part belongs to this patch.
>>>
>>> I'll check it while preparing the next iteration.
>>>
>>>>
>>>>> +	if (IS_ERR(plane))
>>>>> +		return plane;
>>>>> +
>>>>> +	drm_plane_helper_add(plane, &dpu_plane_virtual_helper_funcs);
>>>>> +
>>>>> +	DPU_DEBUG("%s created virtual id:%u\n", plane->name, plane->base.id);
>>>>> +
>>>>>     	return plane;
>>>>>     }
>>>>> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.h b/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.h
>>>>> index a3ae45dc95d0..15f7d60d8b85 100644
>>>>> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.h
>>>>> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_plane.h
>>>>> @@ -30,6 +30,7 @@
>>>>>      * @plane_fetch_bw: calculated BW per plane
>>>>>      * @plane_clk: calculated clk per plane
>>>>>      * @needs_dirtyfb: whether attached CRTC needs pixel data explicitly flushed
>>>>> + * @saved_fmt: format used by the plane's FB, saved for for virtual plane support
>>>>>      */
>>>>>     struct dpu_plane_state {
>>>>>     	struct drm_plane_state base;
>>>>> @@ -46,6 +47,8 @@ struct dpu_plane_state {
>>>>>     	u64 plane_clk;
>>>>>     	bool needs_dirtyfb;
>>>>> +
>>>>> +	const struct dpu_format *saved_fmt;
>>>>>     };
>>>>
>>>> Why is saved_fmt needed?
>>>>
>>>> The use-case which comes to my mind is lets say if we have a RGB format and
>>>> we need to switch to a YUV format, basically switch from DMA to ViG SSPP,
>>>> then yes we have to mark planes_changed as we need to switch the underlying
>>>> SSPP that time, but why cant we simply check that by means of a check to see
>>>> if the fmt is YUV and whether CSC block is present in the SSPP.
>>>
>>> Yes, correct. And vice versa, going from YUV to RGB might free the VIG
>>> SSPP.
>>>
>>>>
>>>> This will lead to dpu_crtc_reassign_planes() getting called for format
>>>> changes even when the new format might be available in the same SSPP.
>>>
>>> So use 'needs_vig' instead of storing the format? Sounds good to me.
>>>
>>
>> Yes thats the idea. Basically "needs_reassignment". You could even go from
>> Vig to DMA if the use-case can just use DMA to save up Vig.
>>
>> Also, do we really need to cache anything in the plane state to track this?
>>
>> If we have a function called dpu_crtc_needs_plane_reassignment() will go
>> through the current plane state and the current SSPP from the global state
>> and see if needs reassignment.
> 
> No, looking at the global state won't be possible here. I'd have to lock
> the private object before consulting it, which might cause EDEADLOCK
> later on during resource reallocation. So all necessary information
> should be stored in the dpu_plane_state.
> 

But you are already calling dpu_kms_get_global_state() in 
dpu_crtc_reassign_planes().

>>
>>>>
>>>>>     #define to_dpu_plane_state(x) \
>>>>> @@ -75,6 +78,16 @@ struct drm_plane *dpu_plane_init(struct drm_device *dev,
>>>>>     		uint32_t pipe, enum drm_plane_type type,
>>>>>     		unsigned long possible_crtcs);
>>>>> +/**
>>>>> + * dpu_plane_init_virtual - create new dpu virtualized plane
>>>>> + * @dev:   Pointer to DRM device
>>>>> + * @type:  Plane type - PRIMARY/OVERLAY/CURSOR
>>>>> + * @possible_crtcs: bitmask of crtc that can be attached to the given pipe
>>>>> + */
>>>>> +struct drm_plane *dpu_plane_init_virtual(struct drm_device *dev,
>>>>> +					 enum drm_plane_type type,
>>>>> +					 unsigned long possible_crtcs);
>>>>> +
>>>>>     /**
>>>>>      * dpu_plane_color_fill - enables color fill on plane
>>>>>      * @plane:  Pointer to DRM plane object
>>>>> @@ -91,4 +104,10 @@ void dpu_plane_danger_signal_ctrl(struct drm_plane *plane, bool enable);
>>>>>     static inline void dpu_plane_danger_signal_ctrl(struct drm_plane *plane, bool enable) {}
>>>>>     #endif
>>>>> +int dpu_assign_plane_resources(struct dpu_global_state *global_state,
>>>>> +			       struct drm_atomic_state *state,
>>>>> +			       struct drm_crtc *crtc,
>>>>> +			       struct drm_plane_state **states,
>>>>> +			       unsigned int num_planes);
>>>>> +
>>>>>     #endif /* _DPU_PLANE_H_ */
>>>>> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_rm.c b/drivers/gpu/drm/msm/disp/dpu1/dpu_rm.c
>>>>> index 44938ba7a2b7..7264a4d44a14 100644
>>>>> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_rm.c
>>>>> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_rm.c
>>>>> @@ -694,6 +694,83 @@ int dpu_rm_reserve(
>>>>>     	return ret;
>>>>>     }
>>>>> +struct dpu_hw_sspp *dpu_rm_reserve_sspp(struct dpu_rm *rm,
>>>>> +					struct dpu_global_state *global_state,
>>>>> +					struct drm_crtc *crtc,
>>>>> +					struct dpu_rm_sspp_requirements *reqs)
>>>>> +{
>>>>> +	uint32_t crtc_id = crtc->base.id;
>>>>> +	unsigned int weight, best_weght = UINT_MAX;
>>>>
>>>> best_weight?
>>>
>>> Yes
>>>
>>>>
>>>>> +	struct dpu_hw_sspp *hw_sspp;
>>>>> +	unsigned long mask = 0;
>>>>> +	int i, best_idx = -1;
>>>>> +
>>>>> +	/*
>>>>> +	 * Don't take cursor feature into consideration until there is proper support for SSPP_CURSORn.
>>>>> +	 */
>>>>> +	mask |= BIT(DPU_SSPP_CURSOR);
>>>>> +
>>>>> +	if (reqs->scale)
>>>>> +		mask |= BIT(DPU_SSPP_SCALER_RGB) |
>>>>> +			BIT(DPU_SSPP_SCALER_QSEED2) |
>>>>> +			BIT(DPU_SSPP_SCALER_QSEED3_COMPATIBLE);
>>>>> +
>>>>> +	if (reqs->yuv)
>>>>> +		mask |= BIT(DPU_SSPP_CSC) |
>>>>> +			BIT(DPU_SSPP_CSC_10BIT);
>>>>> +
>>>>> +	if (reqs->rot90)
>>>>> +		mask |= BIT(DPU_SSPP_INLINE_ROTATION);
>>>>> +
>>>>> +	for (i = 0; i < ARRAY_SIZE(rm->hw_sspp); i++) {
>>>>> +		if (!rm->hw_sspp[i])
>>>>> +			continue;
>>>>> +
>>>>> +		if (global_state->sspp_to_crtc_id[i])
>>>>> +			continue;
>>>>> +
>>>>> +		hw_sspp = rm->hw_sspp[i];
>>>>> +
>>>>> +		/* skip incompatible planes */
>>>>> +		if (reqs->scale && !hw_sspp->cap->sblk->scaler_blk.len)
>>>>> +			continue;
>>>>> +
>>>>> +		if (reqs->yuv && !hw_sspp->cap->sblk->csc_blk.len)
>>>>> +			continue;
>>>>> +
>>>>> +		if (reqs->rot90 && !(hw_sspp->cap->features & DPU_SSPP_INLINE_ROTATION))
>>>>> +			continue;
>>>>> +
>>>>> +		/*
>>>>> +		 * For non-yuv, non-scaled planes prefer simple (DMA or RGB)
>>>>> +		 * plane, falling back to VIG only if there are no such planes.
>>>>> +		 *
>>>>> +		 * This way we'd leave VIG sspps to be later used for YUV formats.
>>>>> +		 */
>>>>> +		weight = hweight64(hw_sspp->cap->features & ~mask);
>>>>
>>>> This approach is assuming that ViG feature masks are more than DMA.
>>>> Hence the hweight of DMA SSPP's features is less than hweight of ViG SSPPs.
>>>>
>>>> Is this really true? Because there are other bits such as DMA_SDM845_MASK
>>>> which might increase the hweight of DMA SSPPs
>>>
>>> Which bits are present in the DMA mask, which are not present in the VIG
>>> mask? Also for the older platforms there are three kinds of planes: VIG,
>>> DMA and RGB. The selection algorithm should not require significant
>>> changes to support that case.
>>>
>>
>> DMA_SDM845_MASK has DPU_SSPP_QOS_8LVL which is not there in VIG_MSM8998_MASK
>> afaict. But we really cannot be counting the number of feature bits and
>> going by that.
> 
> MSM8998 uses DMA_MSM8998_MASK, not DMA_SDM845_MASK.
> 
>> Hence, inherently, going by hweight is not right because whenever we add a
>> catalog change to add a new feature bit to SSPP, we have to come back here
>> and make sure this logic will not break.
>>>>
>>>> I would rather make it simpler.
>>>>
>>>> 1) if we need scaling || yuv, then we have to get only a Vig
>>>> 2) else, first try to get a DMA SSPP
>>>
>>> How would you distinguish between VIG and DMA?
>>>
>>
>> the type SSPP_TYPE_VIG OR SSPP_TYPE_DMA. We also have a SSPP_TYPE_RGB so
>> that should address your concern about the third type of plane (Vig, DMA,
>> RGB).
> 
> I don't particularly like the idea of using type. We still need to
> evaluate plane's features. Consider QCM2290, where VIG planes do not
> support scaling.
> 
> I will evaluate if I can rework this part to use type, while still
> checking for the feature bit. BTW: should we prefer RGB or DMA plane if
> all other conditions are met?
> 

Ok, qcm2290 really seems like an odd case but point taken.

I am fine if it needs to be a combination of type and feature bit but 
certainly not hweight of feature bit. If you want to use type along with 
presence of scaler blk feature bit thats fine.

I need to check if there is any feature loss in RGB Vs DMA. Let me check 
and get back. This needs some history digging.

>>
>>
>>>> 3) if (2) fails, we have to try to get a ViG SSPP.
>>>>
>>>> Lets be more explicit about the SSPP type here rather than using hweight.
>>>>
>>>>
>>>>> +		if (weight < best_weght) {
>>>>> +			best_weght = weight;
>>>>> +			best_idx = i;
>>>>> +		}
>>>>> +	}
>>>>> +
>>>>> +	if (best_idx < 0)
>>>>> +		return NULL;
>>>>> +
>>>>> +	global_state->sspp_to_crtc_id[best_idx] = crtc_id;
>>>>> +
>>>>> +	return rm->hw_sspp[best_idx];
>>>>> +}
>>>>> +
>>>>> +void dpu_rm_release_all_sspp(struct dpu_global_state *global_state,
>>>>> +			     struct drm_crtc *crtc)
>>>>> +{
>>>>> +	uint32_t crtc_id = crtc->base.id;
>>>>> +
>>>>> +	_dpu_rm_clear_mapping(global_state->sspp_to_crtc_id,
>>>>> +		ARRAY_SIZE(global_state->sspp_to_crtc_id), crtc_id);
>>>>> +}
>>>>> +
>>>>>     int dpu_rm_get_assigned_resources(struct dpu_rm *rm,
>>>>>     	struct dpu_global_state *global_state, uint32_t enc_id,
>>>>>     	enum dpu_hw_blk_type type, struct dpu_hw_blk **blks, int blks_size)
>>>>> diff --git a/drivers/gpu/drm/msm/disp/dpu1/dpu_rm.h b/drivers/gpu/drm/msm/disp/dpu1/dpu_rm.h
>>>>> index e63db8ace6b9..bf9110547385 100644
>>>>> --- a/drivers/gpu/drm/msm/disp/dpu1/dpu_rm.h
>>>>> +++ b/drivers/gpu/drm/msm/disp/dpu1/dpu_rm.h
>>>>> @@ -37,6 +37,12 @@ struct dpu_rm {
>>>>>     	struct dpu_hw_blk *cdm_blk;
>>>>>     };
>>>>> +struct dpu_rm_sspp_requirements {
>>>>> +	bool yuv;
>>>>> +	bool scale;
>>>>> +	bool rot90;
>>>>> +};
>>>>> +
>>>>>     /**
>>>>>      * dpu_rm_init - Read hardware catalog and create reservation tracking objects
>>>>>      *	for all HW blocks.
>>>>> @@ -82,6 +88,28 @@ int dpu_rm_reserve(struct dpu_rm *rm,
>>>>>     void dpu_rm_release(struct dpu_global_state *global_state,
>>>>>     		struct drm_encoder *enc);
>>>>> +/**
>>>>> + * dpu_rm_reserve_sspp - Reserve the required SSPP for the provided CRTC
>>>>> + * @rm: DPU Resource Manager handle
>>>>> + * @global_state: private global state
>>>>> + * @crtc: DRM CRTC handle
>>>>> + * @reqs: SSPP required features
>>>>> + */
>>>>> +struct dpu_hw_sspp *dpu_rm_reserve_sspp(struct dpu_rm *rm,
>>>>> +					struct dpu_global_state *global_state,
>>>>> +					struct drm_crtc *crtc,
>>>>> +					struct dpu_rm_sspp_requirements *reqs);
>>>>> +
>>>>> +/**
>>>>> + * dpu_rm_release_all_sspp - Given the CRTC, release all SSPP
>>>>> + *	blocks previously reserved for that use case.
>>>>> + * @rm: DPU Resource Manager handle
>>>>> + * @crtc: DRM CRTC handle
>>>>> + * @Return: 0 on Success otherwise -ERROR
>>>>> + */
>>>>
>>>> This is void so does not return anything?
> 
> Yes
> 
>>>>
>>>>> +void dpu_rm_release_all_sspp(struct dpu_global_state *global_state,
>>>>> +			     struct drm_crtc *crtc);
>>>>> +
>>>>>     /**
>>>>>      * Get hw resources of the given type that are assigned to this encoder.
>>>>>      */
>>>
>