[PATCH v4] drm/sched: Use struct for drm_sched_init() params
Philipp Stanner
phasta at mailbox.org
Wed Jul 9 10:14:54 UTC 2025
On Tue, 2025-07-08 at 14:02 +0100, Tvrtko Ursulin wrote:
>
>
> On 11/02/2025 11:14, Philipp Stanner wrote:
> > drm_sched_init() has a great many parameters and upcoming new
> > functionality for the scheduler might add even more. Generally, the
> > great number of parameters reduces readability and has already
> > caused
> > one missnaming, addressed in:
> >
> > commit 6f1cacf4eba7 ("drm/nouveau: Improve variable name in
> > nouveau_sched_init()").
> >
> > Introduce a new struct for the scheduler init parameters and port
> > all
> > users.
> >
> > Signed-off-by: Philipp Stanner <phasta at kernel.org>
> > Reviewed-by: Liviu Dudau <liviu.dudau at arm.com>
> > Acked-by: Matthew Brost <matthew.brost at intel.com> # for Xe
> > Reviewed-by: Boris Brezillon <boris.brezillon at collabora.com> # for
> > Panfrost and Panthor
> > Reviewed-by: Christian Gmeiner <cgmeiner at igalia.com> # for Etnaviv
> > Reviewed-by: Frank Binns <frank.binns at imgtec.com> # for Imagination
> > Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin at igalia.com> # for Sched
> > Reviewed-by: Maíra Canal <mcanal at igalia.com> # for v3d
> > ---
> > Changes in v4:
> > - Add forgotten driver accel/amdxdna. (Me)
> > - Rephrase the "init to NULL" comments. (Tvrtko)
> > - Apply RBs by Tvrtko and Maira.
> > - Terminate the last struct members with a comma, so that future
> > fields can be added with a minimal patch diff. (Me)
> >
> > Changes in v3:
> > - Various formatting requirements.
> >
> > Changes in v2:
> > - Point out that the hang-limit is deprecated. (Christian)
> > - Initialize the structs to 0 at declaration. (Planet Earth)
> > - Don't set stuff explicitly to 0 / NULL. (Tvrtko)
> > - Make the structs const where possible. (Boris)
> > - v3d: Use just 1, universal, function for sched-init. (Maíra)
> > ---
>
> 8><
>
> > diff --git a/drivers/gpu/drm/panfrost/panfrost_job.c
> > b/drivers/gpu/drm/panfrost/panfrost_job.c
> > index 9b8e82fb8bc4..5657106c2f7d 100644
> > --- a/drivers/gpu/drm/panfrost/panfrost_job.c
> > +++ b/drivers/gpu/drm/panfrost/panfrost_job.c
> > @@ -836,8 +836,16 @@ static irqreturn_t
> > panfrost_job_irq_handler(int irq, void *data)
> >
> > int panfrost_job_init(struct panfrost_device *pfdev)
> > {
> > + struct drm_sched_init_args args = {
> > + .ops = &panfrost_sched_ops,
> > + .num_rqs = DRM_SCHED_PRIORITY_COUNT,
> > + .credit_limit = 2,
> > + .timeout = msecs_to_jiffies(JOB_TIMEOUT_MS),
> > + .timeout_wq = pfdev->reset.wq,
>
> ^^^
>
> > + .name = "pan_js",
> > + .dev = pfdev->dev,
> > + };
> > struct panfrost_job_slot *js;
> > - unsigned int nentries = 2;
> > int ret, j;
> >
> > /* All GPUs have two entries per queue, but without
> > jobchain
> > @@ -845,7 +853,7 @@ int panfrost_job_init(struct panfrost_device
> > *pfdev)
> > * so let's just advertise one entry in that case.
> > */
> > if (!panfrost_has_hw_feature(pfdev,
> > HW_FEATURE_JOBCHAIN_DISAMBIGUATION))
> > - nentries = 1;
> > + args.credit_limit = 1;
> >
> > pfdev->js = js = devm_kzalloc(pfdev->dev, sizeof(*js),
> > GFP_KERNEL);
> > if (!js)
>
> Stumbled on this while looking at drm_sched_init() workqueue usage.
>
> I think this patch might need a fixup. Because somewhere around here
> in
> the code there is this:
>
> pfdev->reset.wq = alloc_ordered_workqueue("panfrost-reset",
> 0);
> if (!pfdev->reset.wq)
> return -ENOMEM;
>
> Which means that after the patch panfrost is using system_wq for the
> timeout handler instead the one it creates.
Ouch yes, that's definitely a very subtle bug. AFAICS it comes to be by
pfdev being initialized to 0.
Let me provide a fix..
P.
>
> > @@ -875,13 +883,7 @@ int panfrost_job_init(struct panfrost_device
> > *pfdev)
> > for (j = 0; j < NUM_JOB_SLOTS; j++) {
> > js->queue[j].fence_context =
> > dma_fence_context_alloc(1);
> >
> > - ret = drm_sched_init(&js->queue[j].sched,
> > - &panfrost_sched_ops, NULL,
> > - DRM_SCHED_PRIORITY_COUNT,
> > - nentries, 0,
> > -
> > msecs_to_jiffies(JOB_TIMEOUT_MS),
> > - pfdev->reset.wq,
> > - NULL, "pan_js", pfdev->dev);
> > + ret = drm_sched_init(&js->queue[j].sched, &args);
>
> ^^^
>
> > if (ret) {
> > dev_err(pfdev->dev, "Failed to create
> > scheduler: %d.", ret);
> > goto err_sched;
>
> Regards,
>
> Tvrtko
>
More information about the amd-gfx
mailing list