[PATCH v5 1/4] clk: renesas: rzv2h-cpg: Add support for DSI clocks
Lad, Prabhakar
prabhakar.csengg at gmail.com
Tue May 27 21:50:49 UTC 2025
Hi Geert,
Thank you for the review.
On Fri, May 23, 2025 at 3:45 PM Geert Uytterhoeven <geert at linux-m68k.org> wrote:
>
> Hi Prabhakar, Fabrizio,
>
> On Mon, 12 May 2025 at 20:43, Prabhakar <prabhakar.csengg at gmail.com> wrote:
> > From: Lad Prabhakar <prabhakar.mahadev-lad.rj at bp.renesas.com>
> >
> > Add support for PLLDSI and PLLDSI divider clocks.
> >
> > Introduce the `renesas-rzv2h-dsi.h` header to centralize and share
> > PLLDSI-related data structures, limits, and algorithms between the RZ/V2H
> > CPG and DSI drivers.
> >
> > The DSI PLL is functionally similar to the CPG's PLLDSI, but has slightly
> > different parameter limits and omits the programmable divider present in
> > CPG. To ensure precise frequency calculations-especially for milliHz-level
> > accuracy needed by the DSI driver-the shared algorithm allows both drivers
> > to compute PLL parameters consistently using the same logic and input
> > clock.
> >
> > Co-developed-by: Fabrizio Castro <fabrizio.castro.jz at renesas.com>
> > Signed-off-by: Fabrizio Castro <fabrizio.castro.jz at renesas.com>
> > Signed-off-by: Lad Prabhakar <prabhakar.mahadev-lad.rj at bp.renesas.com>
>
> Thanks for your patch!
>
> > --- a/drivers/clk/renesas/rzv2h-cpg.c
> > +++ b/drivers/clk/renesas/rzv2h-cpg.c
> > @@ -48,6 +53,7 @@
> > #define CPG_PLL_STBY(x) ((x))
> > #define CPG_PLL_STBY_RESETB BIT(0)
> > #define CPG_PLL_STBY_RESETB_WEN BIT(16)
> > +#define CPG_PLL_STBY_SSCGEN_WEN BIT(18)
>
> CPG_PLL_STBY_SSC_EN_WEN?
>
OK, I will rename it as above.
> > #define CPG_PLL_CLK1(x) ((x) + 0x004)
> > #define CPG_PLL_CLK1_KDIV(x) ((s16)FIELD_GET(GENMASK(31, 16), (x)))
>
> You are already using FIELD_GET() for extracting the K, M, P, and
> S fields, but are still still open-coding shifts for writing in
> rzv2h_cpg_pll_set_rate().
>
> What about replacing CPG_PLL_CLK1_KDIV() by:
>
> #define CPG_PLL_CLK1_DIV_K GENMASK(31,16)
>
> Then the code can use:
>
> (s16)FIELD_GET(CPG_PLL_CLK1_DIV_K, clk1)
>
> for reading and:
>
> FIELD_PREP(CPG_PLL_CLK1_DIV_K, (u16)k) | ...
>
> for writing?
>
> Same for the M, P, and S fields (but without the s16/u16 casts, as
> they are unsigned, unlike K).
>
Ok, I will update the macros as below:
#define CPG_PLL_CLK1_KDIV GENMASK(31, 16)
#define CPG_PLL_CLK1_MDIV GENMASK(15, 6)
#define CPG_PLL_CLK1_PDIV GENMASK(5, 0)
#define CPG_PLL_CLK2_SDIV GENMASK(2, 0)
> > #define CPG_PLL_CLK1_MDIV(x) FIELD_GET(GENMASK(15, 6), (x))
>
> > @@ -198,6 +227,188 @@ static int rzv2h_cpg_pll_clk_enable(struct clk_hw *hw)
> > return ret;
> > }
> >
> > +static unsigned long rzv2h_cpg_plldsi_div_recalc_rate(struct clk_hw *hw,
> > + unsigned long parent_rate)
> > +{
> > + struct rzv2h_plldsi_div_clk *dsi_div = to_plldsi_div_clk(hw);
> > + struct rzv2h_cpg_priv *priv = dsi_div->priv;
> > + struct ddiv ddiv = dsi_div->ddiv;
> > + u32 div;
> > +
> > + div = readl(priv->base + ddiv.offset);
> > + div >>= ddiv.shift;
> > + div &= clk_div_mask(ddiv.width);
> > + div = dsi_div->dtable[div].div;
> > +
> > + return DIV_ROUND_CLOSEST_ULL(parent_rate, div);
> > +}
> > +
> > +static int rzv2h_cpg_plldsi_div_determine_rate(struct clk_hw *hw,
> > + struct clk_rate_request *req)
> > +{
> > + struct rzv2h_plldsi_div_clk *dsi_div = to_plldsi_div_clk(hw);
> > + struct rzv2h_cpg_priv *priv = dsi_div->priv;
> > + struct rzv2h_plldsi_parameters *dsi_dividers = &priv->plldsi_div_parameters;
> > + u64 rate_millihz;
> > +
> > + /*
> > + * Adjust the requested clock rate (`req->rate`) to ensure it falls within
> > + * the supported range of 5.44 MHz to 187.5 MHz.
> > + */
> > + req->rate = clamp(req->rate, 5440000UL, 187500000UL);
> > +
> > + rate_millihz = mul_u32_u32(req->rate, MILLI);
> > + if (rate_millihz == dsi_dividers->error_millihz + dsi_dividers->freq_millihz)
> > + goto exit_determine_rate;
> > +
> > + if (!rzv2h_dsi_get_pll_parameters_values(priv->dsi_limits,
> > + dsi_dividers, rate_millihz)) {
> > + dev_err(priv->dev,
> > + "failed to determine rate for req->rate: %lu\n",
> > + req->rate);
> > + return -EINVAL;
> > + }
> > +
> > +exit_determine_rate:
> > + req->best_parent_rate = req->rate * dsi_dividers->csdiv;
>
> Shouldn't this also update req->rate with the actual rate?
>
> req->rate = DIV_ROUND_CLOSEST_ULL(dsi_dividers->freq_millihz, MILLI);
>
Agreed, I will update it.
> Would it help the DSI driver if this clock would provide a
> .recalc_accuracy() callback that takes into account the difference
> between req->rate and dsi_dividers->freq_millihz?
> Or would that be considered abuse of the accuracy concept?
>
Our understanding is that this describes how precisely a clock keeps
time. A clock with 1 ppb accuracy will gain or lose one second in
approximately 31.5 million seconds (1 year). In our case the meaning
is completely different.
> > +
> > + return 0;
> > +};
> > +
> > +static int rzv2h_cpg_plldsi_div_set_rate(struct clk_hw *hw,
> > + unsigned long rate,
> > + unsigned long parent_rate)
> > +{
> > + struct rzv2h_plldsi_div_clk *dsi_div = to_plldsi_div_clk(hw);
> > + struct rzv2h_cpg_priv *priv = dsi_div->priv;
> > + struct rzv2h_plldsi_parameters *dsi_dividers = &priv->plldsi_div_parameters;
> > + struct ddiv ddiv = dsi_div->ddiv;
> > + const struct clk_div_table *clkt;
> > + bool div_found = false;
> > + u32 val, shift, div;
> > +
> > + div = dsi_dividers->csdiv;
> > + for (clkt = dsi_div->dtable; clkt->div; clkt++) {
> > + if (clkt->div == div) {
> > + div_found = true;
> > + break;
> > + }
> > + }
> > +
> > + if (!div_found)
> > + return -EINVAL;
> > +
> > + shift = ddiv.shift;
> > + val = readl(priv->base + ddiv.offset) | DDIV_DIVCTL_WEN(shift);
> > + val &= ~(clk_div_mask(ddiv.width) << shift);
> > + val |= (u32)clkt->val << shift;
>
> No need for the cast.
>
Agreed, I will drop it.
> > + writel(val, priv->base + ddiv.offset);
> > +
> > + return 0;
> > +};
> > +
> > +static const struct clk_ops rzv2h_cpg_plldsi_div_ops = {
> > + .recalc_rate = rzv2h_cpg_plldsi_div_recalc_rate,
> > + .determine_rate = rzv2h_cpg_plldsi_div_determine_rate,
> > + .set_rate = rzv2h_cpg_plldsi_div_set_rate,
> > +};
>
> > +static long rzv2h_cpg_plldsi_round_rate(struct clk_hw *hw,
> > + unsigned long rate,
> > + unsigned long *parent_rate)
> > +{
> > + return clamp(rate, 25000000UL, 375000000UL);
>
> This only brings the desired rate into the supported range, but does
> not round it to the nearest rate that is actually supported.
>
Agreed. Actually I'll replace round rate with .determine_rate() for
the next iteration.
> > +}
> > +
> > +static int rzv2h_cpg_pll_set_rate(struct clk_hw *hw,
> > + unsigned long rate,
> > + unsigned long parent_rate)
> > +{
> > + struct pll_clk *pll_clk = to_pll(hw);
> > + struct rzv2h_cpg_priv *priv = pll_clk->priv;
> > + struct rzv2h_plldsi_parameters *dsi_dividers;
> > + struct pll pll = pll_clk->pll;
> > + u16 offset = pll.offset;
> > + u32 val;
> > + int ret;
> > +
> > + /* Put PLL into standby mode */
> > + writel(CPG_PLL_STBY_RESETB_WEN, priv->base + CPG_PLL_STBY(offset));
> > + ret = readl_poll_timeout_atomic(priv->base + CPG_PLL_MON(offset),
> > + val, !(val & CPG_PLL_MON_LOCK),
> > + 100, 2000);
> > + if (ret) {
> > + dev_err(priv->dev, "Failed to put PLLDSI into standby mode");
> > + return ret;
> > + }
> > +
> > + dsi_dividers = &priv->plldsi_div_parameters;
> > + /* Output clock setting 1 */
> > + writel((dsi_dividers->k << 16) | (dsi_dividers->m << 6) | (dsi_dividers->p),
>
> This is where you want to use FIELD_PREP().
>
Ok, I'll replace it with below:
writel(FIELD_PREP(CPG_PLL_CLK1_KDIV, (u16)dsi_dividers->k) |
FIELD_PREP(CPG_PLL_CLK1_MDIV, dsi_dividers->m) |
FIELD_PREP(CPG_PLL_CLK1_PDIV, dsi_dividers->p),
priv->base + CPG_PLL_CLK1(offset));
> > + priv->base + CPG_PLL_CLK1(offset));
> > +
> > + /* Output clock setting 2 */
> > + val = readl(priv->base + CPG_PLL_CLK2(offset));
> > + writel((val & ~GENMASK(2, 0)) | dsi_dividers->s,
>
> (val & ~CPG_PLL_CLK2_DIV_S) | FIELD_PREP(...)
>
Agreed, I will replace it with `writel((val & ~CPG_PLL_CLK2_SDIV) |
FIELD_PREP(CPG_PLL_CLK2_SDIV, dsi_dividers->s),`
> > + priv->base + CPG_PLL_CLK2(offset));
> > +
> > + /* Put PLL to normal mode */
> > + writel(CPG_PLL_STBY_RESETB_WEN | CPG_PLL_STBY_RESETB,
> > + priv->base + CPG_PLL_STBY(offset));
> > +
> > + /* PLL normal mode transition, output clock stability check */
> > + ret = readl_poll_timeout_atomic(priv->base + CPG_PLL_MON(offset),
> > + val, (val & CPG_PLL_MON_LOCK),
> > + 100, 2000);
> > + if (ret) {
> > + dev_err(priv->dev, "Failed to put PLLDSI into normal mode");
> > + return ret;
> > + }
> > +
> > + return 0;
> > +};
> > +
> > static unsigned long rzv2h_cpg_pll_clk_recalc_rate(struct clk_hw *hw,
> > unsigned long parent_rate)
> > {
>
> > --- a/drivers/clk/renesas/rzv2h-cpg.h
> > +++ b/drivers/clk/renesas/rzv2h-cpg.h
> > @@ -100,6 +100,7 @@ struct smuxed {
> > #define CPG_CDDIV3 (0x40C)
> > #define CPG_CDDIV4 (0x410)
> > #define CPG_CSDIV0 (0x500)
> > +#define CPG_CSDIV1 (0x504)
>
> Unused until [PATCH v5 2/4], so please move it there.
>
I'll move this patch 2/4
> >
> > #define CDDIV0_DIVCTL1 DDIV_PACK(CPG_CDDIV0, 4, 3, 1)
> > #define CDDIV0_DIVCTL2 DDIV_PACK(CPG_CDDIV0, 8, 3, 2)
>
> > --- /dev/null
> > +++ b/include/linux/clk/renesas-rzv2h-dsi.h
> > @@ -0,0 +1,211 @@
> > +/* SPDX-License-Identifier: GPL-2.0 */
> > +/*
> > + * Renesas RZ/V2H(P) DSI CPG helper
> > + *
> > + * Copyright (C) 2025 Renesas Electronics Corp.
> > + */
> > +#ifndef __RENESAS_RZV2H_DSI_H__
> > +#define __RENESAS_RZV2H_DSI_H__
> > +
> > +#include <linux/limits.h>
> > +#include <linux/math.h>
> > +#include <linux/math64.h>
> > +#include <linux/units.h>
> > +
> > +#define OSC_CLK_IN_MEGA (24 * MEGA)
> > +
> > +struct rzv2h_pll_div_limits {
> > + struct {
> > + u32 min;
> > + u32 max;
> > + } fvco;
> > +
> > + struct {
> > + u16 min;
> > + u16 max;
> > + } m;
> > +
> > + struct {
> > + u8 min;
> > + u8 max;
> > + } p;
> > +
> > + struct {
> > + u8 min;
> > + u8 max;
> > + } s;
> > +
> > + struct {
> > + s16 min;
> > + s16 max;
> > + } k;
> > +
> > + struct {
> > + u8 min;
> > + u8 max;
> > + } csdiv;
> > +};
> > +
> > +struct rzv2h_plldsi_parameters {
> > + u64 freq_millihz;
> > + s64 error_millihz;
> > + u16 m;
> > + s16 k;
> > + u8 csdiv;
> > + u8 p;
> > + u8 s;
> > +};
> > +
> > +#define RZV2H_CPG_PLL_DSI_LIMITS(name) \
> > + static const struct rzv2h_pll_div_limits (name) = { \
> > + .fvco = { .min = 1600 * MEGA, .max = 3200 * MEGA }, \
> > + .m = { .min = 64, .max = 533 }, \
> > + .p = { .min = 1, .max = 4 }, \
> > + .s = { .min = 0, .max = 6 }, \
> > + .k = { .min = -32768, .max = 32767 }, \
> > + .csdiv = { .min = 2, .max = 32 }, \
> > + } \
> > +
> > +/**
> > + * rzv2h_dsi_get_pll_parameters_values - Finds the best combination of PLL parameters
> > + * and divider value for a given frequency.
> > + *
> > + * @limits: Pointer to the structure containing the limits for the PLL parameters and
> > + * divider values
> > + * @pars: Pointer to the structure where the best calculated PLL parameters and divider
> > + * values will be stored
> > + * @freq_millihz: Target output frequency in millihertz
> > + *
> > + * This function calculates the best set of PLL parameters (M, K, P, S) and divider
> > + * value (CSDIV) to achieve the desired frequency.
> > + * There is no direct formula to calculate the PLL parameters and the divider value,
> > + * as it's an open system of equations, therefore this function uses an iterative
> > + * approach to determine the best solution. The best solution is one that minimizes
> > + * the error (desired frequency - actual frequency).
> > + *
> > + * Return: true if a valid set of divider values is found, false otherwise.
> > + */
> > +static __maybe_unused bool
> > +rzv2h_dsi_get_pll_parameters_values(const struct rzv2h_pll_div_limits *limits,
> > + struct rzv2h_plldsi_parameters *pars,
> > + u64 freq_millihz)
> > +{
> > + struct rzv2h_plldsi_parameters p, best;
> > +
> > + /* Initialize best error to maximum possible value */
> > + best.error_millihz = S64_MAX;
> > +
> > + for (p.csdiv = limits->csdiv.min; p.csdiv <= limits->csdiv.max; p.csdiv += 2) {
> > + for (p.p = limits->p.min; p.p <= limits->p.max; p.p++) {
> > + u32 fref = OSC_CLK_IN_MEGA / p.p;
> > +
> > + for (p.s = limits->s.min; p.s <= limits->s.max; p.s++) {
> > + u16 two_pow_s = 1 << p.s;
> > + u16 divider = two_pow_s * p.csdiv;
>
> No need for two_pow_s. You can initialize divider = p.csdiv << s.min
> at the start of the loop, and multiply by two after each iteration.
>
Agreed, I'll replace it with something like below:
for (divider = p.csdiv << limits->s.min, p.s =
limits->s.min;
p.s <= limits->s.max; p.s++, divider *= 2) {
> > +
> > + for (p.m = limits->m.min; p.m <= limits->m.max; p.m++) {
> > + u64 output_m, output_k_range;
> > + s64 pll_k, output_k;
> > + u64 fvco, output;
> > +
> > + /*
> > + * The frequency generated by the combination of the
> > + * PLL + divider is calculated as follows:
> > + *
> > + * Freq = Ffout / csdiv
> > + *
> > + * With:
> > + * Ffout = Ffvco / 2^(pll_s)
> > + * Ffvco = (pll_m + (pll_k / 65536)) * Ffref
> > + * Ffref = 24MHz / pll_p
> > + *
> > + * Freq can also be rewritten as:
> > + * Freq = Ffvco / (2^(pll_s) * csdiv))
> > + * = Ffvco / divider
> > + * = (pll_m * Ffref) / divider + ((pll_k / 65536) * Ffref) / divider
> > + * = output_m + output_k
> > + *
> > + * Every parameter has been determined at this point, but pll_k.
> > + * Considering that:
> > + * -32768 <= pll_k <= 32767
> > + * Then:
> > + * -0.5 <= (pll_k / 65536) < 0.5
> > + * Therefore:
> > + * -Ffref / (2 * divider) <= output_k < Ffref / (2 * divider)
> > + */
> > +
> > + /* Compute output M component (in mHz) */
> > + output_m = DIV_ROUND_CLOSEST_ULL(p.m * fref * 1000ULL,
>
> "p.m * fref" may overflow => mul_u32_u32(p.m, fref) * MILLI;
>
Agreed, I'll switch to mul_u32_u32().
> > + divider);
> > + /* Compute range for output K (in mHz) */
> > + output_k_range = DIV_ROUND_CLOSEST_ULL(fref * 1000ULL,
>
> mul_u32_u32(fref, MILLI)
>
OK.
> > + divider * 2);
> > + /*
> > + * No point in continuing if we can't achieve the
> > + * desired frequency
> > + */
> > + if (freq_millihz < (output_m - output_k_range) ||
> > + freq_millihz >= (output_m + output_k_range))
> > + continue;
> > +
> > + /*
> > + * Compute the K component
> > + *
> > + * Since:
> > + * Freq = output_m + output_k
> > + * Then:
> > + * output_k = Freq - output_m
> > + * = ((pll_k / 65536) * Ffref) / divider
> > + * Therefore:
> > + * pll_k = (output_k * 65536 * divider) / Ffref
> > + */
> > + output_k = freq_millihz - output_m;
> > + pll_k = div64_s64(output_k * 65536ULL * divider, fref);
>
> div_s64(), as fref is 32-bit.
>
OK.
> > + pll_k = DIV_S64_ROUND_CLOSEST(pll_k, 1000);
>
> MILLI
>
OK.
> > +
> > + /* Validate K value within allowed limits */
> > + if (pll_k < limits->k.min || pll_k > limits->k.max)
> > + continue;
> > +
> > + p.k = pll_k;
> > +
> > + /* Compute (Ffvco * 65536) */
> > + fvco = ((p.m * 65536ULL) + p.k) * fref;
>
> mul_u32(p.m * 65536 + p.k, fref)
>
OK, I'll switch to the above.
> I guess the compiler is sufficiently smart to turn that into a shift.
> The alternative would be to use a cast (I try to avoid them) and a shift:
>
> mul_u32((u64)p.m << 16 + p.k, fref)
>
> > + if ((fvco < (limits->fvco.min * 65536ULL)) ||
> > + (fvco > (limits->fvco.max * 65536ULL)))
>
> mul_u32_u32(..., 65536) for both
>
Agreed.
> > + continue;
> > +
> > + /* PLL_M component of (output * 65536 * PLL_P) */
> > + output = p.m * 65536ULL * OSC_CLK_IN_MEGA;
>
> mul_u32(p.m * 65536, OSC_CLK_IN_MEGA)
>
OK, I'll switch to mul_u32_u32().
> > + /* PLL_K component of (output * 65536 * PLL_P) */
> > + output += p.k * OSC_CLK_IN_MEGA;
> > + /* Make it in mHz */
> > + output *= 1000ULL;
>
> No need for the ULL => MILLI
Agreed, I will drop it.
>
> > + output /= 65536ULL * p.p * divider;
>
> mul_u32()
>
> No rounding for the division?
>
Indeed, we missed it. Actually I drop the mul_u32_u32() suggestion
just use the below:
output = DIV_U64_ROUND_CLOSEST(output, 65536 * p.p * divider);
Cheers,
Prabhakar
More information about the dri-devel
mailing list