[PATCH] drm/xe/xe3: Generate and store the L3 bank mask
Matt Roper
matthew.d.roper at intel.com
Wed Jan 15 15:56:17 UTC 2025
On Wed, Jan 15, 2025 at 12:25:13PM +0100, Francois Dugast wrote:
> Hi,
>
> On Tue, Jan 14, 2025 at 01:29:54PM -0800, Matt Roper wrote:
> > On Tue, Jan 14, 2025 at 12:38:53PM -0800, Matt Atwood wrote:
> > > From: Francois Dugast <francois.dugast at intel.com>
> > >
> > > On Xe3, the register used to indicate which L3 banks are enabled on
> > > the system is a new one called MIRROR_L3BANK_ENABLE. Each bit
> > > represents one bank enabled in each node.
> > > Extend the existing topology code for Xe3 to read this register and
> > > generate the correct L3 bank mask, which can be read by user space
> > > throug the topology query.
> > >
> > > Bspec: 72573, 73439
> > > Signed-off-by: Francois Dugast <francois.dugast at intel.com>
> > > Signed-off-by: Matt Atwood <matthew.s.atwood at intel.com>
> > > ---
> > > drivers/gpu/drm/xe/regs/xe_gt_regs.h | 3 +++
> > > drivers/gpu/drm/xe/xe_gt_topology.c | 16 +++++++++++++---
> > > 2 files changed, 16 insertions(+), 3 deletions(-)
> > >
> > > diff --git a/drivers/gpu/drm/xe/regs/xe_gt_regs.h b/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > > index b4283ac030f4..096859072396 100644
> > > --- a/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > > +++ b/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > > @@ -221,6 +221,9 @@
> > >
> > > #define MIRROR_FUSE1 XE_REG(0x911c)
> > >
> > > +#define MIRROR_L3BANK_ENABLE XE_REG(0x9130)
> > > +#define XE3_L3BANK_ENABLE REG_GENMASK(31, 0)
> > > +
> > > #define XELP_EU_ENABLE XE_REG(0x9134) /* "_DISABLE" on Xe_LP */
> > > #define XELP_EU_MASK REG_GENMASK(7, 0)
> > > #define XELP_GT_SLICE_ENABLE XE_REG(0x9138)
> > > diff --git a/drivers/gpu/drm/xe/xe_gt_topology.c b/drivers/gpu/drm/xe/xe_gt_topology.c
> > > index df2042db7ee6..516c81e3b8dd 100644
> > > --- a/drivers/gpu/drm/xe/xe_gt_topology.c
> > > +++ b/drivers/gpu/drm/xe/xe_gt_topology.c
> > > @@ -129,7 +129,8 @@ static void
> > > load_l3_bank_mask(struct xe_gt *gt, xe_l3_bank_mask_t l3_bank_mask)
> > > {
> > > struct xe_device *xe = gt_to_xe(gt);
> > > - u32 fuse3 = xe_mmio_read32(>->mmio, MIRROR_FUSE3);
> > > + struct xe_mmio *mmio = >->mmio;
> > > + u32 fuse3 = xe_mmio_read32(mmio, MIRROR_FUSE3);
> > >
> > > /*
> > > * PTL platforms with media version 30.00 do not provide proper values
> > > @@ -143,7 +144,16 @@ load_l3_bank_mask(struct xe_gt *gt, xe_l3_bank_mask_t l3_bank_mask)
> > > if (XE_WA(gt, no_media_l3))
> > > return;
> > >
> > > - if (GRAPHICS_VER(xe) >= 20) {
> > > + if (GRAPHICS_VER(xe) >= 30) {
> > > + xe_l3_bank_mask_t per_node = {};
> > > + u32 meml3_en = REG_FIELD_GET(XE2_NODE_ENABLE_MASK, fuse3);
> > > + u32 mirror_l3bank_enable = xe_mmio_read32(mmio, MIRROR_L3BANK_ENABLE);
> > > + u32 bank_val = REG_FIELD_GET(XE3_L3BANK_ENABLE, mirror_l3bank_enable);
> > > +
> > > + bitmap_from_arr32(per_node, &bank_val, 32);
> >
> > Doesn't each bit in L3BANK_ENABLE represent a pair of banks rather than
> > a single bank? It doesn't look like that's accounted for here.
>
> On this platform, MEML3_EN represents enabled nodes, which can be 0x1 (1 node)
> or 0x3 (2 nodes). L3BANK_ENABLE represents enabled banks in each enabled node.
>
> If 2 nodes are enabled, each bit in L3BANK_ENABLE will account for 2 banks in
> the resulting mask, one in each bank, thanks to gen_l3_mask_from_pattern().
>
> For example, if 2 nodes are enabled with 8 banks each then:
> meml3_en = 0x3
> per_node = 0xf
> l3_bank_mask = 0x0000000f0000000f
>
> This seems correct.
Hmm, it seems the two bspec pages listed on this patch are inconsistent.
73439 indicates that each bit refers to a "bank pair" rather than single
bits. However 72573 indicates "1 bank in each node." The latter is
likely the correct one, but we may want to file a ticket to get the
spec's description fixed too, otherwise it's going to cause more
confusion.
Aside from that, things look good.
Reviewed-by: Matt Roper <matthew.d.roper at intel.com>
>
> Francois
>
> >
> >
> > Matt
> >
> > > + gen_l3_mask_from_pattern(xe, l3_bank_mask, per_node, 32,
> > > + meml3_en);
> > > + } else if (GRAPHICS_VER(xe) >= 20) {
> > > xe_l3_bank_mask_t per_node = {};
> > > u32 meml3_en = REG_FIELD_GET(XE2_NODE_ENABLE_MASK, fuse3);
> > > u32 bank_val = REG_FIELD_GET(XE2_GT_L3_MODE_MASK, fuse3);
> > > @@ -155,7 +165,7 @@ load_l3_bank_mask(struct xe_gt *gt, xe_l3_bank_mask_t l3_bank_mask)
> > > xe_l3_bank_mask_t per_node = {};
> > > xe_l3_bank_mask_t per_mask_bit = {};
> > > u32 meml3_en = REG_FIELD_GET(MEML3_EN_MASK, fuse3);
> > > - u32 fuse4 = xe_mmio_read32(>->mmio, XEHP_FUSE4);
> > > + u32 fuse4 = xe_mmio_read32(mmio, XEHP_FUSE4);
> > > u32 bank_val = REG_FIELD_GET(GT_L3_EXC_MASK, fuse4);
> > >
> > > bitmap_set_value8(per_mask_bit, 0x3, 0);
> > > --
> > > 2.45.0
> > >
> >
> > --
> > Matt Roper
> > Graphics Software Engineer
> > Linux GPU Platform Enablement
> > Intel Corporation
--
Matt Roper
Graphics Software Engineer
Linux GPU Platform Enablement
Intel Corporation
More information about the Intel-xe
mailing list