[PATCH 06/11] drm/xe/xe2hpg: Determine flat ccs offset for vram
Vivekanandan, Balasubramani
balasubramani.vivekanandan at intel.com
Wed Apr 3 06:53:34 UTC 2024
On 02.04.2024 07:30, Matt Roper wrote:
> On Tue, Apr 02, 2024 at 06:17:19PM +0530, Balasubramani Vivekanandan wrote:
> > From: Himal Prasad Ghimiray <himal.prasad.ghimiray at intel.com>
> >
> > on Xe2 dgfx platform determine the offset using Flat CCS size
> > bitfield of XE2_FLAT_CCS_BASE_RANGE_[UPPER/LOWER] mcr registers.
> >
> > Bspec: 68023
> >
> > Signed-off-by: Himal Prasad Ghimiray <himal.prasad.ghimiray at intel.com>
> > Signed-off-by: Akshata Jahagirdar <akshata.jahagirdar at intel.com>
> > Signed-off-by: Matthew Auld <matthew.auld at intel.com>
> > Signed-off-by: Balasubramani Vivekanandan <balasubramani.vivekanandan at intel.com>
> > ---
> > drivers/gpu/drm/xe/regs/xe_gt_regs.h | 5 ++++
> > drivers/gpu/drm/xe/xe_mmio.c | 39 ++++++++++++++++++++++++++--
> > 2 files changed, 42 insertions(+), 2 deletions(-)
> >
> > diff --git a/drivers/gpu/drm/xe/regs/xe_gt_regs.h b/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > index d5b21f03beaa..ea75c1f0ebd0 100644
> > --- a/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > +++ b/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > @@ -69,6 +69,7 @@
> >
> > #define XEHP_TILE_ADDR_RANGE(_idx) XE_REG_MCR(0x4900 + (_idx) * 4)
> > #define XEHP_FLAT_CCS_BASE_ADDR XE_REG_MCR(0x4910)
> > +#define XEHP_FLAT_CCS_PTR REG_GENMASK(31, 8)
> >
> > #define WM_CHICKEN3 XE_REG_MCR(0x5588, XE_REG_OPTION_MASKED)
> > #define HIZ_PLANE_COMPRESSION_DIS REG_BIT(10)
> > @@ -141,6 +142,10 @@
> >
> > #define XE2_FLAT_CCS_BASE_RANGE_LOWER XE_REG_MCR(0x8800)
> > #define XE2_FLAT_CCS_ENABLE REG_BIT(0)
> > +#define XE2_FLAT_CCS_BASE_LOWER_ADDR_MASK REG_GENMASK(31, 6)
> > +
> > +#define XE2_FLAT_CCS_BASE_RANGE_UPPER XE_REG_MCR(0x8804)
> > +#define XE2_FLAT_CCS_BASE_UPPER_ADDR_MASK REG_GENMASK(7, 0)
> >
> > #define GSCPSMI_BASE XE_REG(0x880c)
> >
> > diff --git a/drivers/gpu/drm/xe/xe_mmio.c b/drivers/gpu/drm/xe/xe_mmio.c
> > index 5d13fc7cb9d2..49a32b03fcf8 100644
> > --- a/drivers/gpu/drm/xe/xe_mmio.c
> > +++ b/drivers/gpu/drm/xe/xe_mmio.c
> > @@ -163,6 +163,42 @@ static int xe_determine_lmem_bar_size(struct xe_device *xe)
> > return 0;
> > }
> >
> > +static inline u64 get_flat_ccs_offset(struct xe_gt *gt, u64 *tile_size)
>
> It doesn't seem to be necessary to pass by reference for tile_size.
Correct, I will change it.
>
> > +{
> > + struct xe_device *xe = gt_to_xe(gt);
> > + u64 offset;
> > + u32 reg;
> > +
> > + if (GRAPHICS_VER(xe) >= 20) {
> > + u64 ccs_size = *tile_size / 512;
>
> This value only gets used for the assertion below, but should we be
> taking the value from 0x8804[23:14] rather than deriving it from
> TILE_ADDR_RANGE?
The intention of the assertion is to verify if the flatccs base address
provided by XE2_FLAT_CCS_BASE_RANGE_* register pair results in flatccs
size which exactly matches the tile_size/512. In other words, it is
validating the XE2_FLAT_CCS_BASE_RANGE_* content.
Therefore getting the flat_Css size from the
XE2_FLAT_CCS_BASE_RANGE_UPPER(0x8804) itself doesn't seem to be the
right approach to me.
>
> > + u64 offset_hi, offset_lo;
> > + u32 nodes, num_enabled;
> > +
> > + reg = xe_mmio_read32(gt, MIRROR_FUSE3);
> > + nodes = REG_FIELD_GET(XE2_NODE_ENABLE_MASK, reg);
> > + num_enabled = hweight32(nodes); /* Number of enabled l3 nodes */
> > +
> > + reg = xe_gt_mcr_unicast_read_any(gt, XE2_FLAT_CCS_BASE_RANGE_LOWER);
> > + offset_lo = REG_FIELD_GET(XE2_FLAT_CCS_BASE_LOWER_ADDR_MASK, reg);
>
> This register also appears to have a general "enable" bit for FlatCCS in
> bit(0). Should we be reading that to make sure FlatCCS hasn't been
> fused off or disabled by the IFWI?
yes, makes sense. I will update the patch.
Regards,
Bala
>
>
> Matt
>
> > +
> > + reg = xe_gt_mcr_unicast_read_any(gt, XE2_FLAT_CCS_BASE_RANGE_UPPER);
> > + offset_hi = REG_FIELD_GET(XE2_FLAT_CCS_BASE_UPPER_ADDR_MASK, reg);
> > +
> > + offset = offset_hi << 32; /* HW view bits 39:32 */
> > + offset |= offset_lo << 6; /* HW view bits 31:6 */
> > + offset *= num_enabled; /* convert to SW view */
> > +
> > + /* We don't expect any holes */
> > + xe_assert_msg(xe, offset == (xe_mmio_read64_2x32(gt, GSMBASE) - ccs_size),
> > + "Hole between CCS and GSM.\n");
> > + } else {
> > + reg = xe_gt_mcr_unicast_read_any(gt, XEHP_FLAT_CCS_BASE_ADDR);
> > + offset = (u64)REG_FIELD_GET(XEHP_FLAT_CCS_PTR, reg) * SZ_64K;
> > + }
> > +
> > + return offset;
> > +}
> > +
> > /**
> > * xe_mmio_tile_vram_size() - Collect vram size and offset information
> > * @tile: tile to get info for
> > @@ -207,8 +243,7 @@ static int xe_mmio_tile_vram_size(struct xe_tile *tile, u64 *vram_size,
> >
> > /* minus device usage */
> > if (xe->info.has_flat_ccs) {
> > - reg = xe_gt_mcr_unicast_read_any(gt, XEHP_FLAT_CCS_BASE_ADDR);
> > - offset = (u64)REG_FIELD_GET(GENMASK(31, 8), reg) * SZ_64K;
> > + offset = get_flat_ccs_offset(gt, tile_size);
> > } else {
> > offset = xe_mmio_read64_2x32(gt, GSMBASE);
> > }
> > --
> > 2.25.1
> >
>
> --
> Matt Roper
> Graphics Software Engineer
> Linux GPU Platform Enablement
> Intel Corporation
More information about the Intel-xe
mailing list