[PATCH 17/29] drm/amdkfd: Add debug trap enabled flag to TMA
Kim, Jonathan
Jonathan.Kim at amd.com
Thu Nov 24 20:27:58 UTC 2022
[AMD Official Use Only - General]
> -----Original Message-----
> From: Kuehling, Felix <Felix.Kuehling at amd.com>
> Sent: November 24, 2022 11:24 AM
> To: Kim, Jonathan <Jonathan.Kim at amd.com>; amd-
> gfx at lists.freedesktop.org
> Subject: Re: [PATCH 17/29] drm/amdkfd: Add debug trap enabled flag to
> TMA
>
>
> Am 2022-11-24 um 09:51 schrieb Kim, Jonathan:
> > [Public]
> >
> >> -----Original Message-----
> >> From: Kuehling, Felix <Felix.Kuehling at amd.com>
> >> Sent: November 22, 2022 7:45 PM
> >> To: Kim, Jonathan <Jonathan.Kim at amd.com>; amd-
> >> gfx at lists.freedesktop.org
> >> Subject: Re: [PATCH 17/29] drm/amdkfd: Add debug trap enabled flag to
> >> TMA
> >>
> >>
> >> On 2022-10-31 12:23, Jonathan Kim wrote:
> >>> From: Jay Cornwall <jay.cornwall at amd.com>
> >>>
> >>> Trap handler behavior will differ when a debugger is attached.
> >>>
> >>> Make the debug trap flag available in the trap handler TMA.
> >>> Update it when the debug trap ioctl is invoked.
> >>>
> >>> v3: Rebase for upstream
> >>>
> >>> v2:
> >>> Add missing debug flag setup on APUs
> >>>
> >>> Signed-off-by: Jay Cornwall <jay.cornwall at amd.com>
> >>> Reviewed-by: Felix Kuehling <Felix.Kuehling at amd.com>
> >>> Signed-off-by: Jonathan Kim <jonathan.kim at amd.com>
> >>> ---
> >>> drivers/gpu/drm/amd/amdkfd/kfd_debug.c | 4 ++++
> >>> drivers/gpu/drm/amd/amdkfd/kfd_priv.h | 2 ++
> >>> drivers/gpu/drm/amd/amdkfd/kfd_process.c | 16
> ++++++++++++++++
> >>> 3 files changed, 22 insertions(+)
> >>>
> >>> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_debug.c
> >> b/drivers/gpu/drm/amd/amdkfd/kfd_debug.c
> >>> index ae6e701a2656..d4f87f2adada 100644
> >>> --- a/drivers/gpu/drm/amd/amdkfd/kfd_debug.c
> >>> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_debug.c
> >>> @@ -193,6 +193,8 @@ void kfd_dbg_trap_deactivate(struct
> kfd_process
> >> *target, bool unwind, int unwind
> >>> if (unwind && count == unwind_count)
> >>> break;
> >>>
> >>> + kfd_process_set_trap_debug_flag(&pdd->qpd, false);
> >>> +
> >>> /* GFX off is already disabled by debug activate if not RLC
> >> restore supported. */
> >>> if (kfd_dbg_is_rlc_restore_supported(pdd->dev))
> >>> amdgpu_gfx_off_ctrl(pdd->dev->adev, false);
> >>> @@ -278,6 +280,8 @@ int kfd_dbg_trap_activate(struct kfd_process
> >> *target)
> >>> if (kfd_dbg_is_rlc_restore_supported(pdd->dev))
> >>> amdgpu_gfx_off_ctrl(pdd->dev->adev, true);
> >>>
> >>> + kfd_process_set_trap_debug_flag(&pdd->qpd, true);
> >>> +
> >>> r = debug_refresh_runlist(pdd->dev->dqm);
> >>> if (r) {
> >>> target->runtime_info.runtime_state =
> >>> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_priv.h
> >> b/drivers/gpu/drm/amd/amdkfd/kfd_priv.h
> >>> index 9690a2adb9ed..82b28588ab72 100644
> >>> --- a/drivers/gpu/drm/amd/amdkfd/kfd_priv.h
> >>> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_priv.h
> >>> @@ -1101,6 +1101,8 @@ int kfd_init_apertures(struct kfd_process
> >> *process);
> >>> void kfd_process_set_trap_handler(struct qcm_process_device *qpd,
> >>> uint64_t tba_addr,
> >>> uint64_t tma_addr);
> >>> +void kfd_process_set_trap_debug_flag(struct qcm_process_device
> >> *qpd,
> >>> + bool enabled);
> >>>
> >>> /* CWSR initialization */
> >>> int kfd_process_init_cwsr_apu(struct kfd_process *process, struct file
> >> *filep);
> >>> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_process.c
> >> b/drivers/gpu/drm/amd/amdkfd/kfd_process.c
> >>> index 59c4c38833b6..d62e0c62df76 100644
> >>> --- a/drivers/gpu/drm/amd/amdkfd/kfd_process.c
> >>> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_process.c
> >>> @@ -1252,6 +1252,8 @@ int kfd_process_init_cwsr_apu(struct
> >> kfd_process *p, struct file *filep)
> >>> memcpy(qpd->cwsr_kaddr, dev->cwsr_isa, dev-
> >>> cwsr_isa_size);
> >>>
> >>> + kfd_process_set_trap_debug_flag(qpd, p-
> >>> debug_trap_enabled);
> >>> +
> >>> qpd->tma_addr = qpd->tba_addr +
> >> KFD_CWSR_TMA_OFFSET;
> >>> pr_debug("set tba :0x%llx, tma:0x%llx, cwsr_kaddr:%p for
> >> pqm.\n",
> >>> qpd->tba_addr, qpd->tma_addr, qpd->cwsr_kaddr);
> >>> @@ -1288,6 +1290,9 @@ static int
> >> kfd_process_device_init_cwsr_dgpu(struct kfd_process_device *pdd)
> >>> memcpy(qpd->cwsr_kaddr, dev->cwsr_isa, dev->cwsr_isa_size);
> >>>
> >>> + kfd_process_set_trap_debug_flag(&pdd->qpd,
> >>> + pdd->process-
> >>> debug_trap_enabled);
> >>> +
> >>> qpd->tma_addr = qpd->tba_addr + KFD_CWSR_TMA_OFFSET;
> >>> pr_debug("set tba :0x%llx, tma:0x%llx, cwsr_kaddr:%p for pqm.\n",
> >>> qpd->tba_addr, qpd->tma_addr, qpd->cwsr_kaddr);
> >>> @@ -1374,6 +1379,17 @@ bool kfd_process_xnack_mode(struct
> >> kfd_process *p, bool supported)
> >>> return true;
> >>> }
> >>>
> >>> +void kfd_process_set_trap_debug_flag(struct qcm_process_device
> >> *qpd,
> >>> + bool enabled)
> >>> +{
> >>> + /* If TMA doesn't exist then flag will be set during allocation. */
> >> I would expect a change to the TMA allocation function, but that isn't
> >> in this patch?
> > The TMA is allocated under kfd_process_init_cwsr_* and CWSR enabled is
> a pre-condition for the 1st level trap handler loading.
> > The lack of context in the patch for those functions may be hiding that fact.
> > Is the placement of this comment misleading? Maybe it should go in
> kfd_dbg_trap_activate when kfd_process_set_trap_debug_flag is called?
> > Or should it just be removed since the combined calls within initialization of
> CWSR + debug enable seem complete for enablement?
>
> I think the comment is fine. I was sort of expecting to see the
> corresponding change in the TMA allocation in the same patch. So my
> question is just lack of context. If that change in the TMA allocation
> got squashed into another patch in the series, maybe it would make sense
> to move it into this patch instead.
The change to set flag on TMA allocation is done in this patch as kfd_process_set_trap_debug_flag is now called in kfd_process_init_cwsr_*.
To my knowledge, CWSR init and trap handler memory allocation should be atomic and that has been upstreamed for a while.
Did you mean the user trap handler assignment? That should be independent from flagging.
Thanks,
Jon
>
> Regards,
> Felix
>
>
> >
> > Thanks,
> >
> > Jon
> >
> >> Regards,
> >> Felix
> >>
> >>> + if (qpd->cwsr_kaddr) {
> >>> + uint64_t *tma =
> >>> + (uint64_t *)(qpd->cwsr_kaddr +
> >> KFD_CWSR_TMA_OFFSET);
> >>> + tma[2] = enabled;
> >>> + }
> >>> +}
> >>> +
> >>> /*
> >>> * On return the kfd_process is fully operational and will be freed when
> >> the
> >>> * mm is released
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 18865 bytes
Desc: not available
URL: <https://lists.freedesktop.org/archives/amd-gfx/attachments/20221124/4d47583c/attachment-0001.bin>
More information about the amd-gfx
mailing list