<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:DengXian;
panose-1:2 1 6 0 3 1 1 1 1 1;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:"\@DengXian";
panose-1:2 1 6 0 3 1 1 1 1 1;}
@font-face
{font-family:"Bookman Old Style";
panose-1:2 5 6 4 5 5 5 2 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
.MsoChpDefault
{mso-style-type:export-only;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.25in 1.0in 1.25in;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-US" link="blue" vlink="#954F72">
<div class="WordSection1">
<p class="MsoNormal">amdgpu_vm_reserve_vmid doesn’t return the reserved vmid back to user space. There is no chance for user mode driver to update RLC_SPM_VMID.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:"Bookman Old Style",serif">Thanks<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:"Bookman Old Style",serif">Jacob<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div style="mso-element:para-border-div;border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal" style="border:none;padding:0in"><b>From: </b><a href="mailto:Jacob.He@amd.com">He, Jacob</a><br>
<b>Sent: </b>Thursday, February 20, 2020 6:20 PM<br>
<b>To: </b><a href="mailto:David1.Zhou@amd.com">Zhou, David(ChunMing)</a>; <a href="mailto:Christian.Koenig@amd.com">
Koenig, Christian</a>; <a href="mailto:amd-gfx@lists.freedesktop.org">amd-gfx@lists.freedesktop.org</a><br>
<b>Subject: </b>RE: [PATCH] drm/amdgpu: Add a chunk ID for spm trace</p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Looks like amdgpu_vm_reserve_vmid could work, let me have a try to update the RLC_SPM_VMID with pm4 packets in UMD.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:"Bookman Old Style",serif">Thanks<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:"Bookman Old Style",serif">Jacob<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From: </b><a href="mailto:David1.Zhou@amd.com">Zhou, David(ChunMing)</a><br>
<b>Sent: </b>Thursday, February 20, 2020 10:13 AM<br>
<b>To: </b><a href="mailto:Christian.Koenig@amd.com">Koenig, Christian</a>; <a href="mailto:Jacob.He@amd.com">
He, Jacob</a>; <a href="mailto:amd-gfx@lists.freedesktop.org">amd-gfx@lists.freedesktop.org</a><br>
<b>Subject: </b>RE: [PATCH] drm/amdgpu: Add a chunk ID for spm trace<o:p></o:p></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">[AMD Official Use Only - Internal Distribution Only]<br>
<br>
Christian is right here, that will cause many problems for simply using VMID in kernel.<br>
We already have an pair interface for RGP, I think you can use it instead of involving additional kernel change.<br>
amdgpu_vm_reserve_vmid/ amdgpu_vm_unreserve_vmid.<br>
<br>
-David<br>
<br>
-----Original Message-----<br>
From: amd-gfx <amd-gfx-bounces@lists.freedesktop.org> On Behalf Of Christian König<br>
Sent: Wednesday, February 19, 2020 7:03 PM<br>
To: He, Jacob <Jacob.He@amd.com>; amd-gfx@lists.freedesktop.org<br>
Subject: Re: [PATCH] drm/amdgpu: Add a chunk ID for spm trace<br>
<br>
Am 19.02.20 um 11:15 schrieb Jacob He:<br>
> [WHY]<br>
> When SPM trace enabled, SPM_VMID should be updated with the current <br>
> vmid.<br>
><br>
> [HOW]<br>
> Add a chunk id, AMDGPU_CHUNK_ID_SPM_TRACE, so that UMD can tell us <br>
> which job should update SPM_VMID.<br>
> Right before a job is submitted to GPU, set the SPM_VMID accordingly.<br>
><br>
> [Limitation]<br>
> Running more than one SPM trace enabled processes simultaneously is <br>
> not supported.<br>
<br>
Well there are multiple problems with that patch.<br>
<br>
First of all you need to better describe what SPM tracing is in the commit message.<br>
<br>
Then the updating of mmRLC_SPM_MC_CNTL must be executed asynchronously on the ring. Otherwise we might corrupt an already executing SPM trace.<br>
<br>
And you also need to make sure to disable the tracing again or otherwise we run into a bunch of trouble when the VMID is reused.<br>
<br>
You also need to make sure that IBs using the SPM trace are serialized with each other, e.g. hack into amdgpu_ids.c file and make sure that only one VMID at a time can have that attribute.<br>
<br>
Regards,<br>
Christian.<br>
<br>
><br>
> Change-Id: Ic932ef6ac9dbf244f03aaee90550e8ff3a675666<br>
> Signed-off-by: Jacob He <jacob.he@amd.com><br>
> ---<br>
> drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c | 7 +++++++<br>
> drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c | 10 +++++++---<br>
> drivers/gpu/drm/amd/amdgpu/amdgpu_job.h | 1 +<br>
> drivers/gpu/drm/amd/amdgpu/amdgpu_rlc.h | 1 +<br>
> drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c | 15 ++++++++++++++-<br>
> drivers/gpu/drm/amd/amdgpu/gfx_v7_0.c | 3 ++-<br>
> drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c | 3 ++-<br>
> drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c | 15 ++++++++++++++-<br>
> 8 files changed, 48 insertions(+), 7 deletions(-)<br>
><br>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c <br>
> b/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c<br>
> index f9fa6e104fef..3f32c4db5232 100644<br>
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c<br>
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c<br>
> @@ -113,6 +113,7 @@ static int amdgpu_cs_parser_init(struct amdgpu_cs_parser *p, union drm_amdgpu_cs<br>
> uint32_t uf_offset = 0;<br>
> int i;<br>
> int ret;<br>
> + bool update_spm_vmid = false;<br>
> <br>
> if (cs->in.num_chunks == 0)<br>
> return 0;<br>
> @@ -221,6 +222,10 @@ static int amdgpu_cs_parser_init(struct amdgpu_cs_parser *p, union drm_amdgpu_cs<br>
> case AMDGPU_CHUNK_ID_SYNCOBJ_TIMELINE_SIGNAL:<br>
> break;<br>
> <br>
> + case AMDGPU_CHUNK_ID_SPM_TRACE:<br>
> + update_spm_vmid = true;<br>
> + break;<br>
> +<br>
> default:<br>
> ret = -EINVAL;<br>
> goto free_partial_kdata;<br>
> @@ -231,6 +236,8 @@ static int amdgpu_cs_parser_init(struct amdgpu_cs_parser *p, union drm_amdgpu_cs<br>
> if (ret)<br>
> goto free_all_kdata;<br>
> <br>
> + p->job->need_update_spm_vmid = update_spm_vmid;<br>
> +<br>
> if (p->ctx->vram_lost_counter != p->job->vram_lost_counter) {<br>
> ret = -ECANCELED;<br>
> goto free_all_kdata;<br>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c <br>
> b/drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c<br>
> index cae81914c821..36faab12b585 100644<br>
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c<br>
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c<br>
> @@ -156,9 +156,13 @@ int amdgpu_ib_schedule(struct amdgpu_ring *ring, unsigned num_ibs,<br>
> return -EINVAL;<br>
> }<br>
> <br>
> - if (vm && !job->vmid) {<br>
> - dev_err(adev->dev, "VM IB without ID\n");<br>
> - return -EINVAL;<br>
> + if (vm) {<br>
> + if (!job->vmid) {<br>
> + dev_err(adev->dev, "VM IB without ID\n");<br>
> + return -EINVAL;<br>
> + } else if (adev->gfx.rlc.funcs->update_spm_vmid && job->need_update_spm_vmid) {<br>
> + adev->gfx.rlc.funcs->update_spm_vmid(adev, job->vmid);<br>
> + }<br>
> }<br>
> <br>
> alloc_size = ring->funcs->emit_frame_size + num_ibs * diff --git <br>
> a/drivers/gpu/drm/amd/amdgpu/amdgpu_job.h <br>
> b/drivers/gpu/drm/amd/amdgpu/amdgpu_job.h<br>
> index 2e2110dddb76..4582536961c7 100644<br>
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_job.h<br>
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_job.h<br>
> @@ -52,6 +52,7 @@ struct amdgpu_job {<br>
> bool vm_needs_flush;<br>
> uint64_t vm_pd_addr;<br>
> unsigned vmid;<br>
> + bool need_update_spm_vmid;<br>
> unsigned pasid;<br>
> uint32_t gds_base, gds_size;<br>
> uint32_t gws_base, gws_size;<br>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_rlc.h <br>
> b/drivers/gpu/drm/amd/amdgpu/amdgpu_rlc.h<br>
> index d3d4707f2168..52509c254cbd 100644<br>
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_rlc.h<br>
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_rlc.h<br>
> @@ -126,6 +126,7 @@ struct amdgpu_rlc_funcs {<br>
> void (*stop)(struct amdgpu_device *adev);<br>
> void (*reset)(struct amdgpu_device *adev);<br>
> void (*start)(struct amdgpu_device *adev);<br>
> + void (*update_spm_vmid)(struct amdgpu_device *adev, unsigned vmid);<br>
> };<br>
> <br>
> struct amdgpu_rlc {<br>
> diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c <br>
> b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c<br>
> index 5e9fb0976c6c..91eb788d6229 100644<br>
> --- a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c<br>
> +++ b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c<br>
> @@ -4214,6 +4214,18 @@ static int gfx_v10_0_update_gfx_clock_gating(struct amdgpu_device *adev,<br>
> return 0;<br>
> }<br>
> <br>
> +static void gfx_v10_0_update_spm_vmid(struct amdgpu_device *adev, <br>
> +unsigned vmid) {<br>
> + u32 data;<br>
> +<br>
> + data = RREG32_SOC15(GC, 0, mmRLC_SPM_MC_CNTL);<br>
> +<br>
> + data &= ~RLC_SPM_MC_CNTL__RLC_SPM_VMID_MASK;<br>
> + data |= (vmid & RLC_SPM_MC_CNTL__RLC_SPM_VMID_MASK) << <br>
> +RLC_SPM_MC_CNTL__RLC_SPM_VMID__SHIFT;<br>
> +<br>
> + WREG32_SOC15(GC, 0, mmRLC_SPM_MC_CNTL, data); }<br>
> +<br>
> static const struct amdgpu_rlc_funcs gfx_v10_0_rlc_funcs = {<br>
> .is_rlc_enabled = gfx_v10_0_is_rlc_enabled,<br>
> .set_safe_mode = gfx_v10_0_set_safe_mode, @@ -4224,7 +4236,8 @@ <br>
> static const struct amdgpu_rlc_funcs gfx_v10_0_rlc_funcs = {<br>
> .resume = gfx_v10_0_rlc_resume,<br>
> .stop = gfx_v10_0_rlc_stop,<br>
> .reset = gfx_v10_0_rlc_reset,<br>
> - .start = gfx_v10_0_rlc_start<br>
> + .start = gfx_v10_0_rlc_start,<br>
> + .update_spm_vmid = gfx_v10_0_update_spm_vmid<br>
> };<br>
> <br>
> static int gfx_v10_0_set_powergating_state(void *handle, diff --git <br>
> a/drivers/gpu/drm/amd/amdgpu/gfx_v7_0.c <br>
> b/drivers/gpu/drm/amd/amdgpu/gfx_v7_0.c<br>
> index 8f20a5dd44fe..b24fc55cf13a 100644<br>
> --- a/drivers/gpu/drm/amd/amdgpu/gfx_v7_0.c<br>
> +++ b/drivers/gpu/drm/amd/amdgpu/gfx_v7_0.c<br>
> @@ -4221,7 +4221,8 @@ static const struct amdgpu_rlc_funcs gfx_v7_0_rlc_funcs = {<br>
> .resume = gfx_v7_0_rlc_resume,<br>
> .stop = gfx_v7_0_rlc_stop,<br>
> .reset = gfx_v7_0_rlc_reset,<br>
> - .start = gfx_v7_0_rlc_start<br>
> + .start = gfx_v7_0_rlc_start,<br>
> + .update_spm_vmid = NULL<br>
> };<br>
> <br>
> static int gfx_v7_0_early_init(void *handle) diff --git <br>
> a/drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c <br>
> b/drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c<br>
> index fa245973de12..66640d2b6b37 100644<br>
> --- a/drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c<br>
> +++ b/drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c<br>
> @@ -5600,7 +5600,8 @@ static const struct amdgpu_rlc_funcs iceland_rlc_funcs = {<br>
> .resume = gfx_v8_0_rlc_resume,<br>
> .stop = gfx_v8_0_rlc_stop,<br>
> .reset = gfx_v8_0_rlc_reset,<br>
> - .start = gfx_v8_0_rlc_start<br>
> + .start = gfx_v8_0_rlc_start,<br>
> + .update_spm_vmid = NULL<br>
> };<br>
> <br>
> static void gfx_v8_0_update_medium_grain_clock_gating(struct <br>
> amdgpu_device *adev, diff --git <br>
> a/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c <br>
> b/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c<br>
> index 9b7ff783e9a5..df872f949f68 100644<br>
> --- a/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c<br>
> +++ b/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c<br>
> @@ -4704,6 +4704,18 @@ static int gfx_v9_0_update_gfx_clock_gating(struct amdgpu_device *adev,<br>
> return 0;<br>
> }<br>
> <br>
> +static void gfx_v9_0_update_spm_vmid(struct amdgpu_device *adev, <br>
> +unsigned vmid) {<br>
> + u32 data;<br>
> +<br>
> + data = RREG32_SOC15(GC, 0, mmRLC_SPM_MC_CNTL);<br>
> +<br>
> + data &= ~RLC_SPM_MC_CNTL__RLC_SPM_VMID_MASK;<br>
> + data |= (vmid & RLC_SPM_MC_CNTL__RLC_SPM_VMID_MASK) << <br>
> +RLC_SPM_MC_CNTL__RLC_SPM_VMID__SHIFT;<br>
> +<br>
> + WREG32_SOC15(GC, 0, mmRLC_SPM_MC_CNTL, data); }<br>
> +<br>
> static const struct amdgpu_rlc_funcs gfx_v9_0_rlc_funcs = {<br>
> .is_rlc_enabled = gfx_v9_0_is_rlc_enabled,<br>
> .set_safe_mode = gfx_v9_0_set_safe_mode, @@ -4715,7 +4727,8 @@ <br>
> static const struct amdgpu_rlc_funcs gfx_v9_0_rlc_funcs = {<br>
> .resume = gfx_v9_0_rlc_resume,<br>
> .stop = gfx_v9_0_rlc_stop,<br>
> .reset = gfx_v9_0_rlc_reset,<br>
> - .start = gfx_v9_0_rlc_start<br>
> + .start = gfx_v9_0_rlc_start,<br>
> + .update_spm_vmid = gfx_v9_0_update_spm_vmid<br>
> };<br>
> <br>
> static int gfx_v9_0_set_powergating_state(void *handle,<br>
<br>
_______________________________________________<br>
amd-gfx mailing list<br>
amd-gfx@lists.freedesktop.org<br>
<a href="https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cjacob.he%40amd.com%7C0b340cec0e2d41dd4f8c08d7b5ee6f65%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637177908007702357&sdata=dRtYat6X178kHA8kJgsJOfiM9XhjDpmhM7dZTZy11lk%3D&reserved=0">https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cdavid1.zhou%40amd.com%7C354be34ff18e4f424f6708d7b52b43b0%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637177069753914395&sdata=9rSL4kgPJweuZ4EJpdqtqTxyCVGEkmsg6aUzbtvGFrs%3D&reserved=0</a><o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</body>
</html>