[PATCH V2] drm/amdgpu: Fix SDMA UTC_L1 handling during start/stop sequences

Lazar, Lijo lijo.lazar at amd.com
Mon Jun 16 16:01:28 UTC 2025



On 6/16/2025 8:54 PM, Zhang, Jesse(Jie) wrote:
> [AMD Official Use Only - AMD Internal Distribution Only]
> 
> -----Original Message-----
> From: Lazar, Lijo <Lijo.Lazar at amd.com>
> Sent: Monday, June 16, 2025 11:05 PM
> To: Alex Deucher <alexdeucher at gmail.com>; Zhang, Jesse(Jie) <Jesse.Zhang at amd.com>
> Cc: amd-gfx at lists.freedesktop.org; Deucher, Alexander <Alexander.Deucher at amd.com>; Koenig, Christian <Christian.Koenig at amd.com>; Kim, Jonathan <Jonathan.Kim at amd.com>
> Subject: Re: [PATCH V2] drm/amdgpu: Fix SDMA UTC_L1 handling during start/stop sequences
> 
> 
> 
> On 6/16/2025 8:28 PM, Alex Deucher wrote:
>> On Mon, Jun 16, 2025 at 10:47 AM Jesse Zhang <jesse.zhang at amd.com> wrote:
>>>
>>> This commit makes two key fixes to SDMA v4.4.2 handling:
>>>
>>> 1. disable UTC_L1 in sdma_cntl register when stopping SDMA engines
>>>    by reading the current value before modifying UTC_L1_ENABLE bit.
>>>
>>> 2. Ensure UTC_L1_ENABLE is consistently managed by:
>>>    - Adding the missing register write when enabling UTC_L1 during start
>>>    - Keeping UTC_L1 enabled by default as per hardware requirements
>>>
>>> Fixes the commit 7f03b1d14d51 ("drm/amdgpu:
>>> Remove sdma halt/unhalt during frontdoor load") by maintaining proper
>>> SDMA controller state during initialization sequences.
>>
>> Proper fixes tag formatting:
>>
>> Fixes: 7f03b1d14d51 ("drm/amdgpu: Remove sdma halt/unhalt during
>> frontdoor load")
>>
> 
> The default reset value of that field is 1. Keeping SDMA unhalted but rings disabled shouldn't ideally be a problem - should be similar to the case when it's loaded also.
> 
> Yes, Lijo, I will remove the related description when submitting. This is our problem. We need a workaround to enable SDMA_CNTL::UTC_L1. Because when we call smu queue reset interface,
> it disables UTC_L1.
> 

Thanks for clarifying. While submitting, please update the description
to indicate that this is related to stop/restart of engines for an
engine reset.

Thanks,
Lijo

> Regards
> Jesse
> Thanks,
> Lijo
> 
>>>
>>> v2: Correct SDMA_CNTL setting (Philip)
>>>
>>> Suggested-by: Jonathan Kim <jonathan.kim at amd.com>
>>> Signed-off-by: Jesse Zhang <Jesse.Zhang at amd.com>
>>
>> Acked-by: Alex Deucher <alexander.deucher at amd.com>
>>
>>> ---
>>>  drivers/gpu/drm/amd/amdgpu/sdma_v4_4_2.c | 6 +++++-
>>>  1 file changed, 5 insertions(+), 1 deletion(-)
>>>
>>> diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v4_4_2.c
>>> b/drivers/gpu/drm/amd/amdgpu/sdma_v4_4_2.c
>>> index e6af50c947a1..52157acab55a 100644
>>> --- a/drivers/gpu/drm/amd/amdgpu/sdma_v4_4_2.c
>>> +++ b/drivers/gpu/drm/amd/amdgpu/sdma_v4_4_2.c
>>> @@ -492,7 +492,7 @@ static void sdma_v4_4_2_inst_gfx_stop(struct
>>> amdgpu_device *adev,  {
>>>         struct amdgpu_ring *sdma[AMDGPU_MAX_SDMA_INSTANCES];
>>>         u32 doorbell_offset, doorbell;
>>> -       u32 rb_cntl, ib_cntl;
>>> +       u32 rb_cntl, ib_cntl, sdma_cntl;
>>>         int i;
>>>
>>>         for_each_inst(i, inst_mask) { @@ -504,6 +504,9 @@ static void
>>> sdma_v4_4_2_inst_gfx_stop(struct amdgpu_device *adev,
>>>                 ib_cntl = RREG32_SDMA(i, regSDMA_GFX_IB_CNTL);
>>>                 ib_cntl = REG_SET_FIELD(ib_cntl, SDMA_GFX_IB_CNTL, IB_ENABLE, 0);
>>>                 WREG32_SDMA(i, regSDMA_GFX_IB_CNTL, ib_cntl);
>>> +               sdma_cntl = RREG32_SDMA(i, SDMA_CNTL);
>>> +               sdma_cntl = REG_SET_FIELD(sdma_cntl, SDMA_CNTL, UTC_L1_ENABLE, 0);
>>> +               WREG32_SDMA(i, regSDMA_CNTL, sdma_cntl);
>>>
>>>                 if (sdma[i]->use_doorbell) {
>>>                         doorbell = RREG32_SDMA(i,
>>> regSDMA_GFX_DOORBELL); @@ -997,6 +1000,7 @@ static int sdma_v4_4_2_inst_start(struct amdgpu_device *adev,
>>>                 /* set utc l1 enable flag always to 1 */
>>>                 temp = RREG32_SDMA(i, regSDMA_CNTL);
>>>                 temp = REG_SET_FIELD(temp, SDMA_CNTL, UTC_L1_ENABLE,
>>> 1);
>>> +               WREG32_SDMA(i, regSDMA_CNTL, temp);
>>>
>>>                 if (amdgpu_ip_version(adev, SDMA0_HWIP, 0) < IP_VERSION(4, 4, 5)) {
>>>                         /* enable context empty interrupt during
>>> initialization */
>>> --
>>> 2.34.1
>>>
> 



More information about the amd-gfx mailing list