[PATCH 1/2] drm/amdgpu: Don't query CE and UE errors
Christian König
ckoenig.leichtzumerken at gmail.com
Thu May 13 07:56:58 UTC 2021
On 12.05.21 at 19:03, Luben Tuikov wrote:
> On QUERY2 IOCTL don't query counts of correctable
> and uncorrectable errors, since when RAS is
> enabled and supported on Vega20 server boards,
> this takes an insurmountably long time, in O(n^3),
> which slows the system down to the point of it
> being unusable when we have GUI up.
>
> Fixes: ae363a212b14 ("drm/amdgpu: Add a new flag to AMDGPU_CTX_OP_QUERY_STATE2")
> Cc: Alexander Deucher <Alexander.Deucher at amd.com>
> Cc: stable at vger.kernel.org
> Signed-off-by: Luben Tuikov <luben.tuikov at amd.com>
> ---
> drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c | 26 ++++++++++++-------------
> 1 file changed, 13 insertions(+), 13 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c
> index 01fe60fedcbe..d481a33f4eaf 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c
> @@ -363,19 +363,19 @@ static int amdgpu_ctx_query2(struct amdgpu_device *adev,
> out->state.flags |= AMDGPU_CTX_QUERY2_FLAGS_GUILTY;
>
> /*query ue count*/
> - ras_counter = amdgpu_ras_query_error_count(adev, false);
> - /*ras counter is monotonic increasing*/
> - if (ras_counter != ctx->ras_counter_ue) {
> - out->state.flags |= AMDGPU_CTX_QUERY2_FLAGS_RAS_UE;
> - ctx->ras_counter_ue = ras_counter;
> - }
> -
> - /*query ce count*/
> - ras_counter = amdgpu_ras_query_error_count(adev, true);
> - if (ras_counter != ctx->ras_counter_ce) {
> - out->state.flags |= AMDGPU_CTX_QUERY2_FLAGS_RAS_CE;
> - ctx->ras_counter_ce = ras_counter;
> - }
> + /* ras_counter = amdgpu_ras_query_error_count(adev, false); */
> + /* /\*ras counter is monotonic increasing*\/ */
> + /* if (ras_counter != ctx->ras_counter_ue) { */
> + /* out->state.flags |= AMDGPU_CTX_QUERY2_FLAGS_RAS_UE; */
> + /* ctx->ras_counter_ue = ras_counter; */
> + /* } */
> +
> + /* /\*query ce count*\/ */
> + /* ras_counter = amdgpu_ras_query_error_count(adev, true); */
> + /* if (ras_counter != ctx->ras_counter_ce) { */
> + /* out->state.flags |= AMDGPU_CTX_QUERY2_FLAGS_RAS_CE; */
> + /* ctx->ras_counter_ce = ras_counter; */
> + /* } */

Please drop the code completely. We usually don't keep commented-out
code in the driver.

With that done, the patch is Reviewed-by: Christian König
<christian.koenig at amd.com>

Christian.

>
> mutex_unlock(&mgr->lock);
> return 0;
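
For illustration only, here is a sketch of how the hunk might look with the two queries removed outright rather than commented out, as the review asks. It is assembled from the lines quoted in this thread and is not the actual v2 of the patch. The hunk header offsets are left out because they are not given here, whitespace follows the quoted text, and dropping the leading /*query ue count*/ comment together with the code is an assumption.

```diff
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c
@@ ... @@ static int amdgpu_ctx_query2(struct amdgpu_device *adev,
 out->state.flags |= AMDGPU_CTX_QUERY2_FLAGS_GUILTY;
 
- /*query ue count*/
- ras_counter = amdgpu_ras_query_error_count(adev, false);
- /*ras counter is monotonic increasing*/
- if (ras_counter != ctx->ras_counter_ue) {
- out->state.flags |= AMDGPU_CTX_QUERY2_FLAGS_RAS_UE;
- ctx->ras_counter_ue = ras_counter;
- }
-
- /*query ce count*/
- ras_counter = amdgpu_ras_query_error_count(adev, true);
- if (ras_counter != ctx->ras_counter_ce) {
- out->state.flags |= AMDGPU_CTX_QUERY2_FLAGS_RAS_CE;
- ctx->ras_counter_ce = ras_counter;
- }
-
 mutex_unlock(&mgr->lock);
 return 0;
```

If nothing else in the function uses ras_counter, its local declaration (not visible in the quoted hunk) would presumably have to be removed as well to avoid an unused-variable warning.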