<html><head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body>
<p><br>
</p>
<div class="moz-cite-prefix">On 5/19/21 5:14 AM, Huang, Ray wrote:<br>
</div>
<blockquote type="cite" cite="mid:MWHPR12MB12481CD68B579778D851B88FEC2B9@MWHPR12MB1248.namprd12.prod.outlook.com">
<meta name="Generator" content="Microsoft Word 15 (filtered
medium)">
<!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]-->
<style>@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}@font-face
{font-family:DengXian;
panose-1:2 1 6 0 3 1 1 1 1 1;}@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}@font-face
{font-family:DengXian;
panose-1:2 1 6 0 3 1 1 1 1 1;}@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}p.xmsonormal, li.xmsonormal, div.xmsonormal
{mso-style-name:x_msonormal;
margin:0in;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}span.EmailStyle19
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}p.msipheaderc10f11a2, li.msipheaderc10f11a2, div.msipheaderc10f11a2
{mso-style-name:msipheaderc10f11a2;
mso-margin-top-alt:auto;
margin-right:0in;
mso-margin-bottom-alt:auto;
margin-left:0in;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}div.WordSection1
{page:WordSection1;}</style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
<div class="WordSection1">
<p class="msipheaderc10f11a2" style="margin:0in"><span style="font-size:10.0pt;font-family:"Arial",sans-serif;color:green">[Public]</span><o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">I check the patch (below) to disable
compute queues for raven is not landed into drm-next. So
actually all queues are enabled at this moment. Nirmoy, can we
get your confirmation?</p>
</div>
</blockquote>
<p><br>
</p>
<p>I indeed didn't push the commit that disable all but one cu for
raven. I was suppose to check with kfd as Felix wanted to</p>
<p>know if that bug affects KFD. I think I got distracted with
something else. <br>
</p>
<p><br>
</p>
<p>Regards,</p>
<p>Nirmoy<br>
</p>
<blockquote type="cite" cite="mid:MWHPR12MB12481CD68B579778D851B88FEC2B9@MWHPR12MB1248.namprd12.prod.outlook.com">
<div class="WordSection1">
<p class="MsoNormal"><o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><b><span style="font-size:10.0pt;font-family:Consolas;color:seagreen">diff --git
a/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c
b/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c</span></b><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><b><span style="font-size:10.0pt;font-family:Consolas;color:seagreen">index
97a8f786cf85..9352fcb77fe9 100644</span></b><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><b><span style="font-size:10.0pt;font-family:Consolas;color:seagreen">---
a/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c</span></b><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><b><span style="font-size:10.0pt;font-family:Consolas;color:seagreen">+++
b/drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c</span></b><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><b><span style="font-size:10.0pt;font-family:Consolas;color:brown">@@
-812,6 +812,13 @@</span></b><span style="font-size:10.0pt;font-family:Consolas;color:#333333">
</span><span style="font-size:10.0pt;font-family:Consolas;color:#A020F0"> void
amdgpu_kiq_wreg(struct amdgpu_device *adev, uint32_t reg,
uint32_t v)</span><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:#333333">int
amdgpu_gfx_get_num_kcq(struct amdgpu_device *adev)<o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:#333333">{<o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:#333333">
if (amdgpu_num_kcq == -1) {<o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:darkcyan">+
/* raven firmware currently can not load balance jobs</span><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:darkcyan">+
* among multiple compute queues. Enable only one</span><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:darkcyan">+
* compute queue till we have a firmware fix.</span><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:darkcyan">+
*/</span><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:darkcyan">+
if (adev->asic_type == CHIP_RAVEN)</span><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:darkcyan">+
return 1;</span><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:darkcyan">+</span><span style="font-size:10.0pt;font-family:Consolas;color:#333333"><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:#333333">
return 8;<o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:#333333">
} else if (amdgpu_num_kcq > 8 || amdgpu_num_kcq < 0) {<o:p></o:p></span></p>
<p class="MsoNormal" style="margin-bottom:7.5pt;line-height:10.75pt;background:white;word-break:break-all"><span style="font-size:10.0pt;font-family:Consolas;color:#333333">
dev_warn(adev->dev, "set kernel compute queue number to 8
due to invalid parameter provided by user\n");<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">And I am glad to see that we have a
solution to fix this issue at current. Nice work, Changfeng!<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Best Regards,<o:p></o:p></p>
<p class="MsoNormal">Ray<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div style="border:none;border-top:solid #E1E1E1
1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From:</b> Deucher, Alexander
<a class="moz-txt-link-rfc2396E" href="mailto:Alexander.Deucher@amd.com"><Alexander.Deucher@amd.com></a> <br>
<b>Sent:</b> Wednesday, May 19, 2021 11:04 AM<br>
<b>To:</b> Chen, Guchun <a class="moz-txt-link-rfc2396E" href="mailto:Guchun.Chen@amd.com"><Guchun.Chen@amd.com></a>; Zhu,
Changfeng <a class="moz-txt-link-rfc2396E" href="mailto:Changfeng.Zhu@amd.com"><Changfeng.Zhu@amd.com></a>; Alex Deucher
<a class="moz-txt-link-rfc2396E" href="mailto:alexdeucher@gmail.com"><alexdeucher@gmail.com></a>; Das, Nirmoy
<a class="moz-txt-link-rfc2396E" href="mailto:Nirmoy.Das@amd.com"><Nirmoy.Das@amd.com></a><br>
<b>Cc:</b> Huang, Ray <a class="moz-txt-link-rfc2396E" href="mailto:Ray.Huang@amd.com"><Ray.Huang@amd.com></a>; amd-gfx
list <a class="moz-txt-link-rfc2396E" href="mailto:amd-gfx@lists.freedesktop.org"><amd-gfx@lists.freedesktop.org></a><br>
<b>Subject:</b> Re: [PATCH] drm/amdgpu: disable 3DCGCG on
picasso/raven1 to avoid compute hang<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p style="margin:15.0pt"><span style="font-size:10.0pt;font-family:"Arial",sans-serif;color:green">[Public]<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal"><span style="font-size:12.0pt;color:black">I thought we had
disabled all but one of the compute queues on raven due
to this issue or at least disabled the schedulers for
the additional queues, but maybe I'm misremembering.<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:12.0pt;color:black"><o:p> </o:p></span></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:12.0pt;color:black">Alex<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:12.0pt;color:black"><o:p> </o:p></span></p>
</div>
<div class="MsoNormal" style="text-align:center" align="center">
<hr width="98%" size="2" align="center">
</div>
<div id="divRplyFwdMsg">
<p class="MsoNormal"><b><span style="color:black">From:</span></b><span style="color:black"> Chen, Guchun <<a href="mailto:Guchun.Chen@amd.com" moz-do-not-send="true">Guchun.Chen@amd.com</a>><br>
<b>Sent:</b> Tuesday, May 18, 2021 11:00 PM<br>
<b>To:</b> Zhu, Changfeng <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>>;
Deucher, Alexander <<a href="mailto:Alexander.Deucher@amd.com" moz-do-not-send="true">Alexander.Deucher@amd.com</a>>;
Alex Deucher <<a href="mailto:alexdeucher@gmail.com" moz-do-not-send="true">alexdeucher@gmail.com</a>>;
Das, Nirmoy <<a href="mailto:Nirmoy.Das@amd.com" moz-do-not-send="true">Nirmoy.Das@amd.com</a>><br>
<b>Cc:</b> Huang, Ray <<a href="mailto:Ray.Huang@amd.com" moz-do-not-send="true">Ray.Huang@amd.com</a>>;
amd-gfx list <<a href="mailto:amd-gfx@lists.freedesktop.org" moz-do-not-send="true">amd-gfx@lists.freedesktop.org</a>><br>
<b>Subject:</b> RE: [PATCH] drm/amdgpu: disable 3DCGCG
on picasso/raven1 to avoid compute hang</span>
<o:p></o:p></p>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<p style="margin:15.0pt"><span style="font-size:10.0pt;font-family:"Arial",sans-serif;color:green">[Public]<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="xmsonormal">Nirmoy’s patch landed already if I
understand correctly.<o:p></o:p></p>
<p class="xmsonormal"> <o:p></o:p></p>
<p class="xmsonormal">d41a39dda140 drm/scheduler:
improve job distribution with multiple queues<o:p></o:p></p>
<div>
<p class="xmsonormal"> <o:p></o:p></p>
<p class="xmsonormal">Regards,<o:p></o:p></p>
<p class="xmsonormal">Guchun<o:p></o:p></p>
</div>
<p class="xmsonormal"> <o:p></o:p></p>
<div>
<div style="border:none;border-top:solid #E1E1E1
1.0pt;padding:3.0pt 0in 0in 0in">
<p class="xmsonormal"><b>From:</b> amd-gfx <<a href="mailto:amd-gfx-bounces@lists.freedesktop.org" moz-do-not-send="true">amd-gfx-bounces@lists.freedesktop.org</a>>
<b>On Behalf Of </b>Zhu, Changfeng<br>
<b>Sent:</b> Wednesday, May 19, 2021 10:56 AM<br>
<b>To:</b> Deucher, Alexander <<a href="mailto:Alexander.Deucher@amd.com" moz-do-not-send="true">Alexander.Deucher@amd.com</a>>;
Alex Deucher <<a href="mailto:alexdeucher@gmail.com" moz-do-not-send="true">alexdeucher@gmail.com</a>>;
Das, Nirmoy <<a href="mailto:Nirmoy.Das@amd.com" moz-do-not-send="true">Nirmoy.Das@amd.com</a>><br>
<b>Cc:</b> Huang, Ray <<a href="mailto:Ray.Huang@amd.com" moz-do-not-send="true">Ray.Huang@amd.com</a>>;
amd-gfx list <<a href="mailto:amd-gfx@lists.freedesktop.org" moz-do-not-send="true">amd-gfx@lists.freedesktop.org</a>><br>
<b>Subject:</b> RE: [PATCH] drm/amdgpu: disable
3DCGCG on picasso/raven1 to avoid compute hang<o:p></o:p></p>
</div>
</div>
<p class="xmsonormal"> <o:p></o:p></p>
<p style="margin:15.0pt"><span style="font-size:10.0pt;font-family:"Arial",sans-serif;color:green">[Public]</span><o:p></o:p></p>
<p class="xmsonormal"> <o:p></o:p></p>
<div>
<p style="margin:15.0pt"><span style="font-size:10.0pt;font-family:"Arial",sans-serif;color:green">[Public]</span><o:p></o:p></p>
<p class="xmsonormal"> <o:p></o:p></p>
<div>
<p class="xmsonormal">Hi Alex,<o:p></o:p></p>
<p class="xmsonormal"> <o:p></o:p></p>
<p class="xmsonormal">This is the issue exposed by <span style="font-size:12.0pt;color:black">
Nirmoy's patch that provided better load
balancing across queues.</span><o:p></o:p></p>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black"> </span><o:p></o:p></p>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black">BR,</span><o:p></o:p></p>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black">Changfeng.</span><o:p></o:p></p>
<p class="xmsonormal"> <o:p></o:p></p>
<div>
<div style="border:none;border-top:solid #E1E1E1
1.0pt;padding:3.0pt 0in 0in 0in">
<p class="xmsonormal"><b>From:</b> Deucher,
Alexander <<a href="mailto:Alexander.Deucher@amd.com" moz-do-not-send="true">Alexander.Deucher@amd.com</a>>
<br>
<b>Sent:</b> Wednesday, May 19, 2021 10:53 AM<br>
<b>To:</b> Zhu, Changfeng <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>>;
Alex Deucher <<a href="mailto:alexdeucher@gmail.com" moz-do-not-send="true">alexdeucher@gmail.com</a>>;
Das, Nirmoy <<a href="mailto:Nirmoy.Das@amd.com" moz-do-not-send="true">Nirmoy.Das@amd.com</a>><br>
<b>Cc:</b> Huang, Ray <<a href="mailto:Ray.Huang@amd.com" moz-do-not-send="true">Ray.Huang@amd.com</a>>;
amd-gfx list <<a href="mailto:amd-gfx@lists.freedesktop.org" moz-do-not-send="true">amd-gfx@lists.freedesktop.org</a>><br>
<b>Subject:</b> Re: [PATCH] drm/amdgpu:
disable 3DCGCG on picasso/raven1 to avoid
compute hang<o:p></o:p></p>
</div>
</div>
<p class="xmsonormal"> <o:p></o:p></p>
<p style="margin:15.0pt"><span style="font-size:10.0pt;font-family:"Arial",sans-serif;color:green">[Public]</span><o:p></o:p></p>
<p class="xmsonormal"> <o:p></o:p></p>
<div>
<div>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black">+
Nirmoy</span><o:p></o:p></p>
</div>
<div>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black"> </span><o:p></o:p></p>
</div>
<div>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black">I
thought we disabled all but one of the
compute queues on raven due to this issue.
Maybe that patch never landed? Wasn't this
the same issue that was exposed by Nirmoy's
patch that provided better load balancing
across queues?</span><o:p></o:p></p>
</div>
<div>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black"> </span><o:p></o:p></p>
</div>
<div>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black">Alex</span><o:p></o:p></p>
</div>
<div>
<p class="xmsonormal"><span style="font-size:12.0pt;color:black"> </span><o:p></o:p></p>
</div>
<div class="MsoNormal" style="text-align:center" align="center">
<hr width="98%" size="1" align="center">
</div>
<div id="x_divRplyFwdMsg">
<p class="xmsonormal"><b><span style="color:black">From:</span></b><span style="color:black"> amd-gfx <<a href="mailto:amd-gfx-bounces@lists.freedesktop.org" moz-do-not-send="true">amd-gfx-bounces@lists.freedesktop.org</a>>
on behalf of Zhu, Changfeng <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>><br>
<b>Sent:</b> Tuesday, May 18, 2021 10:28 PM<br>
<b>To:</b> Alex Deucher <<a href="mailto:alexdeucher@gmail.com" moz-do-not-send="true">alexdeucher@gmail.com</a>><br>
<b>Cc:</b> Huang, Ray <<a href="mailto:Ray.Huang@amd.com" moz-do-not-send="true">Ray.Huang@amd.com</a>>;
amd-gfx list <<a href="mailto:amd-gfx@lists.freedesktop.org" moz-do-not-send="true">amd-gfx@lists.freedesktop.org</a>><br>
<b>Subject:</b> RE: [PATCH] drm/amdgpu:
disable 3DCGCG on picasso/raven1 to avoid
compute hang</span>
<o:p></o:p></p>
<div>
<p class="xmsonormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="xmsonormal">[AMD Official Use Only -
Internal Distribution Only]<br>
<br>
Hi Alex.<br>
<br>
I have submitted the patch: drm/amdgpu:
disable 3DCGCG on picasso/raven1 to avoid
compute hang<br>
<br>
Do you mean we have something else to do for
re-enabling the extra compute queues?<br>
<br>
BR,<br>
Changfeng.<br>
<br>
-----Original Message-----<br>
From: Alex Deucher <<a href="mailto:alexdeucher@gmail.com" moz-do-not-send="true">alexdeucher@gmail.com</a>>
<br>
Sent: Wednesday, May 19, 2021 10:20 AM<br>
To: Zhu, Changfeng <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>><br>
Cc: Huang, Ray <<a href="mailto:Ray.Huang@amd.com" moz-do-not-send="true">Ray.Huang@amd.com</a>>;
amd-gfx list <<a href="mailto:amd-gfx@lists.freedesktop.org" moz-do-not-send="true">amd-gfx@lists.freedesktop.org</a>><br>
Subject: Re: [PATCH] drm/amdgpu: disable
3DCGCG on picasso/raven1 to avoid compute
hang<br>
<br>
Care to submit a patch to re-enable the
extra compute queues?<br>
<br>
Alex<br>
<br>
On Mon, May 17, 2021 at 4:09 AM Zhu,
Changfeng <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>>
wrote:<br>
><br>
> [AMD Official Use Only - Internal
Distribution Only]<br>
><br>
> Hi Ray and Alex,<br>
><br>
> I have confirmed it can enable the
additional compute queues with this patch:<br>
><br>
> [ 41.823013] This is ring mec 1, pipe
0, queue 0, value 1<br>
> [ 41.823028] This is ring mec 1, pipe
1, queue 0, value 1<br>
> [ 41.823042] This is ring mec 1, pipe
2, queue 0, value 1<br>
> [ 41.823057] This is ring mec 1, pipe
3, queue 0, value 1<br>
> [ 41.823071] This is ring mec 1, pipe
0, queue 1, value 1<br>
> [ 41.823086] This is ring mec 1, pipe
1, queue 1, value 1<br>
> [ 41.823101] This is ring mec 1, pipe
2, queue 1, value 1<br>
> [ 41.823115] This is ring mec 1, pipe
3, queue 1, value 1<br>
><br>
> BR,<br>
> Changfeng.<br>
><br>
><br>
> -----Original Message-----<br>
> From: Huang, Ray <<a href="mailto:Ray.Huang@amd.com" moz-do-not-send="true">Ray.Huang@amd.com</a>><br>
> Sent: Monday, May 17, 2021 2:27 PM<br>
> To: Alex Deucher <<a href="mailto:alexdeucher@gmail.com" moz-do-not-send="true">alexdeucher@gmail.com</a>>;
Zhu, Changfeng
<br>
> <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>><br>
> Cc: amd-gfx list <<a href="mailto:amd-gfx@lists.freedesktop.org" moz-do-not-send="true">amd-gfx@lists.freedesktop.org</a>><br>
> Subject: Re: [PATCH] drm/amdgpu:
disable 3DCGCG on picasso/raven1 to <br>
> avoid compute hang<br>
><br>
> On Fri, May 14, 2021 at 10:13:55PM
+0800, Alex Deucher wrote:<br>
> > On Fri, May 14, 2021 at 4:20 AM
<<a href="mailto:changfeng.zhu@amd.com" moz-do-not-send="true">changfeng.zhu@amd.com</a>>
wrote:<br>
> > ><br>
> > > From: changzhu <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>><br>
> > ><br>
> > > From: Changfeng <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>><br>
> > ><br>
> > > There is problem with 3DCGCG
firmware and it will cause compute <br>
> > > test hang on picasso/raven1.
It needs to disable 3DCGCG in driver <br>
> > > to avoid compute hang.<br>
> > ><br>
> > > Change-Id:
Ic7d3c7922b2b32f7ac5193d6a4869cbc5b3baa87<br>
> > > Signed-off-by: Changfeng <<a href="mailto:Changfeng.Zhu@amd.com" moz-do-not-send="true">Changfeng.Zhu@amd.com</a>><br>
> ><br>
> > Reviewed-by: Alex Deucher <<a href="mailto:alexander.deucher@amd.com" moz-do-not-send="true">alexander.deucher@amd.com</a>><br>
> ><br>
> > WIth this applied, can we
re-enable the additional compute queues?<br>
> ><br>
><br>
> I think so.<br>
><br>
> Changfeng, could you please confirm
this on all raven series?<br>
><br>
> Patch is Reviewed-by: Huang Rui <<a href="mailto:ray.huang@amd.com" moz-do-not-send="true">ray.huang@amd.com</a>><br>
><br>
> > Alex<br>
> ><br>
> > > ---<br>
> > >
drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c | 10
+++++++---<br>
> > >
drivers/gpu/drm/amd/amdgpu/soc15.c | 2
--<br>
> > > 2 files changed, 7
insertions(+), 5 deletions(-)<br>
> > ><br>
> > > diff --git
a/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c<br>
> > >
b/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c<br>
> > > index
22608c45f07c..feaa5e4a5538 100644<br>
> > > ---
a/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c<br>
> > > +++
b/drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c<br>
> > > @@ -4947,7 +4947,7 @@ static
void gfx_v9_0_update_3d_clock_gating(struct
amdgpu_device *adev,<br>
> > >
amdgpu_gfx_rlc_enter_safe_mode(adev);<br>
> > ><br>
> > > /* Enable 3D
CGCG/CGLS */<br>
> > > - if (enable &&
(adev->cg_flags &
AMD_CG_SUPPORT_GFX_3D_CGCG)) {<br>
> > > + if (enable) {<br>
> > > /* write cmd
to clear cgcg/cgls ov */<br>
> > > def = data =
RREG32_SOC15(GC, 0,
mmRLC_CGTT_MGCG_OVERRIDE);<br>
> > > /* unset CGCG
override */ @@ -4959,8 +4959,12 @@ <br>
> > > static void
gfx_v9_0_update_3d_clock_gating(struct
amdgpu_device *adev,<br>
> > > /* enable
3Dcgcg FSM(0x0000363f) */<br>
> > > def =
RREG32_SOC15(GC, 0, <br>
> > > mmRLC_CGCG_CGLS_CTRL_3D);<br>
> > ><br>
> > > - data = (0x36
<<
RLC_CGCG_CGLS_CTRL_3D__CGCG_GFX_IDLE_THRESHOLD__SHIFT)
|<br>
> > > -
RLC_CGCG_CGLS_CTRL_3D__CGCG_EN_MASK;<br>
> > > + if
(adev->cg_flags &
AMD_CG_SUPPORT_GFX_3D_CGCG)<br>
> > > + data
= (0x36 <<
RLC_CGCG_CGLS_CTRL_3D__CGCG_GFX_IDLE_THRESHOLD__SHIFT)
|<br>
> > >
+
RLC_CGCG_CGLS_CTRL_3D__CGCG_EN_MASK;<br>
> > > + else<br>
> > > + data
= 0x0 << <br>
> > > +
RLC_CGCG_CGLS_CTRL_3D__CGCG_GFX_IDLE_THRESHOLD__SHIFT;<br>
> > > +<br>
> > > if
(adev->cg_flags &
AMD_CG_SUPPORT_GFX_3D_CGLS)<br>
> > > data
|= (0x000F <<
RLC_CGCG_CGLS_CTRL_3D__CGLS_REP_COMPANSAT_DELAY__SHIFT)
|<br>
> >
> <br>
> > >
RLC_CGCG_CGLS_CTRL_3D__CGLS_EN_MASK;<br>
> > > diff --git
a/drivers/gpu/drm/amd/amdgpu/soc15.c<br>
> > >
b/drivers/gpu/drm/amd/amdgpu/soc15.c<br>
> > > index
4b660b2d1c22..080e715799d4 100644<br>
> > > ---
a/drivers/gpu/drm/amd/amdgpu/soc15.c<br>
> > > +++
b/drivers/gpu/drm/amd/amdgpu/soc15.c<br>
> > > @@ -1393,7 +1393,6 @@ static
int soc15_common_early_init(void *handle)<br>
> > >
adev->cg_flags = AMD_CG_SUPPORT_GFX_MGCG
|<br>
> >
>
AMD_CG_SUPPORT_GFX_MGLS |<br>
> >
>
AMD_CG_SUPPORT_GFX_CP_LS |<br>
> > >
-
AMD_CG_SUPPORT_GFX_3D_CGCG |<br>
> >
>
AMD_CG_SUPPORT_GFX_3D_CGLS |<br>
> >
>
AMD_CG_SUPPORT_GFX_CGCG |<br>
> >
>
AMD_CG_SUPPORT_GFX_CGLS | @@ <br>
> > > -1413,7<br>
> > > +1412,6 @@ static int
soc15_common_early_init(void *handle)<br>
> >
>
AMD_CG_SUPPORT_GFX_MGLS |<br>
> >
>
AMD_CG_SUPPORT_GFX_RLC_LS |<br>
> >
>
AMD_CG_SUPPORT_GFX_CP_LS |<br>
> > >
-
AMD_CG_SUPPORT_GFX_3D_CGCG |<br>
> >
>
AMD_CG_SUPPORT_GFX_3D_CGLS |<br>
> >
>
AMD_CG_SUPPORT_GFX_CGCG |<br>
> >
>
AMD_CG_SUPPORT_GFX_CGLS |<br>
> > > --<br>
> > > 2.17.1<br>
> > ><br>
> > >
_______________________________________________<br>
> > > amd-gfx mailing list<br>
> > > <a href="mailto:amd-gfx@lists.freedesktop.org" moz-do-not-send="true">amd-gfx@lists.freedesktop.org</a><br>
> > > <a href="https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2F" moz-do-not-send="true">
https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2F</a><br>
> > > li <br>
> > >
sts.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=04%7C0<br>
> > > 1% <br>
> > >
7CRay.Huang%40amd.com%7C0e273856253d4b3efd0b08d916e2892a%7C3dd8961<br>
> > > fe <br>
> > >
4884e608e11a82d994e183d%7C0%7C0%7C637565984495414849%7CUnknown%7CT<br>
> > > WF <br>
> > >
pbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXV<br>
> > > CI <br>
> > >
6Mn0%3D%7C1000&sdata=lBzswAPBguL0mWFglEk%2Bg2eDCEuhir7JfFjov%2<br>
> > > BV<br>
> > > 7pSY%3D&reserved=0<br>
_______________________________________________<br>
amd-gfx mailing list<br>
<a href="mailto:amd-gfx@lists.freedesktop.org" moz-do-not-send="true">amd-gfx@lists.freedesktop.org</a><br>
<a href="https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=04%7C01%7Cguchun.chen%40amd.com%7C3fc7a549816d4c8061c008d91a719cb8%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637569897555065647%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=YTC%2FvVR%2BbPKw9JKayhmHapRkkEFaczoGzJJ3jFJqBAM%3D&reserved=0" moz-do-not-send="true">https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=04%7C01%7Calexander.deucher%40amd.com%7C6d2cfe6e59f54875f6fa08d91a6dd27f%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637569881259273626%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=33Is2P3sqdabI7PPuHFOmzuvXyFId%2BOTAMyJ8G5PhzI%3D&reserved=0</a><o:p></o:p></p>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</blockquote>
</body>
</html>