<!DOCTYPE html>
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
Am 05.08.24 um 08:02 schrieb James Lawrence:<br>
<blockquote type="cite"
cite="mid:BE0aVjD8pNALcbd-ZS-4Nc00rErCRqNVp1QYdltXo8SnW5W844fHbukefLZeZYqPxp1-GhcSSMLI6IfQqR4vfKd3NJVWAO20cCsOIfUptLo=@egdaemon.com">
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
<div style="font-family: Arial, sans-serif; font-size: 14px;">Apologies
if I'm hitting the wrong mailing list. long time user, first
time reporter and all that.</div>
</blockquote>
<br>
Sorry for the delayed reply Without a maintainer in CC such
requests are usually overlooked on the mailing list.<br>
<br>
<blockquote type="cite"
cite="mid:BE0aVjD8pNALcbd-ZS-4Nc00rErCRqNVp1QYdltXo8SnW5W844fHbukefLZeZYqPxp1-GhcSSMLI6IfQqR4vfKd3NJVWAO20cCsOIfUptLo=@egdaemon.com">
<div style="font-family: Arial, sans-serif; font-size: 14px;"><br>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;">recently
my system has been suffering from instability with the graphics
system. essentially some application on my system is causing oom
for graphics memory.</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;">normally
I'd just expect a hard crash of the application in such a
scenario. instead the system enters a spin loop of command
submissions,</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;">slows
down dramatically generally resulting in the system freezing up.<br>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;"><br>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;">There
are a couple issues I'd like to point out with the current
situation I'm experiencing:</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;">
<ul
data-editing-info="{"orderedStyleType":1,"unorderedStyleType":2}">
<li style="list-style-type: "- ";"><span>most
importantly the error message doesn't provide any useful
information for tracing the source of the issue. no pid,
or other diagnostic information.</span></li>
<li style="list-style-type: "- ";"><span>its very
noisy when trying to debug. I can occasionally drop my
system to a separate TTY and the message just spams the
entire screen. making it impossible to interact with my
system even if I wanted to load up debugging tools to
analyze the situation.</span></li>
</ul>
<div><br>
</div>
<div>given the error message I believe this line is the source
of the log statement.<br>
</div>
<div><code>[drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Not enough
memory for command submission!</code><br>
</div>
<div><span><a target="_blank" rel="noreferrer nofollow noopener"
href="https://github.com/torvalds/linux/blob/master/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c#L1431"
moz-do-not-send="true" class="moz-txt-link-freetext">https://github.com/torvalds/linux/blob/master/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c#L1431</a></span><br>
</div>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;"><br>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;">Generally
I'm wondering if there is anything that can be done to improve
the experience for end users in such a scenario.</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;"><br>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;">Ideally
the system would nuke the misbehaving process similar to how ram
ooms are handled.</div>
</blockquote>
<br>
If you see this message you should get the OOM killer running along
with it.<br>
<br>
If you don't see this then you probably run into a BUG or something
like that.<br>
<br>
What kernel version are you using and what did you do to trigger
that?<br>
<br>
Regards,<br>
Christian.<br>
<br>
<br>
<blockquote type="cite"
cite="mid:BE0aVjD8pNALcbd-ZS-4Nc00rErCRqNVp1QYdltXo8SnW5W844fHbukefLZeZYqPxp1-GhcSSMLI6IfQqR4vfKd3NJVWAO20cCsOIfUptLo=@egdaemon.com">
<div style="font-family: Arial, sans-serif; font-size: 14px;"><br>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;">but
at a minimum I'd like to be able to figure out how to back track
this to the misbehaving process. any help in this regard would
be appreciated.<br>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;"><br>
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;"><br>
</div>
<div class="protonmail_signature_block"
style="font-family: Arial, sans-serif; font-size: 14px;">
<div
class="protonmail_signature_block-user protonmail_signature_block-empty">
</div>
<div class="protonmail_signature_block-proton"> Sent with <a
target="_blank" href="https://proton.me/"
rel="noopener noreferrer" moz-do-not-send="true">Proton Mail</a>
secure email. </div>
</div>
</blockquote>
<br>
</body>
</html>