<div style="font-family: Arial, sans-serif; font-size: 14px;">Apologies if I'm hitting the wrong mailing list. Long-time user, first-time reporter and all that.</div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;">Recently my system has been suffering from instability in the graphics stack. Essentially, some application on my system is causing an out-of-memory (OOM) condition for graphics memory.</div><div style="font-family: Arial, sans-serif; font-size: 14px;">Normally I'd expect the application to simply crash hard in such a scenario. Instead, the system enters a spin loop of command submissions</div><div style="font-family: Arial, sans-serif; font-size: 14px;">and slows down dramatically, generally resulting in the system freezing up.<br></div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;">There are a couple of issues I'd like to point out with the situation I'm experiencing:</div><div style="font-family: Arial, sans-serif; font-size: 14px;"><ul data-editing-info="{"orderedStyleType":1,"unorderedStyleType":2}"><li style="list-style-type: "- ";"><span>Most importantly, the error message doesn't provide any useful information for tracing the source of the issue: no PID or other diagnostic information.</span></li><li style="list-style-type: "- ";"><span>It's very noisy when trying to debug. I can occasionally drop to a separate TTY, but the message spams the entire screen,
making it impossible to interact with my system even if I wanted to load up debugging tools to analyze the situation.</span></li></ul><div><br></div><div>Given the error message below, I believe the linked line is the source of the log statement.<br></div><div><code>[drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Not enough memory for command submission!</code><br></div><div><span><a target="_blank" rel="noreferrer nofollow noopener" href="https://github.com/torvalds/linux/blob/master/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c#L1431">https://github.com/torvalds/linux/blob/master/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c#L1431</a></span><br></div></div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;">Generally, I'm wondering whether anything can be done to improve the experience for end users in such a scenario.</div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;">Ideally, the system would kill the misbehaving process, similar to how system RAM OOMs are handled.</div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;">But at a minimum, I'd like to be able to trace this back to the misbehaving process. Any help in this regard would be appreciated.<br></div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div>