[Bug 206475] amdgpu under load drop signal to monitor until hard reset

bugzilla-daemon at bugzilla.kernel.org
Wed Jun 24 20:41:42 UTC 2020


https://bugzilla.kernel.org/show_bug.cgi?id=206475

Alex Deucher (alexdeucher at gmail.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |alexdeucher at gmail.com

--- Comment #16 from Alex Deucher (alexdeucher at gmail.com) ---
(In reply to Andrew Ammerlaan from comment #15)
> However, now that the iGPU is default, I can still see the system monitor
> that I usually run on the other monitor when this issue occurs. Every single
> time the thermal sensor of the GPU would show a ridiculous value (e.g. 511
> degrees Celsius).

While the GPU is in reset, every read from the MMIO BAR returns all ones, so
the sensor readout is all ones until the reset succeeds.  511 is 0x1ff, i.e.
nine bits all set.  This patch will fix that issue:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=9271dfd9e0f79e2969dcbe28568bce0fdc4f8f73
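
To illustrate why an all-ones read shows up as 511, here is a minimal sketch in C. It is not the driver code: the 9-bit mask and the names TEMP_FIELD_MASK, read_reg() and decode_temp_c() are assumptions made for the example, chosen only because 511 equals 0x1ff.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical 9-bit temperature field; assumed for illustration only,
 * not copied from the amdgpu source. */
#define TEMP_FIELD_MASK 0x1ffu

/* Simulated register read: returns all ones while the device is in reset,
 * otherwise the real register value. */
static uint32_t read_reg(int in_reset, uint32_t real_value)
{
    return in_reset ? 0xffffffffu : real_value;
}

/* Extract the temperature field (degrees Celsius) from a raw register value. */
static uint32_t decode_temp_c(uint32_t raw)
{
    return raw & TEMP_FIELD_MASK;
}

/* Sanity check of the two cases discussed in the comment above. */
static void self_check(void)
{
    /* Normal operation: the field passes through unchanged. */
    assert(decode_temp_c(read_reg(0, 45)) == 45);
    /* In reset: all ones masked to nine bits gives 511. */
    assert(decode_temp_c(read_reg(1, 45)) == 511);
}
```

Under these assumptions, any monitoring tool that polls the sensor during a reset would report 511 degrees regardless of the actual temperature, which matches the symptom described in comment #15.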
