[PATCHv3 2/2] drm/amdgpu: Register MCE notifier for Aldebaran RAS
Borislav Petkov
bp at alien8.de
Thu Sep 23 18:14:15 UTC 2021
On Thu, Sep 23, 2021 at 05:23:21PM +0000, Yazen Ghannam wrote:
> Shouldn't the error still be reported to EDAC for decoding and counting? I
> think users want this.
You know what happens with users getting ECCs reported, right? They
think immediately their hw is going bad and start wanting to replace
it...
So what does actually tell you if you were a simple user and you had 5
correctable errors in the GPU VRAM?
All you wanna do is play, I'd say.
:-)
--
Regards/Gruss,
Boris.
https://people.kernel.org/tglx/notes-about-netiquette
More information about the amd-gfx
mailing list