[PATCH v4 0/3] drm/xe: Fix survivability
Raag Jadav
raag.jadav at intel.com
Tue Mar 18 15:50:57 UTC 2025
On Thu, Mar 13, 2025 at 01:40:58PM -0700, Lucas De Marchi wrote:
> It turns out commit d40f275d96e8 ("drm/xe: Move survivability entirely
> to xe_pci") did a bad job moving things to xe_pci. The fix provided by
> Riana in 20250306055407.511405-1-riana.tauro at intel.com fixes it
> partially, but injecting a failure in xe_pcode_probe_early still causes
> the kernel to give warnings/errors.
>
> Correct the course and better split what is done in xe_pci vs xe_device.
> This time, also add a patch to test we can handle errors in
> xe_pcode_probe_early() and other early probe functions.
>
> Entering survivability mode was tested with an additional one line to
> change the return of xe_survivability_mode_requested(). If we want to
> inject error, we'd need to change it's return type, but there's also
> another patch series to force it via configs, so this doesn't seem very
> important right now.
Not sure how much it helps our case, but still leaving it here in case
it turns out to be useful.
https://lore.kernel.org/r/0-v5-642aa0c94070+4447f-fwctl_jgg@nvidia.com
Raag
More information about the Intel-xe
mailing list