[PATCH 0/2] drm/xe: Fix survivability

Lucas De Marchi lucas.demarchi at intel.com
Tue Mar 11 05:35:15 UTC 2025


It turns out commit d40f275d96e8 ("drm/xe: Move survivability entirely
to xe_pci") did a bad job moving things to xe_pci. The fix provided by
Riana in 20250306055407.511405-1-riana.tauro at intel.com fixes it
partially, but injecting a failure in xe_pcode_probe_early still causes
the kernel to give warnings/errors.

Correct the course and better split what is done in xe_pci vs xe_device.
This time, also add a patch to test we can handle errors in
xe_pcode_probe_early().

Entering survivability mode was tested with an additional one-line
change to the return value of xe_survivability_mode_capable(). If we
want to inject an error there, we'd need to change its return type, but
there's also another patch series to force it via configs, so this
doesn't seem very important right now.
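
To illustrate the return-type point, here is a minimal userspace sketch,
not driver code. It assumes an errno-style injection hook; the names
demo_injected_error, demo_mode_capable_bool() and
demo_mode_capable_int() are hypothetical and do not exist in xe. A
bool-returning check has no room to carry an injected errno, while an
int-returning one does.

```c
#include <errno.h>
#include <stdbool.h>

/*
 * Hypothetical stand-in for an errno-style injection hook: when non-zero,
 * the hooked function returns this value instead of its normal result.
 */
static int demo_injected_error;

/*
 * Hypothetical bool-returning check: the caller only ever sees
 * true/false, so a negative errno cannot be reported through it.
 */
static bool demo_mode_capable_bool(bool hw_capable)
{
	return hw_capable;
}

/*
 * Hypothetical int-returning variant: a negative errno when an error is
 * injected, otherwise 1 (capable) or 0 (not capable).
 */
static int demo_mode_capable_int(bool hw_capable)
{
	if (demo_injected_error)
		return demo_injected_error;

	return hw_capable ? 1 : 0;
}
```

With demo_injected_error set to -ENODEV, demo_mode_capable_int() returns
-ENODEV regardless of its argument; demo_mode_capable_bool() can still
only answer true or false.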

Signed-off-by: Lucas De Marchi <lucas.demarchi at intel.com>
---
Lucas De Marchi (2):
      drm/xe: Move survivability back to xe
      drm/xe: Allow to inject error in xe_pcode_probe_early()

 drivers/gpu/drm/xe/xe_device.c             | 14 +++++++++++++-
 drivers/gpu/drm/xe/xe_pci.c                | 16 +++++++---------
 drivers/gpu/drm/xe/xe_pcode.c              |  2 ++
 drivers/gpu/drm/xe/xe_survivability_mode.c | 14 +++++++++-----
 4 files changed, 31 insertions(+), 15 deletions(-)
---
base-commit: 003c44ec0b7d86569bd13d4a810ee24176c3d034
change-id: 20250310-fix-survivability-703246c0c480

Best regards,
-- 
Lucas De Marchi <lucas.demarchi at intel.com>


