[PATCH v2 0/2] drm/xe: Fix survivability
Lucas De Marchi
lucas.demarchi at intel.com
Tue Mar 11 18:34:54 UTC 2025
It turns out commit d40f275d96e8 ("drm/xe: Move survivability entirely
to xe_pci") did a bad job moving things to xe_pci. The fix provided by
Riana in 20250306055407.511405-1-riana.tauro at intel.com fixes it
partially, but injecting a failure in xe_pcode_probe_early still causes
the kernel to give warnings/errors.
Correct the course and better split what is done in xe_pci vs xe_device.
This time, also add a patch to test we can handle errors in
xe_pcode_probe_early().
Entering survivability mode was tested with an additional one line to
change the return of xe_survivability_mode_capable(). If we want to
inject error, we'd need to change it's return type, but there's also
another patch series to force it via configs, so this doesn't seem very
important right now.
Signed-off-by: Lucas De Marchi <lucas.demarchi at intel.com>
Signed-off-by: Lucas De Marchi <lucas.demarchi at intel.com>
---
Changes in v2:
- Cover more error injections in the second patch
- Link to v1: https://lore.kernel.org/r/20250310-fix-survivability-v1-0-7af31432bbd0@intel.com
---
Lucas De Marchi (2):
drm/xe: Move survivability back to xe
drm/xe: Allow to inject error in early probe
drivers/gpu/drm/xe/xe_device.c | 15 ++++++++++++++-
drivers/gpu/drm/xe/xe_mmio.c | 1 +
drivers/gpu/drm/xe/xe_pci.c | 16 +++++++---------
drivers/gpu/drm/xe/xe_pcode.c | 2 ++
drivers/gpu/drm/xe/xe_survivability_mode.c | 14 +++++++++-----
5 files changed, 33 insertions(+), 15 deletions(-)
---
base-commit: f8df428b3850ed87a1e2f3b12b6025328d8a6373
change-id: 20250310-fix-survivability-703246c0c480
Best regards,
--
Lucas De Marchi <lucas.demarchi at intel.com>
More information about the Intel-xe
mailing list