[Intel-gfx] [PATCH v2 1/2] drm/i915: Reboot CI if we get wedged during driver init

Chris Wilson chris at chris-wilson.co.uk
Mon Jul 6 11:05:44 UTC 2020


Quoting Michał Winiarski (2020-07-06 11:49:52)
> From: Michał Winiarski <michal.winiarski at intel.com>
> 
> Getting wedged device on driver init is pretty much unrecoverable.
> Since we're running various scenarios that may potentially hit this in
> CI (module reload / selftests / hotunplug), and if it happens, it means
> that we can't trust any subsequent CI results, we should just apply the
> taint to let the CI know that it should reboot (CI checks taint between
> test runs).
> 
> v2: Comment that WEDGED_ON_INIT is non-recoverable, distinguish
>     WEDGED_ON_INIT from WEDGED_ON_FINI (Chris)
> 
> Signed-off-by: Michał Winiarski <michal.winiarski at intel.com>
> Cc: Chris Wilson <chris at chris-wilson.co.uk>
> Cc: Michal Wajdeczko <michal.wajdeczko at intel.com>
> Cc: Petri Latvala <petri.latvala at intel.com>

+1 for has_unrecoverable_error()
Reviewed-by: Chris Wilson <chris at chris-wilson.co.uk>

> -static inline bool intel_gt_has_init_error(const struct intel_gt *gt)
> +static inline bool intel_gt_is_wedged(const struct intel_gt *gt)
>  {
> -       return test_bit(I915_WEDGED_ON_INIT, &gt->reset.flags);
> +       GEM_BUG_ON(intel_gt_has_unrecoverable_error(gt) ?
> +                  !test_bit(I915_WEDGED, &gt->reset.flags) : false);

GEM_BUG_ON(intel_gt_has_unrecoverable_error(gt) &&
	   !test_bit(I915_WEDGED, &gt->reset.flags));

Perhaps a bit more conventional way of saying the same :)
-Chris


More information about the Intel-gfx mailing list