[PATCH v7 1/5] drm: Introduce device wedged event
Raag Jadav
raag.jadav at intel.com
Tue Oct 8 15:02:43 UTC 2024
On Thu, Oct 03, 2024 at 03:23:22PM +0300, Raag Jadav wrote:
> On Tue, Oct 01, 2024 at 02:20:29PM +0200, Michal Wajdeczko wrote:
> > On 30.09.2024 09:38, Raag Jadav wrote:
> > >
> > > +/**
> > > + * enum drm_wedge_recovery - Recovery method for wedged device in order of
> > > + * severity. To be set as bit fields in drm_device.wedge_recovery variable.
> > > + * Drivers can choose to support any one or multiple of them depending on
> > > + * their needs.
> > > + */
> > > +enum drm_wedge_recovery {
> > > + /** @DRM_WEDGE_RECOVERY_REBIND: unbind + rebind driver */
> > > + DRM_WEDGE_RECOVERY_REBIND,
> > > +
> > > + /** @DRM_WEDGE_RECOVERY_BUS_RESET: unbind + reset bus device + rebind */
> > > + DRM_WEDGE_RECOVERY_BUS_RESET,
> > > +
> > > + /** @DRM_WEDGE_RECOVERY_REBOOT: reboot system */
> > > + DRM_WEDGE_RECOVERY_REBOOT,
> > > +
> > > + /** @DRM_WEDGE_RECOVERY_MAX: for bounds checking, do not use */
> > > + DRM_WEDGE_RECOVERY_MAX
> > > +};
> > > +
> > > /**
> > > * struct drm_device - DRM device structure
> > > *
> > > @@ -317,6 +337,9 @@ struct drm_device {
> > > * Root directory for debugfs files.
> > > */
> > > struct dentry *debugfs_root;
> > > +
> > > + /** @wedge_recovery: Supported recovery methods for wedged device */
> > > + unsigned long wedge_recovery;
> >
> > hmm, so before the driver can ask for a reboot as a recovery method from
> > wedge it has to somehow add 'reboot' as available method? why it that?
>
> It's for consumers to use as fallbacks in case the preferred recovery method
> (sent along with uevent) don't workout. (patch 2/5)
On second thought...
Lucas, do we have a convincing enough usecase for fallback recovery?
If <method> were to fail, I would expect there to be even bigger problems
like kernel crash or unrecoverable hardware failure.
At that point is it worth retrying?
Raag
More information about the Intel-xe
mailing list