[Intel-xe] [PATCH V9 6/6] drm/xe: Add min/max cap for engine scheduler properties

Upadhyay, Tejas tejas.upadhyay at intel.com
Fri Jul 28 07:36:29 UTC 2023



> -----Original Message-----
> From: Vishwanathapura, Niranjana <niranjana.vishwanathapura at intel.com>
> Sent: Friday, July 28, 2023 11:56 AM
> To: Upadhyay, Tejas <tejas.upadhyay at intel.com>
> Cc: Brost, Matthew <matthew.brost at intel.com>; intel-
> xe at lists.freedesktop.org
> Subject: Re: [PATCH V9 6/6] drm/xe: Add min/max cap for engine scheduler
> properties
> 
> On Thu, Jul 27, 2023 at 10:54:42PM -0700, Upadhyay, Tejas wrote:
> >
> >
> >> -----Original Message-----
> >> From: Brost, Matthew <matthew.brost at intel.com>
> >> Sent: Friday, July 28, 2023 10:35 AM
> >> To: Vishwanathapura, Niranjana
> <niranjana.vishwanathapura at intel.com>
> >> Cc: Upadhyay, Tejas <tejas.upadhyay at intel.com>; intel-
> >> xe at lists.freedesktop.org
> >> Subject: Re: [PATCH V9 6/6] drm/xe: Add min/max cap for engine
> >> scheduler properties
> >>
> >> On Thu, Jul 27, 2023 at 09:44:23PM -0700, Niranjana Vishwanathapura
> >> wrote:
> >> > On Thu, Jul 27, 2023 at 03:26:49PM +0000, Matthew Brost wrote:
> >> > > On Thu, Jul 27, 2023 at 01:52:35PM +0530, Tejas Upadhyay wrote:
> >> > > > Add sysfs entries for the min, max, and defaults for each of
> >> > > > engine scheduler controls for every hardware engine class.
> >> > > >
> >> > > > Non-elevated user IOCTLs to set these controls must be within
> >> > > > the min-max ranges of the sysfs entries, elevated user can set
> >> > > > these controls to any value. However, introduced compile time
> >> > > > CONFIG min-max values which restricts elevated user to be in
> >> > > > compile time min-max range if at all sysfs min/max are violated.
> >> > > >
> >> > > > Sysfs entries examples are,
> >> > > > DUT# cat
> /sys/class/drm/cardX/device/tileN/gtN/engines/ccs/.defaults/
> >> > > > job_timeout_max         job_timeout_ms          preempt_timeout_min
> >> timeslice_duration_max  timeslice_duration_us
> >> > > > job_timeout_min         preempt_timeout_max
> preempt_timeout_us
> >> timeslice_duration_min
> >> > > >
> >> > > > DUT# cat /sys/class/drm/card1/device/tileN/gtN/engines/ccs/
> >> > > > .defaults/              job_timeout_min         preempt_timeout_max
> >> preempt_timeout_us      timeslice_duration_min
> >> > > > job_timeout_max         job_timeout_ms          preempt_timeout_min
> >> timeslice_duration_max  timeslice_duration_us
> >> > > >
> >> > > > V9 :
> >> > > >    - Rebase to use s/xe_engine/xe_hw_engine/ - Matt
> >> > > > V8 :
> >> > > >    - fix enforce_sched_limit and avoid code duplication - Niranjana
> >> > > >    - Make sure min < max - Niranjana
> >> > > > V7 :
> >> > > >    - Rebase to replace hw engine with eclass interface
> >> > > >    - return EINVAL in place of EPERM
> >> > > >    - Use some APIs to avoid code duplication
> >> > > > V6 :
> >> > > >    - Rebase changes to reflect per engine class props interface - MattB
> >> > > >    - Use #if ENABLED - MattB
> >> > > >    - Remove MAX_SCHED_TIMEOUT check as range validation is
> >> > > > enough
> >> > > > V5 :
> >> > > >    - Rebase to resolve conflicts - CI
> >> > > > V4 :
> >> > > >    - Rebase
> >> > > >    - Update commit to reflect tile addition
> >> > > >    - Use XE_HW macro directly as they are already filtered
> >> > > >      for CONFIG checks - Niranjana
> >> > > >    - Add CONFIG for enable/disable min/max limitation
> >> > > >      on elevated user. Default is enable - Matt/Joonas
> >> > > > V3 :
> >> > > >    - Resolve CI hooks warning for kernel-doc
> >> > > > V2 :
> >> > > >    - Restric min/max setting to #define default min/max for
> >> > > >      elevated user - Himal
> >> > > >    - Remove unrelated changes from patch - Niranjana
> >> > > >
> >> > > > Signed-off-by: Tejas Upadhyay <tejas.upadhyay at intel.com>
> >> > > > ---
> >> > > >  drivers/gpu/drm/xe/Kconfig                    |   6 +
> >> > > >  drivers/gpu/drm/xe/Kconfig.profile            |  46 +++
> >> > > >  drivers/gpu/drm/xe/xe_engine.c                |  25 +-
> >> > > >  drivers/gpu/drm/xe/xe_hw_engine.c             |   8 +
> >> > > >  drivers/gpu/drm/xe/xe_hw_engine.h             |  31 ++
> >> > > >  drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.c | 354
> >> ++++++++++++++++++
> >> > > >  drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.h |   2 +
> >> > > >  7 files changed, 466 insertions(+), 6 deletions(-)  create
> >> > > > mode
> >> > > > 100644 drivers/gpu/drm/xe/Kconfig.profile
> >> > > >
> >> > > > diff --git a/drivers/gpu/drm/xe/Kconfig
> >> > > > b/drivers/gpu/drm/xe/Kconfig index d44794f99338..0a4ea965645b
> >> > > > 100644
> >> > > > --- a/drivers/gpu/drm/xe/Kconfig
> >> > > > +++ b/drivers/gpu/drm/xe/Kconfig
> >> > > > @@ -83,3 +83,9 @@ depends on DRM_XE  depends on EXPERT
> source
> >> > > > "drivers/gpu/drm/xe/Kconfig.debug"
> >> > > >  endmenu
> >> > > > +
> >> > > > +menu "drm/xe Profile Guided Optimisation"
> >> > > > +	visible if EXPERT
> >> > > > +	depends on DRM_XE
> >> > > > +	source "drivers/gpu/drm/xe/Kconfig.profile"
> >> > > > +endmenu
> >> > > > diff --git a/drivers/gpu/drm/xe/Kconfig.profile
> >> > > > b/drivers/gpu/drm/xe/Kconfig.profile
> >> > > > new file mode 100644
> >> > > > index 000000000000..e72f15ec4bf6
> >> > > > --- /dev/null
> >> > > > +++ b/drivers/gpu/drm/xe/Kconfig.profile
> >> > > > @@ -0,0 +1,46 @@
> >> > > > +config DRM_XE_JOB_TIMEOUT_MAX
> >> > > > +       int "Default max job timeout (ms)"
> >> > > > +       default 10000 # milliseconds
> >> > > > +       help
> >> > > > +         Configures the default max job timeout after which job will
> >> > > > +         be forcefully taken away from scheduler.
> >> > > > +config DRM_XE_JOB_TIMEOUT_MIN
> >> > > > +       int "Default max job timeout (ms)"
> >> > > > +       default 1 # milliseconds
> >> > > > +       help
> >> > > > +         Configures the default min job timeout after which job will
> >> > > > +         be forcefully taken away from scheduler.
> >> > > > +config DRM_XE_TIMESLICE_MAX
> >> > > > +       int "Default max timeslice duration (us)"
> >> > > > +       default 10000000 # microseconds
> >> > > > +       help
> >> > > > +         Configures the default max timeslice duration between
> multiple
> >> > > > +         contexts by guc scheduling.
> >> > > > +config DRM_XE_TIMESLICE_MIN
> >> > > > +       int "Default min timeslice duration (us)"
> >> > > > +       default 1 # microseconds
> >> > > > +       help
> >> > > > +         Configures the default min timeslice duration between
> multiple
> >> > > > +         contexts by guc scheduling.
> >> > > > +config DRM_XE_PREEMPT_TIMEOUT_MAX
> >> > > > +       int "Default max  preempt timeout (us)"
> >> > > > +       default 10000000 # microseconds
> >> > > > +       help
> >> > > > +         Configures the default max preempt timeout after which
> context
> >> > > > +         will be forcefully taken away and higher priority context will
> >> > > > +         run.
> >> > > > +config DRM_XE_PREEMPT_TIMEOUT_MIN
> >> > > > +       int "Default min  preempt timeout (us)"
> >> > > > +       default 1 # microseconds
> >> > > > +       help
> >> > > > +         Configures the default min preempt timeout after which
> context
> >> > > > +         will be forcefully taken away and higher priority context will
> >> > > > +         run.
> >> > > > +config DRM_XE_ENABLE_SCHEDTIMEOUT_LIMIT
> >> > > > +       bool "Default configuration of limitation on scheduler timeout"
> >> > > > +       default y
> >> > > > +       help
> >> > > > +	 Configures the enablement of limitation on scheduler
> timeout
> >> > > > +	 to apply to applicable user. For elevated user, all above MIN
> >> > > > +	 and MAX values will apply when this configuration is enable
> to
> >> > > > +	 apply limitation. By default limitation is applied.
> >> > > > diff --git a/drivers/gpu/drm/xe/xe_engine.c
> >> > > > b/drivers/gpu/drm/xe/xe_engine.c index
> >> > > > 9e167b113963..d934196eb79f
> >> > > > 100644
> >> > > > --- a/drivers/gpu/drm/xe/xe_engine.c
> >> > > > +++ b/drivers/gpu/drm/xe/xe_engine.c
> >> > > > @@ -13,6 +13,7 @@
> >> > > >
> >> > > >  #include "xe_device.h"
> >> > > >  #include "xe_gt.h"
> >> > > > +#include "xe_hw_engine_class_sysfs.h"
> >> > > >  #include "xe_hw_fence.h"
> >> > > >  #include "xe_lrc.h"
> >> > > >  #include "xe_macros.h"
> >> > > > @@ -191,8 +192,12 @@ static int engine_set_priority(struct
> >> > > > xe_device *xe, struct xe_engine *e,  static int
> >> engine_set_timeslice(struct xe_device *xe, struct xe_engine *e,
> >> > > >  				u64 value, bool create)
> >> > > >  {
> >> > > > -	if (!capable(CAP_SYS_NICE))
> >> > > > -		return -EPERM;
> >> > > > +	u32 min = e->hwe->eclass->sched_props.timeslice_min;
> >> > > > +	u32 max = e->hwe->eclass->sched_props.timeslice_max;
> >> > >
> >> > > So I screwed this up in my last comment, sorry.
> >> > >
> >> > > min = cap_nice ? KConfig option for min :
> >> > > e->hwe->eclass->sched_props.timeslice_min;
> >> > > same logic for max;
> >> > >
> >> > > Probably add helper for this.
> >> > >
> >> > > > +
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(value, min, max))
> >> > >
> >> > > Then I think logic works too...
> >> > >
> >> > > if cap nice && !Kconfig option to enfore limits
> >> > > (enforce_schedule_limit
> >> returns false):
> >> > > 	user can do whatever
> >> > > else if cap nice
> >> > > 	user within Kconfig range
> >> > > else
> >> > > 	user within sysfs range
> >> > >
> >> > > This is what we want in the end.
> >> > >
> >> >
> >> > Matt, I think this is exactly what the previous revision of the
> >> > patch was
> >> doing.
> >> >
> >>
> >> No... Previous version
> >>
> >> min = Kconfig option to enfore limits ? KConfig option for min :
> >> e->hwe-
> >> >eclass->sched_props.timeslice_min
> >> same for max
> >>
> >> Results in:
> >>
> >> if cap nice && !Kconfig option to enfore limits
> >> (enforce_schedule_limit returns false):
> >> 	user can do whatever
> >> else if cap_nice || Kconfig option to enfore limit
> >> 	user within Kconfig range
> >> else
> >>  	user within sysfs range
> >>
> >> !cap_nice && Kconfig option to enfore limits == user within Kconfig
> >> range
> >> (previously) !cap_nice && Kconfig option to enfore limits == user
> >> within sysfs range (my suggestion)
> >
> >In previous version we can achieve it by just adding one below check right?
> >#if IS_ENABLED(CONFIG_DRM_XE_ENABLE_SCHEDTIMEOUT_LIMIT) &&
> cap_nice
> >+       u32 min = XE_HW_ENGINE_TIMESLICE_MIN;
> >+       u32 max = XE_HW_ENGINE_TIMESLICE_MAX; #else
> >+       u32 min = e->hwe->eclass->sched_props.timeslice_min;
> >+       u32 max = e->hwe->eclass->sched_props.timeslice_max;
> >+#endif
> >
> >Please let me know!
> >
> 
> Ah I see. Yah, it is a limits selection issue.
> We can do,
> 
> u32 min = e->hwe->eclass->sched_props.timeslice_min;
> u32 max = e->hwe->eclass->sched_props.timeslice_max;
> 
> #if IS_ENABLED(CONFIG_DRM_XE_ENABLE_SCHEDTIMEOUT_LIMIT)
>          if (cap_nice) {
>                  u32 min = XE_HW_ENGINE_TIMESLICE_MIN;
>                  u32 max = XE_HW_ENGINE_TIMESLICE_MAX;
>          }
> #endif
> 
> Matt, Tejas
> Also, looks like you missed my another comment below.
> 
> Niranjana
> 
> >>
> >> The logic is different and I believe we want what I'm suggesting.
> >>
> >> Matt
> >>
> >> > > > +		return -EINVAL;
> >> > > >
> >> > > >  	return e->ops->set_timeslice(e, value);  } @@ -201,8 +206,12
> >> > > > @@ static int engine_set_preemption_timeout(struct xe_device *xe,
> >> > > >  					 struct xe_engine *e, u64
> value,
> >> > > >  					 bool create)
> >> > > >  {
> >> > > > -	if (!capable(CAP_SYS_NICE))
> >> > > > -		return -EPERM;
> >> > > > +	u32 min = e->hwe->eclass-
> >sched_props.preempt_timeout_min;
> >> > > > +	u32 max = e->hwe->eclass-
> >sched_props.preempt_timeout_max;
> >> > > > +
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(value, min, max))
> >> > > > +		return -EINVAL;
> >> > > >
> >> > > >  	return e->ops->set_preempt_timeout(e, value);  } @@ -
> 266,11
> >> > > > +275,15 @@ static int engine_set_persistence(struct xe_device
> >> > > > +*xe,
> >> > > > struct xe_engine *e,  static int engine_set_job_timeout(struct
> >> > > > xe_device
> >> *xe, struct xe_engine *e,
> >> > > >  				  u64 value, bool create)
> >> > > >  {
> >> > > > +	u32 min = e->hwe->eclass->sched_props.job_timeout_min;
> >> > > > +	u32 max = e->hwe->eclass->sched_props.job_timeout_max;
> >> > > > +
> >> > > >  	if (XE_IOCTL_DBG(xe, !create))
> >> > > >  		return -EINVAL;
> >> > > >
> >> > > > -	if (!capable(CAP_SYS_NICE))
> >> > > > -		return -EPERM;
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(value, min, max))
> >> > > > +		return -EINVAL;
> >> > > >
> >> > > >  	return e->ops->set_job_timeout(e, value);  } diff --git
> >> > > > a/drivers/gpu/drm/xe/xe_hw_engine.c
> >> > > > b/drivers/gpu/drm/xe/xe_hw_engine.c
> >> > > > index afa7d25c3852..e601bffe3b13 100644
> >> > > > --- a/drivers/gpu/drm/xe/xe_hw_engine.c
> >> > > > +++ b/drivers/gpu/drm/xe/xe_hw_engine.c
> >> > > > @@ -364,8 +364,16 @@ static void hw_engine_init_early(struct
> >> > > > xe_gt *gt, struct xe_hw_engine *hwe,
> >> > > >
> >> > > >  	if (!gt->eclass[hwe->class].sched_props.job_timeout_ms) {
> >> > > >  		gt->eclass[hwe->class].sched_props.job_timeout_ms
> = HZ *
> >> 5;
> >> > > > +		gt->eclass[hwe-
> >class].sched_props.job_timeout_min =
> >> XE_HW_ENGINE_JOB_TIMEOUT_MIN;
> >> > > > +		gt->eclass[hwe-
> >class].sched_props.job_timeout_max =
> >> > > > +XE_HW_ENGINE_JOB_TIMEOUT_MAX;
> >> > > >  		gt->eclass[hwe->class].sched_props.timeslice_us = 1
> * 1000;
> >> > > > +		gt->eclass[hwe->class].sched_props.timeslice_min =
> >> XE_HW_ENGINE_TIMESLICE_MIN;
> >> > > > +		gt->eclass[hwe->class].sched_props.timeslice_max =
> >> > > > +XE_HW_ENGINE_TIMESLICE_MAX;
> >> > > >  		gt->eclass[hwe-
> >class].sched_props.preempt_timeout_us =
> >> 640 *
> >> > > > 1000;
> >> > > > +		gt->eclass[hwe-
> >class].sched_props.preempt_timeout_min =
> >> > > > +
> >> 	XE_HW_ENGINE_PREEMPT_TIMEOUT_MIN;
> >> > > > +		gt->eclass[hwe-
> >class].sched_props.preempt_timeout_max =
> >> > > > +
> >> 	XE_HW_ENGINE_PREEMPT_TIMEOUT_MAX;
> >> > > >  		/* Record default props */
> >> > > >  		gt->eclass[hwe->class].defaults = gt->eclass[hwe-
> >> >class].sched_props;
> >> > > >  	}
> >> > > > diff --git a/drivers/gpu/drm/xe/xe_hw_engine.h
> >> > > > b/drivers/gpu/drm/xe/xe_hw_engine.h
> >> > > > index 7eca9d53c7b1..3d37d6d44261 100644
> >> > > > --- a/drivers/gpu/drm/xe/xe_hw_engine.h
> >> > > > +++ b/drivers/gpu/drm/xe/xe_hw_engine.h
> >> > > > @@ -10,6 +10,37 @@
> >> > > >
> >> > > >  struct drm_printer;
> >> > > >
> >> > > > +#ifdef CONFIG_DRM_XE_JOB_TIMEOUT_MIN #define
> >> > > > +XE_HW_ENGINE_JOB_TIMEOUT_MIN
> >> CONFIG_DRM_XE_JOB_TIMEOUT_MIN #else
> >> > > > +#define XE_HW_ENGINE_JOB_TIMEOUT_MIN 1 #endif #ifdef
> >> > > > +CONFIG_DRM_XE_JOB_TIMEOUT_MAX #define
> >> > > > +XE_HW_ENGINE_JOB_TIMEOUT_MAX
> >> CONFIG_DRM_XE_JOB_TIMEOUT_MAX #else
> >> > > > +#define XE_HW_ENGINE_JOB_TIMEOUT_MAX (10 * 1000) #endif
> #ifdef
> >> > > > +CONFIG_DRM_XE_TIMESLICE_MIN #define
> >> XE_HW_ENGINE_TIMESLICE_MIN
> >> > > > +CONFIG_DRM_XE_TIMESLICE_MIN #else #define
> >> > > > +XE_HW_ENGINE_TIMESLICE_MIN 1 #endif #ifdef
> >> > > > +CONFIG_DRM_XE_TIMESLICE_MAX #define
> >> XE_HW_ENGINE_TIMESLICE_MAX
> >> > > > +CONFIG_DRM_XE_TIMESLICE_MAX #else #define
> >> > > > +XE_HW_ENGINE_TIMESLICE_MAX (10 * 1000 * 1000) #endif #ifdef
> >> > > > +CONFIG_DRM_XE_PREEMPT_TIMEOUT_MIN
> >> > > > +#define XE_HW_ENGINE_PREEMPT_TIMEOUT_MIN
> >> > > > +CONFIG_DRM_XE_PREEMPT_TIMEOUT_MIN #else #define
> >> > > > +XE_HW_ENGINE_PREEMPT_TIMEOUT_MIN 1 #endif #ifdef
> >> > > > +CONFIG_DRM_XE_PREEMPT_TIMEOUT_MAX
> >> > > > +#define XE_HW_ENGINE_PREEMPT_TIMEOUT_MAX
> >> > > > +CONFIG_DRM_XE_PREEMPT_TIMEOUT_MAX #else #define
> >> > > > +XE_HW_ENGINE_PREEMPT_TIMEOUT_MAX (10 * 1000 * 1000)
> #endif
> >> > > > +
> >> > > >  int xe_hw_engines_init_early(struct xe_gt *gt);  int
> >> > > > xe_hw_engines_init(struct xe_gt *gt);  void
> >> > > > xe_hw_engine_handle_irq(struct xe_hw_engine *hwe, u16
> >> > > > intr_vec); diff --git
> >> > > > a/drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.c
> >> > > > b/drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.c
> >> > > > index 990bb675d1e0..2b7ac4e02db6 100644
> >> > > > --- a/drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.c
> >> > > > +++ b/drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.c
> >> > > > @@ -11,6 +11,20 @@
> >> > > >
> >> > > >  static int xe_add_hw_engine_class_defaults(struct kobject
> >> > > > *parent);
> >> > > >
> >> > > > +bool enforce_schedule_limit(void) { #if
> >> > > > +IS_ENABLED(CONFIG_DRM_XE_ENABLE_SCHEDTIMEOUT_LIMIT)
> >> > > > +	return true;
> >> > > > +#else
> >> > > > +	return !capable(CAP_SYS_NICE); #endif }
> >> > > > +
> >> > > > +bool engine_timeout_in_range(u64 timeout, u64 min, u64 max) {
> >> > > > +	return timeout >= min && timeout <= max; }
> >> > > > +
> >> > > >  static void kobj_xe_hw_engine_release(struct kobject *kobj)  {
> >> > > >  	kfree(kobj);
> >> > > > @@ -39,11 +53,92 @@ kobj_xe_hw_engine_class(struct kobject
> >> *parent, char *name)
> >> > > >  	return keclass;
> >> > > >  }
> >> > > >
> >> > > > +static ssize_t job_timeout_max_store(struct kobject *kobj,
> >> > > > +				     struct kobj_attribute *attr,
> >> > > > +				     const char *buf, size_t count) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> kobj_to_eclass(kobj);
> >> > > > +	u32 timeout;
> >> > > > +	int err;
> >> > > > +
> >> > > > +	err = kstrtou32(buf, 0, &timeout);
> >> > > > +	if (err)
> >> > > > +		return err;
> >> > > > +
> >> > > > +	if (timeout < eclass->sched_props.job_timeout_min)
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(timeout,
> >> > > > +
> XE_HW_ENGINE_JOB_TIMEOUT_MIN,
> >> > > > +
> XE_HW_ENGINE_JOB_TIMEOUT_MAX))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	WRITE_ONCE(eclass->sched_props.job_timeout_max,
> timeout);
> >> > > > +
> >> > > > +	return count;
> >> > > > +}
> >> > > > +
> >> > > > +static ssize_t job_timeout_max_show(struct kobject *kobj,
> >> > > > +				    struct kobj_attribute *attr, char
> *buf) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->sched_props.job_timeout_max);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute job_timeout_max_attr =
> >> > > > +__ATTR(job_timeout_max, 0644, job_timeout_max_show,
> >> > > > +job_timeout_max_store);
> >> > > > +
> >> > > > +static ssize_t job_timeout_min_store(struct kobject *kobj,
> >> > > > +				     struct kobj_attribute *attr,
> >> > > > +				     const char *buf, size_t count) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> kobj_to_eclass(kobj);
> >> > > > +	u32 timeout;
> >> > > > +	int err;
> >> > > > +
> >> > > > +	err = kstrtou32(buf, 0, &timeout);
> >> > > > +	if (err)
> >> > > > +		return err;
> >> > > > +
> >> > > > +	if (timeout > eclass->sched_props.job_timeout_max)
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(timeout,
> >> > > > +
> XE_HW_ENGINE_JOB_TIMEOUT_MIN,
> >> > > > +
> XE_HW_ENGINE_JOB_TIMEOUT_MAX))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	WRITE_ONCE(eclass->sched_props.job_timeout_min,
> timeout);
> >> > > > +
> >> > > > +	return count;
> >> > > > +}
> >> > > > +
> >> > > > +static ssize_t job_timeout_min_show(struct kobject *kobj,
> >> > > > +				    struct kobj_attribute *attr, char
> *buf) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->sched_props.job_timeout_min);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute job_timeout_min_attr =
> >> > > > +__ATTR(job_timeout_min, 0644, job_timeout_min_show,
> >> > > > +job_timeout_min_store);
> >> > > > +
> >> > > >  static ssize_t job_timeout_store(struct kobject *kobj,
> >> > > >  				 struct kobj_attribute *attr,
> >> > > >  				 const char *buf, size_t count)  {
> >> > > >  	struct xe_hw_engine_class_intf *eclass =
> >> > > > kobj_to_eclass(kobj);
> >> > > > +#if IS_ENABLED(CONFIG_DRM_XE_ENABLE_SCHEDTIMEOUT_LIMIT)
> >> > > > +	u32 min = XE_HW_ENGINE_JOB_TIMEOUT_MIN;
> >> > > > +	u32 max = XE_HW_ENGINE_JOB_TIMEOUT_MAX; #else
> >> > > > +	u32 min = e->hwe->eclass->sched_props.job_timeout_min;
> >> > > > +	u32 max = e->hwe->eclass->sched_props.job_timeout_max;
> >> > > > +#endif
> >> > >
> >> > > I don't think we need this, just make sure default is within min
> >> > > / max of the sched_props.
> >> > >
> >> >
> >> > Shouldn't the sched_prop set through sysfs here which applies to
> >> > all future user engines (xe_engine) has the same requirement as the
> >> > sched_prop set directly to user engine (xe_engine) through
> >> > set_properly
> >> ioctl?
> >> > ie., shoudln't the requirement be same in job_timeout_store() and
> >> > engine_set_job_timeout()?

I also think it should apply same to sysfs calls as well. Unless we think sysfs should limit to compile configs only. Since sysfs min/max cab be changed.

Thanks,
Tejas

> >> >
> >> > Niranjana
> >> >
> >> > > Matt
> >> > >
> >> > > >  	u32 timeout;
> >> > > >  	int err;
> >> > > >
> >> > > > @@ -51,6 +146,10 @@ static ssize_t job_timeout_store(struct
> >> > > > kobject
> >> *kobj,
> >> > > >  	if (err)
> >> > > >  		return err;
> >> > > >
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(timeout, min, max))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > >  	WRITE_ONCE(eclass->sched_props.job_timeout_ms,
> timeout);
> >> > > >
> >> > > >  	return count;
> >> > > > @@ -78,11 +177,40 @@ static ssize_t job_timeout_default(struct
> >> > > > kobject *kobj,  static struct kobj_attribute job_timeout_def =
> >> > > > __ATTR(job_timeout_ms, 0444, job_timeout_default, NULL);
> >> > > >
> >> > > > +static ssize_t job_timeout_min_default(struct kobject *kobj,
> >> > > > +				       struct kobj_attribute *attr, char
> *buf) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj->parent);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->defaults.job_timeout_min);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute job_timeout_min_def =
> >> > > > +__ATTR(job_timeout_min, 0444, job_timeout_min_default, NULL);
> >> > > > +
> >> > > > +static ssize_t job_timeout_max_default(struct kobject *kobj,
> >> > > > +				       struct kobj_attribute *attr, char
> *buf) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj->parent);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->defaults.job_timeout_max);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute job_timeout_max_def =
> >> > > > +__ATTR(job_timeout_max, 0444, job_timeout_max_default, NULL);
> >> > > > +
> >> > > >  static ssize_t timeslice_duration_store(struct kobject *kobj,
> >> > > >  					struct kobj_attribute *attr,
> >> > > >  					const char *buf, size_t count)
> {
> >> > > >  	struct xe_hw_engine_class_intf *eclass =
> >> > > > kobj_to_eclass(kobj);
> >> > > > +#if IS_ENABLED(CONFIG_DRM_XE_ENABLE_SCHEDTIMEOUT_LIMIT)
> >> > > > +	u32 min = XE_HW_ENGINE_TIMESLICE_MIN;
> >> > > > +	u32 max = XE_HW_ENGINE_TIMESLICE_MAX; #else
> >> > > > +	u32 min = e->hwe->eclass->sched_props.timeslice_min;
> >> > > > +	u32 max = e->hwe->eclass->sched_props.timeslice_max;
> >> > > > +#endif
> >> > > >  	u32 duration;
> >> > > >  	int err;
> >> > > >
> >> > > > @@ -90,11 +218,92 @@ static ssize_t
> >> > > > timeslice_duration_store(struct
> >> kobject *kobj,
> >> > > >  	if (err)
> >> > > >  		return err;
> >> > > >
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(duration, min, max))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > >  	WRITE_ONCE(eclass->sched_props.timeslice_us, duration);
> >> > > >
> >> > > >  	return count;
> >> > > >  }
> >> > > >
> >> > > > +static ssize_t timeslice_duration_max_store(struct kobject *kobj,
> >> > > > +					    struct kobj_attribute *attr,
> >> > > > +					    const char *buf, size_t
> count) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> kobj_to_eclass(kobj);
> >> > > > +	u32 duration;
> >> > > > +	int err;
> >> > > > +
> >> > > > +	err = kstrtou32(buf, 0, &duration);
> >> > > > +	if (err)
> >> > > > +		return err;
> >> > > > +
> >> > > > +	if (duration < eclass->sched_props.timeslice_min)
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(duration,
> >> > > > +				     XE_HW_ENGINE_TIMESLICE_MIN,
> >> > > > +
> XE_HW_ENGINE_TIMESLICE_MAX))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	WRITE_ONCE(eclass->sched_props.timeslice_max, duration);
> >> > > > +
> >> > > > +	return count;
> >> > > > +}
> >> > > > +
> >> > > > +static ssize_t timeslice_duration_max_show(struct kobject *kobj,
> >> > > > +					   struct kobj_attribute *attr,
> >> > > > +					   char *buf)
> >> > > > +{
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->sched_props.timeslice_max);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute timeslice_duration_max_attr =
> >> > > > +	__ATTR(timeslice_duration_max, 0644,
> >> timeslice_duration_max_show,
> >> > > > +	       timeslice_duration_max_store);
> >> > > > +
> >> > > > +static ssize_t timeslice_duration_min_store(struct kobject *kobj,
> >> > > > +					    struct kobj_attribute *attr,
> >> > > > +					    const char *buf, size_t
> count) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> kobj_to_eclass(kobj);
> >> > > > +	u32 duration;
> >> > > > +	int err;
> >> > > > +
> >> > > > +	err = kstrtou32(buf, 0, &duration);
> >> > > > +	if (err)
> >> > > > +		return err;
> >> > > > +
> >> > > > +	if (duration > eclass->sched_props.timeslice_max)
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(duration,
> >> > > > +				     XE_HW_ENGINE_TIMESLICE_MIN,
> >> > > > +
> XE_HW_ENGINE_TIMESLICE_MAX))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	WRITE_ONCE(eclass->sched_props.timeslice_min, duration);
> >> > > > +
> >> > > > +	return count;
> >> > > > +}
> >> > > > +
> >> > > > +static ssize_t timeslice_duration_min_show(struct kobject *kobj,
> >> > > > +					   struct kobj_attribute *attr,
> char
> >> *buf) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->sched_props.timeslice_min);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute timeslice_duration_min_attr =
> >> > > > +	__ATTR(timeslice_duration_min, 0644,
> >> timeslice_duration_min_show,
> >> > > > +	       timeslice_duration_min_store);
> >> > > > +
> >> > > >  static ssize_t timeslice_duration_show(struct kobject *kobj,
> >> > > >  		struct kobj_attribute *attr, char *buf)  { @@ -118,11
> >> > > > +327,40 @@ static ssize_t timeslice_default(struct kobject
> >> > > > *kobj,  static struct kobj_attribute timeslice_duration_def =
> >> > > > __ATTR(timeslice_duration_us, 0444, timeslice_default, NULL);
> >> > > >
> >> > > > +static ssize_t timeslice_min_default(struct kobject *kobj,
> >> > > > +				     struct kobj_attribute *attr, char
> *buf) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj->parent);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n", eclass->defaults.timeslice_min);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute timeslice_duration_min_def =
> >> > > > +__ATTR(timeslice_duration_min, 0444, timeslice_min_default,
> >> > > > +NULL);
> >> > > > +
> >> > > > +static ssize_t timeslice_max_default(struct kobject *kobj,
> >> > > > +				     struct kobj_attribute *attr, char
> *buf) {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj->parent);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n", eclass->defaults.timeslice_max);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute timeslice_duration_max_def =
> >> > > > +__ATTR(timeslice_duration_max, 0444, timeslice_max_default,
> >> > > > +NULL);
> >> > > > +
> >> > > >  static ssize_t preempt_timeout_store(struct kobject *kobj,
> >> > > >  				     struct kobj_attribute *attr,
> >> > > >  				     const char *buf, size_t count)  {
> >> > > >  	struct xe_hw_engine_class_intf *eclass =
> >> > > > kobj_to_eclass(kobj);
> >> > > > +#if IS_ENABLED(CONFIG_DRM_XE_ENABLE_SCHEDTIMEOUT_LIMIT)
> >> > > > +	u32 min = XE_HW_ENGINE_PREEMPT_TIMEOUT_MIN;
> >> > > > +	u32 max = XE_HW_ENGINE_PREEMPT_TIMEOUT_MAX; #else
> >> > > > +	u32 min = e->hwe->eclass-
> >sched_props.preempt_timeout_min;
> >> > > > +	u32 max = e->hwe->eclass-
> >sched_props.preempt_timeout_max;
> >> > > > +#endif
> >> > > >  	u32 timeout;
> >> > > >  	int err;
> >> > > >
> >> > > > @@ -130,6 +368,10 @@ static ssize_t
> >> > > > preempt_timeout_store(struct
> >> kobject *kobj,
> >> > > >  	if (err)
> >> > > >  		return err;
> >> > > >
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(timeout, min, max))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > >  	WRITE_ONCE(eclass->sched_props.preempt_timeout_us,
> timeout);
> >> > > >
> >> > > >  	return count;
> >> > > > @@ -158,17 +400,129 @@ static ssize_t
> >> > > > preempt_timeout_default(struct kobject *kobj,  static struct
> >> > > > kobj_attribute preempt_timeout_def =
> >> > > > __ATTR(preempt_timeout_us, 0444, preempt_timeout_default,
> >> > > > NULL);
> >> > > >
> >> > > > +static ssize_t preempt_timeout_min_default(struct kobject *kobj,
> >> > > > +					   struct kobj_attribute *attr,
> >> > > > +					   char *buf)
> >> > > > +{
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj->parent);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->defaults.preempt_timeout_min);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute preempt_timeout_min_def =
> >> > > > +__ATTR(preempt_timeout_min, 0444,
> preempt_timeout_min_default,
> >> > > > +NULL);
> >> > > > +
> >> > > > +static ssize_t preempt_timeout_max_default(struct kobject *kobj,
> >> > > > +					   struct kobj_attribute *attr,
> >> > > > +					   char *buf)
> >> > > > +{
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj->parent);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->defaults.preempt_timeout_max);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute preempt_timeout_max_def =
> >> > > > +__ATTR(preempt_timeout_max, 0444,
> >> preempt_timeout_max_default,
> >> > > > +NULL);
> >> > > > +
> >> > > > +static ssize_t preempt_timeout_max_store(struct kobject *kobj,
> >> > > > +					 struct kobj_attribute *attr,
> >> > > > +					 const char *buf, size_t count)
> {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> kobj_to_eclass(kobj);
> >> > > > +	u32 timeout;
> >> > > > +	int err;
> >> > > > +
> >> > > > +	err = kstrtou32(buf, 0, &timeout);
> >> > > > +	if (err)
> >> > > > +		return err;
> >> > > > +
> >> > > > +	if (timeout < eclass->sched_props.preempt_timeout_min)
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(timeout,
> >> > > > +
> >> XE_HW_ENGINE_PREEMPT_TIMEOUT_MIN,
> >> > > > +
> >> XE_HW_ENGINE_PREEMPT_TIMEOUT_MAX))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	WRITE_ONCE(eclass->sched_props.preempt_timeout_max,
> timeout);
> >> > > > +
> >> > > > +	return count;
> >> > > > +}
> >> > > > +
> >> > > > +static ssize_t preempt_timeout_max_show(struct kobject *kobj,
> >> > > > +					struct kobj_attribute *attr,
> char *buf)
> >> {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->sched_props.preempt_timeout_max);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute preempt_timeout_max_attr =
> >> > > > +	__ATTR(preempt_timeout_max, 0644,
> preempt_timeout_max_show,
> >> > > > +	       preempt_timeout_max_store);
> >> > > > +
> >> > > > +static ssize_t preempt_timeout_min_store(struct kobject *kobj,
> >> > > > +					 struct kobj_attribute *attr,
> >> > > > +					 const char *buf, size_t count)
> {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> kobj_to_eclass(kobj);
> >> > > > +	u32 timeout;
> >> > > > +	int err;
> >> > > > +
> >> > > > +	err = kstrtou32(buf, 0, &timeout);
> >> > > > +	if (err)
> >> > > > +		return err;
> >> > > > +
> >> > > > +	if (timeout > eclass->sched_props.preempt_timeout_max)
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	if (enforce_schedule_limit() &&
> >> > > > +	    !engine_timeout_in_range(timeout,
> >> > > > +
> >> XE_HW_ENGINE_PREEMPT_TIMEOUT_MIN,
> >> > > > +
> >> XE_HW_ENGINE_PREEMPT_TIMEOUT_MAX))
> >> > > > +		return -EINVAL;
> >> > > > +
> >> > > > +	WRITE_ONCE(eclass->sched_props.preempt_timeout_min,
> timeout);
> >> > > > +
> >> > > > +	return count;
> >> > > > +}
> >> > > > +
> >> > > > +static ssize_t preempt_timeout_min_show(struct kobject *kobj,
> >> > > > +					struct kobj_attribute *attr,
> char *buf)
> >> {
> >> > > > +	struct xe_hw_engine_class_intf *eclass =
> >> > > > +kobj_to_eclass(kobj);
> >> > > > +
> >> > > > +	return sprintf(buf, "%u\n",
> >> > > > +eclass->sched_props.preempt_timeout_min);
> >> > > > +}
> >> > > > +
> >> > > > +static struct kobj_attribute preempt_timeout_min_attr =
> >> > > > +	__ATTR(preempt_timeout_min, 0644,
> preempt_timeout_min_show,
> >> > > > +	       preempt_timeout_min_store);
> >> > > > +
> >> > > >  static const struct attribute *defaults[] = {
> >> > > >  	&job_timeout_def.attr,
> >> > > > +	&job_timeout_min_def.attr,
> >> > > > +	&job_timeout_max_def.attr,
> >> > > >  	&timeslice_duration_def.attr,
> >> > > > +	&timeslice_duration_min_def.attr,
> >> > > > +	&timeslice_duration_max_def.attr,
> >> > > >  	&preempt_timeout_def.attr,
> >> > > > +	&preempt_timeout_min_def.attr,
> >> > > > +	&preempt_timeout_max_def.attr,
> >> > > >  	NULL
> >> > > >  };
> >> > > >
> >> > > >  static const struct attribute *files[] = {
> >> > > >  	&job_timeout_attr.attr,
> >> > > > +	&job_timeout_min_attr.attr,
> >> > > > +	&job_timeout_max_attr.attr,
> >> > > >  	&timeslice_duration_attr.attr,
> >> > > > +	&timeslice_duration_min_attr.attr,
> >> > > > +	&timeslice_duration_max_attr.attr,
> >> > > >  	&preempt_timeout_attr.attr,
> >> > > > +	&preempt_timeout_min_attr.attr,
> >> > > > +	&preempt_timeout_max_attr.attr,
> >> > > >  	NULL
> >> > > >  };
> >> > > >
> >> > > > diff --git a/drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.h
> >> > > > b/drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.h
> >> > > > index 757136614672..2e2ab351a991 100644
> >> > > > --- a/drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.h
> >> > > > +++ b/drivers/gpu/drm/xe/xe_hw_engine_class_sysfs.h
> >> > > > @@ -10,6 +10,8 @@
> >> > > >
> >> > > >  #define MAX_ENGINE_CLASS_NAME_LEN    16
> >> > > >  int xe_hw_engine_class_sysfs_init(struct xe_gt *gt);
> >> > > > +bool enforce_schedule_limit(void); bool
> >> > > > +engine_timeout_in_range(u64 timeout, u64 min, u64 max);
> >> > > >
> >> > > >  /**
> >> > > >   * struct kobj_eclass - A eclass's kobject struct that
> >> > > > connects the kobject and the
> >> > > > --
> >> > > > 2.25.1
> >> > > >


More information about the Intel-xe mailing list