[PATCH 01/14] drm/xe: Document Xe PM component
Vivi, Rodrigo
rodrigo.vivi at intel.com
Thu Feb 22 16:29:50 UTC 2024
On Thu, 2024-02-22 at 15:06 +0000, Gupta, Anshuman wrote:
>
>
> > -----Original Message-----
> > From: Vivi, Rodrigo <rodrigo.vivi at intel.com>
> > Sent: Thursday, February 22, 2024 12:30 AM
> > To: Gupta, Anshuman <anshuman.gupta at intel.com>
> > Cc: Dugast, Francois <francois.dugast at intel.com>; intel-
> > xe at lists.freedesktop.org; Auld, Matthew <matthew.auld at intel.com>
> > Subject: Re: [PATCH 01/14] drm/xe: Document Xe PM component
> >
> > On Wed, Feb 21, 2024 at 09:41:44AM -0500, Gupta, Anshuman wrote:
> > >
> > >
> > > > -----Original Message-----
> > > > From: Dugast, Francois <francois.dugast at intel.com>
> > > > Sent: Wednesday, February 21, 2024 5:38 PM
> > > > To: Vivi, Rodrigo <rodrigo.vivi at intel.com>
> > > > Cc: intel-xe at lists.freedesktop.org; Auld, Matthew
> > > > <matthew.auld at intel.com>; Gupta, Anshuman
> > <anshuman.gupta at intel.com>
> > > > Subject: Re: [PATCH 01/14] drm/xe: Document Xe PM component
> > > >
> > > > On Thu, Feb 15, 2024 at 02:34:17PM -0500, Rodrigo Vivi wrote:
> > > > > Replace outdated information with a proper PM documentation.
> > > > > Already establish the rules for the runtime PM get and put
> > > > > that Xe
> > > > > needs to follow.
> > > > >
> > > > > Also add missing function documentation to all the "exported"
> > functions.
> > > > >
> > > > > v2: updated after Francois' feedback.
> > > > > s/grater/greater (Matt)
> > > > >
> > > > > Cc: Matthew Auld <matthew.auld at intel.com>
> > > > > Cc: Anshuman Gupta <anshuman.gupta at intel.com>
> > > > > Signed-off-by: Rodrigo Vivi <rodrigo.vivi at intel.com>
> > > > > Acked-by: Francois Dugast <francois.dugast at intel.com>
> > > > > ---
> > > > > drivers/gpu/drm/xe/xe_pm.c | 108
> > > > > +++++++++++++++++++++++++++++++++----
> > > > > 1 file changed, 97 insertions(+), 11 deletions(-)
> > > > >
> > > > > diff --git a/drivers/gpu/drm/xe/xe_pm.c
> > > > > b/drivers/gpu/drm/xe/xe_pm.c index ab283e9a8b4e..64ffb9a35448
> > > > > 100644
> > > > > --- a/drivers/gpu/drm/xe/xe_pm.c
> > > > > +++ b/drivers/gpu/drm/xe/xe_pm.c
> > > > > @@ -25,21 +25,46 @@
> > > > > /**
> > > > > * DOC: Xe Power Management
> > > > > *
> > > > > - * Xe PM shall be guided by the simplicity.
> > > > > - * Use the simplest hook options whenever possible.
> > > > > - * Let's not reinvent the runtime_pm references and hooks.
> > > > > - * Shall have a clear separation of display and gt
> > > > > underneath this
> > component.
> > > > > + * Xe PM implements the main routines for both system level
> > > > > + suspend states and
> > > > > + * for the opportunistic runtime suspend states.
> > > > > *
> > > > > - * What's next:
> > > > > + * System Level Suspend (S-States) - In general this is OS
> > > > > + initiated suspend
> > > > > + * driven by ACPI for achieving S0ix (a.k.a. S2idle,
> > > > > freeze), S3
> > > > > + (suspend to ram),
> > > > > + * S4 (disk). The main functions here are `xe_pm_suspend`
> > > > > and
> > > > > + `xe_pm_resume`. They
> > > > > + * are the main point for the suspend to and resume from
> > > > > these
> > states.
> > > > > *
> > > > > - * For now s2idle and s3 are only working in integrated
> > > > > devices.
> > > > > The next step
> > > > > - * is to iterate through all VRAM's BO backing them up into
> > > > > the
> > > > > system memory
> > > > > - * before allowing the system suspend.
> > > > > + * Runtime Suspend (D-States) - This is the opportunistic
> > > > > PCIe
> > > > > + device low power
> > > > > + * state D3. Xe PM component provides
> > > > > `xe_pm_runtime_suspend` and
> > > > > + * `xe_pm_runtime_resume` systems that PCI subsystem will
> > > > > call
> > > > > + before transition
> > > > > + * to D3. Also, Xe PM provides get and put functions that Xe
> > > > > + driver will use to
> > > > > + * indicate activity. In order to avoid locking
> > > > > complications
> > > > > + with the memory
> > > > > + * management, whenever possible, these get and put
> > > > > functions
> > > > > + needs to be called
> > > > > + * from the higher/outer levels.
> > > > > *
> > > > > - * Also runtime_pm needs to be here from the beginning.
> > > > > + * The main cases that need to be protected from the outer
> > > > > levels
> > > > > + are: IOCTL,
> > > > > + * sysfs, debugfs, dma-buf sharing, GPU execution.
> > > > > *
> > > > > - * RC6/RPS are also critical PM features. Let's start with
> > > > > GuCRC
> > > > > and GuC SLPC
> > > > > - * and no wait boost. Frequency optimizations should come on
> > > > > a next
> > stage.
> > > > > + * PCI D3 is special and can mean D3hot, where Vcc power is
> > > > > on
> > > > > + for keeping memory
> > > > > + * alive and quicker low latency resume or D3Cold where Vcc
> > > > > power
> > > > > + is off for
> > > > > + * better power savings.
> > > > > + * The Vcc control of PCI hierarchy can only be controlled
> > > > > at the
> > > > > + PCI root port
> > > > > + * level, while the device driver can be behind multiple
> > > > > + bridges/switches and
> > > > > + * paired with other devices. For this reason, the PCI
> > > > > subsystem
> > > > > + cannot perform
> > > > > + * the transition towards D3Cold. The lowest runtime PM
> > > > > possible
> > > > > + from the PCI
> > > > > + * subsystem is D3hot. Then, if all these paired devices in
> > > > > the
> > > > > + same root port
> > > > > + * are in D3hot, ACPI will assist here and run its own
> > > > > methods
> > > > > + (_PR3 and _OFF)
> > > > > + * to perform the transition from D3hot to D3cold. Xe may
> > > > > + disallow this
> > > > > + * transition by calling pci_d3cold_disable(root_pdev)
> > > > > before
> > > > > + going to runtime
> > > > > + * suspend. It will be based on runtime conditions such as
> > > > > VRAM
> > > > > + usage for a
> > > > > + * quick and low latency resume for instance.
> > > > > + *
> > > > > + * Intel systems are capable of taking the system to S0ix
> > > > > when
> > > > > + devices are on
> > > > > + * D3hot through the runtime PM. This is also called as
> > > > > 'opportunistic-
> > S0iX'.
> > > > > + * But in this case, the `xe_pm_suspend` and `xe_pm_resume`
> > > > > won't
> > > > > + be called for
> > > > > + * S0ix.
> > > Hi Rodrigo,
> > > Sorry of late review comment, was busy with other stuff.
> >
> > no worries. Thank you so much for jumping in. Really appreciated.
> >
> > > we do need modify the doc slightly.
> >
> > indeed... I also realized that to avoid confusion we need to detach
> > the D3.
> > In our integrated devices it is power important for the package-C
> > states then
> > for 'D3' itself. One might look at our PCI config for integrated
> > and see D3hot-
> > and get confused.
> >
> > > For integrated graphics D3hot is not necessary to achieve for the
> > > host
> > > s0ix, like with PSR panel PCI device will be in D0 state as KMS
> > > CRTC will be
> > active and will held a device wakeref.
> >
> > Then let's also detach the opportunistic s0ix from the runtime_pm
> > with this
> > PSR case in mind.
> >
> > >
> > > AFAIK for discrete card perspective, host s0ix requires PCIe link
> > > to be in low
> > power state.
> >
> > yeap, but that's outside of our scope so let's minimize the doc to
> > the calls
> > around the functions that we are implementing here.
> >
> > So, what do you think of this diff between what I had here in this
> > patch and
> > what I'm seending soon on a v2:
> IMO let's keep the documentation limited to PCI PM D3 state, s0ix
> does not belongs here,
> gtidle component should document the s0ix dependency and display C
> states doc should
> document the runtime opportunistic s0ix with PSR. Xe_pm is not
> correct component.
okay then. I can simply remove this last block that explains the
opportunistic S0iX.
Although this is a confusion that we have to keep answering over and
over about these callbacks relation to S0ix.
But I agree on keeping it simple.
> Thanks,
> Anshuman Gupta.
> >
> > --- a/drivers/gpu/drm/xe/xe_pm.c
> > +++ b/drivers/gpu/drm/xe/xe_pm.c
> > @@ -33,17 +33,9 @@
> > * S4 (disk). The main functions here are `xe_pm_suspend` and
> > `xe_pm_resume`. They
> > * are the main point for the suspend to and resume from these
> > states.
> > *
> > - * Runtime Suspend (D-States) - This is the opportunistic PCIe
> > device low
> > power
> > - * state D3. Xe PM component provides `xe_pm_runtime_suspend` and
> > - * `xe_pm_runtime_resume` systems that PCI subsystem will call
> > before
> > transition
> > - * to D3. Also, Xe PM provides get and put functions that Xe
> > driver will use
> > to
> > - * indicate activity. In order to avoid locking complications with
> > the memory
> > - * management, whenever possible, these get and put functions
> > needs to be
> > called
> > - * from the higher/outer levels.
> > - *
> > - * The main cases that need to be protected from the outer levels
> > are:
> > IOCTL,
> > - * sysfs, debugfs, dma-buf sharing, GPU execution.
> > - *
> > + * PCI Device Suspend (D-States) - This is the opportunistic PCIe
> > + device low power
> > + * state D3, controlled by the PCI subsystem and ACPI with the
> > help
> > + from the
> > + * runtime_pm infrastructure.
> > * PCI D3 is special and can mean D3hot, where Vcc power is on for
> > keeping
> > memory
> > * alive and quicker low latency resume or D3Cold where Vcc power
> > is off for
> > * better power savings.
> > @@ -58,12 +50,28 @@
> > * suspend. It will be based on runtime conditions such as VRAM
> > usage for a
> > * quick and low latency resume for instance.
> > *
> > - * Intel systems are capable of taking the system to S0ix when
> > devices are on
> > - * D3hot through the runtime PM. This is also called as
> > 'opportunistic-S0iX'.
> > - * But in this case, the `xe_pm_suspend` and `xe_pm_resume` won't
> > be
> > called for
> > - * S0ix.
> > + * Runtime PM - This infrastructure provided by the Linux kernel
> > allows
> > + the
> > + * device drivers to indicate when the can be runtime suspended,
> > so the
> > + device
> > + * could be put at D3 (if supported), or allow deeper package
> > sleep
> > + states
> > + * (PC-states), and/or other low level power states. Xe PM
> > component
> > + provides
> > + * `xe_pm_runtime_suspend` and `xe_pm_runtime_resume` functions
> > that
> > + PCI
> > + * subsystem will call before transition to/from runtime suspend.
> > + *
> > + * Also, Xe PM provides get and put functions that Xe driver will
> > use
> > + to
> > + * indicate activity. In order to avoid locking complications with
> > the
> > + memory
> > + * management, whenever possible, these get and put functions
> > needs to
> > + be called
> > + * from the higher/outer levels.
> > + * The main cases that need to be protected from the outer levels
> > are:
> > + IOCTL,
> > + * sysfs, debugfs, dma-buf sharing, GPU execution.
> > + *
> > + * Intel systems are capable of taking the system to S0ix when all
> > + certain
> > + * conditions are met. This is also called as 'opportunistic-
> > S0iX'.
> > + * But in this case, the `xe_pm_suspend` and `xe_pm_resume` won't
> > be
> > + called
> > + * for S0ix. In certain cases, like when Display Panel Self-
> > Refresh
> > + (eDP PSR) is
> > + * active, not even `xe_pm_runtime_suspend` and
> > `xe_pm_runtime_resume`
> > + will be
> > + * called.
> > *
> > - * This component is no responsible for GT idleness (RC6) nor GT
> > frequency
> > + * This component is not responsible for GT idleness (RC6) nor GT
> > + frequency
> > * management (RPS).
> > */
> >
> > > Thanks,
> > > Anshuman Gupta.
> > > > > + *
> > > > > + * This component is no responsible for GT idleness (RC6)
> > > > > nor GT
> > > > > + frequency
> > > >
> > > > Isn't it s/no/not/? Or s/no/neither/ + s/nor/nor for/? In any
> > > > case:
> > > >
> > > > Reviewed-by: Francois Dugast <francois.dugast at intel.com>
> > > >
> > > > Francois
> > > >
> > > > > + * management (RPS).
> > > > > */
> > > > >
> > > > > /**
> > > > > @@ -178,6 +203,12 @@ void xe_pm_init_early(struct xe_device
> > > > > *xe)
> > > > > drmm_mutex_init(&xe->drm, &xe-
> > > > > >mem_access.vram_userfault.lock);
> > > > > }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_init - Initialize Xe Power Management
> > > > > + * @xe: xe device instance
> > > > > + *
> > > > > + * This component is responsible for System and Device sleep
> > > > > states.
> > > > > + */
> > > > > void xe_pm_init(struct xe_device *xe) {
> > > > > /* For now suspend/resume is only allowed with GuC */ @@ -
> > > > > 196,6
> > > > > +227,10 @@ void xe_pm_init(struct xe_device *xe)
> > > > > xe_pm_runtime_init(xe);
> > > > > }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_runtime_fini - Finalize Runtime PM
> > > > > + * @xe: xe device instance
> > > > > + */
> > > > > void xe_pm_runtime_fini(struct xe_device *xe) {
> > > > > struct device *dev = xe->drm.dev; @@ -225,6 +260,12 @@
> > > > > struct
> > > > > task_struct *xe_pm_read_callback_task(struct
> > > > xe_device *xe)
> > > > > return READ_ONCE(xe->pm_callback_task); }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_runtime_suspend - Prepare our device for
> > > > > D3hot/D3Cold
> > > > > + * @xe: xe device instance
> > > > > + *
> > > > > + * Returns 0 for success, negative error code otherwise.
> > > > > + */
> > > > > int xe_pm_runtime_suspend(struct xe_device *xe) {
> > > > > struct xe_bo *bo, *on;
> > > > > @@ -290,6 +331,12 @@ int xe_pm_runtime_suspend(struct
> > > > > xe_device
> > *xe)
> > > > > return err;
> > > > > }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_runtime_resume - Waking up from D3hot/D3Cold
> > > > > + * @xe: xe device instance
> > > > > + *
> > > > > + * Returns 0 for success, negative error code otherwise.
> > > > > + */
> > > > > int xe_pm_runtime_resume(struct xe_device *xe) {
> > > > > struct xe_gt *gt;
> > > > > @@ -341,22 +388,47 @@ int xe_pm_runtime_resume(struct
> > > > > xe_device
> > *xe)
> > > > > return err;
> > > > > }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_runtime_get - Get a runtime_pm reference and resume
> > > > > +synchronously
> > > > > + * @xe: xe device instance
> > > > > + *
> > > > > + * Returns: Any number greater than or equal to 0 for
> > > > > success,
> > > > > +negative error
> > > > > + * code otherwise.
> > > > > + */
> > > > > int xe_pm_runtime_get(struct xe_device *xe) {
> > > > > return pm_runtime_get_sync(xe->drm.dev); }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_runtime_put - Put the runtime_pm reference back and
> > > > > mark
> > > > > +as idle
> > > > > + * @xe: xe device instance
> > > > > + *
> > > > > + * Returns: Any number greater than or equal to 0 for
> > > > > success,
> > > > > +negative error
> > > > > + * code otherwise.
> > > > > + */
> > > > > int xe_pm_runtime_put(struct xe_device *xe) {
> > > > > pm_runtime_mark_last_busy(xe->drm.dev);
> > > > > return pm_runtime_put(xe->drm.dev); }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_runtime_get_if_active - Get a runtime_pm reference
> > > > > if
> > > > > +device active
> > > > > + * @xe: xe device instance
> > > > > + *
> > > > > + * Returns: Any number greater than or equal to 0 for
> > > > > success,
> > > > > +negative error
> > > > > + * code otherwise.
> > > > > + */
> > > > > int xe_pm_runtime_get_if_active(struct xe_device *xe) {
> > > > > return pm_runtime_get_if_active(xe->drm.dev, true); }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_assert_unbounded_bridge - Disable PM on unbounded
> > > > > pcie
> > > > > +parent bridge
> > > > > + * @xe: xe device instance
> > > > > + */
> > > > > void xe_pm_assert_unbounded_bridge(struct xe_device *xe) {
> > > > > struct pci_dev *pdev = to_pci_dev(xe->drm.dev); @@ -371,6
> > > > > +443,13
> > > > @@
> > > > > void xe_pm_assert_unbounded_bridge(struct xe_device *xe)
> > > > > }
> > > > > }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_set_vram_threshold - Set a vram threshold for
> > > > > +allowing/blocking D3Cold
> > > > > + * @xe: xe device instance
> > > > > + * @threshold: VRAM size in bites for the D3cold threshold
> > > > > + *
> > > > > + * Returns 0 for success, negative error code otherwise.
> > > > > + */
> > > > > int xe_pm_set_vram_threshold(struct xe_device *xe, u32
> > > > > threshold) {
> > > > > struct ttm_resource_manager *man; @@ -395,6 +474,13 @@ int
> > > > > xe_pm_set_vram_threshold(struct xe_device *xe,
> > > > u32 threshold)
> > > > > return 0;
> > > > > }
> > > > >
> > > > > +/**
> > > > > + * xe_pm_d3cold_allowed_toggle - Check conditions to toggle
> > > > > +d3cold.allowed
> > > > > + * @xe: xe device instance
> > > > > + *
> > > > > + * To be called during runtime_pm idle callback.
> > > > > + * Check for all the D3Cold conditions ahead of runtime
> > > > > suspend.
> > > > > + */
> > > > > void xe_pm_d3cold_allowed_toggle(struct xe_device *xe) {
> > > > > struct ttm_resource_manager *man;
> > > > > --
> > > > > 2.43.0
> > > > >
More information about the Intel-xe
mailing list