[Intel-gfx] [PATCH 1/2] drm/i915/guc: fix GuC suspend/resume
Daniele Ceraolo Spurio
daniele.ceraolospurio at intel.com
Tue Oct 16 16:35:42 UTC 2018
On 10/16/2018 3:26 AM, Michal Wajdeczko wrote:
> On Tue, 16 Oct 2018 00:10:08 +0200, Daniele Ceraolo Spurio
> <daniele.ceraolospurio at intel.com> wrote:
>
>> The ENTER/EXIT_S_STATE actions queue the save/restore operation in GuC
>> FW and then return, so waiting on the H2H is not enough to guarantee
>> GuC is done.
>> When all the processing is done, GuC writes 0 to scratch register 14,
>> so we can poll on that. Note that GuC does not ensure that the value
>> in the register is different from 0 while the action is in progress
>> so we need to take care of that ourselves as well.
>>
>> Cc: Chris Wilson <chris at chris-wilson.co.uk>
>> Cc: Michal Wajdeczko <michal.wajdeczko at intel.com>
>> Signed-off-by: Daniele Ceraolo Spurio <daniele.ceraolospurio at intel.com>
>> ---
>> drivers/gpu/drm/i915/intel_guc.c | 28 +++++++++++++++++++++++++--
>> drivers/gpu/drm/i915/intel_guc_fwif.h | 6 ++++++
>> 2 files changed, 32 insertions(+), 2 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/i915/intel_guc.c
>> b/drivers/gpu/drm/i915/intel_guc.c
>> index 230aea69385d..f238cd7a9dcf 100644
>> --- a/drivers/gpu/drm/i915/intel_guc.c
>> +++ b/drivers/gpu/drm/i915/intel_guc.c
>> @@ -521,6 +521,30 @@ int intel_guc_auth_huc(struct intel_guc *guc,
>> u32 rsa_offset)
>> return intel_guc_send(guc, action, ARRAY_SIZE(action));
>> }
>> +/*
>> + * The ENTER/EXIT_S_STATE actions queue the save/restore operation
>> in GuC FW and
>> + * then return, so waiting on the H2H is not enough to guarantee GuC
>> is done.
>> + * When all the processing is done, GuC writes 0 to scratch register
>> 14, so we
>
> s/writes 0/writes INTEL_GUC_SLEEP_STATE_SUCCESS ?
>
>> + * can poll on that. Note that GuC does not ensure that the value in
>> the
>> + * register is different from 0 while the action is in progress so
>> we need to
>> + * take care of that ourselves as well.
>> + */
>> +static int guc_sleep_state_action(struct intel_guc *guc,
>> + const u32 *action, u32 len)
>> +{
>> + struct drm_i915_private *dev_priv = guc_to_i915(guc);
>> + int ret;
>> +
>> + I915_WRITE(SOFT_SCRATCH(14), ~0x0);
>
> Do we want to add dedicated name for that scratch register?
As I replied on your patch, the register is used for other purposes in
other actions so I don't think it is a good idea to rename it.
>
> Also, as GuC is using scratch[14] then we need to remove it from our send
> register pool - patch is here [1]
>
>> +
>> + ret = intel_guc_send(guc, action, len);
>> + if (ret)
>> + return ret;
>> +
>> + return intel_wait_for_register(dev_priv, SOFT_SCRATCH(14), ~0x0,
>> + INTEL_GUC_SLEEP_STATE_SUCCESS, 10);
>
> Maybe it is worth to use __intel_wait_for_register() and print sleep
> state error code in case of failure ? And do we really need to wait
> full 10ms for SUCCESS if GuC already has reported PREEMPT_TO_IDLE_FAILED
> or ENGINE_RESET_FAILED ?
>
> u32 state;
>
> ret = __intel_wait_for_register(dev_priv, SOFT_SCRATCH(14),
> 0x80000000,
> 0, 10, &state);
> if (ret)
> return ret;
> if (status != INTEL_GUC_SLEEP_STATE_SUCCESS) {
> DRM_ERROR("... %u\n", state);
> return -EIO;
> }
> return 0;
>
It should be impossible for us to get PREEMPT_TO_IDLE_FAILED or
ENGINE_RESET_FAILED because those only come in play if guc_suspend() is
called while there are outstanding request inside GuC. However, no real
downsides in going with your solution either so I'll do the changes ;)
>> +}
>> +
>> /**
>> * intel_guc_suspend() - notify GuC entering suspend state
>> * @guc: the guc
>> @@ -533,7 +557,7 @@ int intel_guc_suspend(struct intel_guc *guc)
>> intel_guc_ggtt_offset(guc, guc->shared_data)
>> };
>> - return intel_guc_send(guc, data, ARRAY_SIZE(data));
>> + return guc_sleep_state_action(guc, data, ARRAY_SIZE(data));
>> }
>> /**
>> @@ -571,7 +595,7 @@ int intel_guc_resume(struct intel_guc *guc)
>> intel_guc_ggtt_offset(guc, guc->shared_data)
>> };
>> - return intel_guc_send(guc, data, ARRAY_SIZE(data));
>> + return guc_sleep_state_action(guc, data, ARRAY_SIZE(data));
>> }
>> /**
>> diff --git a/drivers/gpu/drm/i915/intel_guc_fwif.h
>> b/drivers/gpu/drm/i915/intel_guc_fwif.h
>> index 8382d591c784..b0eb5aabe0a7 100644
>> --- a/drivers/gpu/drm/i915/intel_guc_fwif.h
>> +++ b/drivers/gpu/drm/i915/intel_guc_fwif.h
>> @@ -687,6 +687,12 @@ enum intel_guc_report_status {
>> INTEL_GUC_REPORT_STATUS_COMPLETE = 0x4,
>> };
>> +enum intel_guc_sleep_state_status {
>> + INTEL_GUC_SLEEP_STATE_SUCCESS = 0x0,
>> + INTEL_GUC_SLEEP_STATE_PREEMPT_TO_IDLE_FAILED = 0x1,
>> + INTEL_GUC_SLEEP_STATE_ENGINE_RESET_FAILED = 0x2
>
> as we waiting for state change, maybe we should explicitly define
> INTEL_GUC_SLEEP_STATE_INVALID_MASK = 0x80000000,
> and use it in __intel_wait_for_register ?
ack
>
>> +};
>> +
>> #define GUC_LOG_CONTROL_LOGGING_ENABLED (1 << 0)
>> #define GUC_LOG_CONTROL_VERBOSITY_SHIFT 4
>> #define GUC_LOG_CONTROL_VERBOSITY_MASK (0xF <<
>> GUC_LOG_CONTROL_VERBOSITY_SHIFT)
>
Note for you while I'm it: I've been told that the gen11 GuC FW has a
simplified and improved flow for suspend/resume, so some changes will be
required in your series. Not sure about the details.
Thanks,
Daniele
> Thanks,
> Michal
>
> [1] https://patchwork.freedesktop.org/patch/256921/
More information about the Intel-gfx
mailing list