[Intel-gfx] [PATCH v4 9/9] drm/i915/perf: Add support for OA media units
Umesh Nerlige Ramappa
umesh.nerlige.ramappa at intel.com
Sat Mar 11 00:18:30 UTC 2023
On Fri, Mar 10, 2023 at 09:36:52AM -0800, Dixit, Ashutosh wrote:
>On Fri, 10 Mar 2023 08:39:27 -0800, Umesh Nerlige Ramappa wrote:
>>
>
>Hi Umesh,
>
>> On Thu, Mar 09, 2023 at 03:57:48PM -0800, Dixit, Ashutosh wrote:
>> > On Tue, 07 Mar 2023 12:16:11 -0800, Umesh Nerlige Ramappa wrote:
>> >>
>> >> -static int gen8_configure_context(struct i915_gem_context *ctx,
>> >> +static int gen8_configure_context(struct i915_perf_stream *stream,
>> >> + struct i915_gem_context *ctx,
>> >> struct flex *flex, unsigned int count)
>> >> {
>> >> struct i915_gem_engines_iter it;
>> >> @@ -2573,7 +2594,8 @@ static int gen8_configure_context(struct i915_gem_context *ctx,
>> >> for_each_gem_engine(ce, i915_gem_context_lock_engines(ctx), it) {
>> >> GEM_BUG_ON(ce == ce->engine->kernel_context);
>> >>
>> >> - if (!engine_supports_oa(ce->engine))
>> >> + if (!engine_supports_oa(ce->engine) ||
>> >> + ce->engine->class != stream->engine->class)
>> >> continue;
>> >>
>> >> /* Otherwise OA settings will be set upon first use */
>> >> @@ -2704,7 +2726,7 @@ oa_configure_all_contexts(struct i915_perf_stream *stream,
>> >>
>> >> spin_unlock(&i915->gem.contexts.lock);
>> >>
>> >> - err = gen8_configure_context(ctx, regs, num_regs);
>> >> + err = gen8_configure_context(stream, ctx, regs, num_regs);
>> >> if (err) {
>> >> i915_gem_context_put(ctx);
>> >> return err;
>> >> @@ -2724,7 +2746,8 @@ oa_configure_all_contexts(struct i915_perf_stream *stream,
>> >> for_each_uabi_engine(engine, i915) {
>> >> struct intel_context *ce = engine->kernel_context;
>> >>
>> >> - if (!engine_supports_oa(ce->engine))
>> >> + if (!engine_supports_oa(ce->engine) ||
>> >> + ce->engine->class != stream->engine->class)
>> >> continue;
>> >>
>> >> regs[0].value = intel_sseu_make_rpcs(engine->gt, &ce->sseu);
>> >> @@ -2749,6 +2772,9 @@ gen12_configure_all_contexts(struct i915_perf_stream *stream,
>> >> },
>> >> };
>> >>
>> >> + if (stream->engine->class != RENDER_CLASS)
>> >> + return 0;
>> >> +
>> >> return oa_configure_all_contexts(stream,
>> >> regs, ARRAY_SIZE(regs),
>> >> active);
>> >
>> > Can you please explain the above changes? Why are we checking for
>> > engine->class above? Should we be checking for both class and instance? Or
>> > all engines connected to an OA unit (multiple classes can be connected to
>> > an OA unit and be different from stream->engine->class, e.g. VDBOX and
>> > VEBOX)? oa_configure_all_contexts is also called from
>> > lrc_configure_all_contexts.
This check primarily blocks media engine use cases from entering
oa_configure_all_contexts().
lrc_configure_all_contexts applies to pre-gen12 only. On pre-gen12,
engine_supports_oa() should return true only for render.
>>
>> Only render (and compute when we support it) have OA specific configuration
>> in the context image. Media engines do not have any context specific
>> configurations.
>
>Yes I remember you answered this previously too. My question still is why
>did we make the 2 instances of this change above:
>
>From the original code in drm-tip:
>
> if (engine->class != RENDER_CLASS)
> continue;
>
>To the final code (changed in two patches):
>
> if (!engine_supports_oa(ce->engine) ||
> ce->engine->class != stream->engine->class)
> continue;
I think some changes are a result of incrementally supporting compute
and then media in OA. Since we have not upstreamed the compute support,
some lines of code remain.
With compute support the "if (engine->class != RENDER_CLASS)" changed to
"if (!engine_supports_oa(ce->engine)). Later, OAM support brought the
other condition that checks classes because this code is under
for_each_uabi_engine(engine, i915). When we run this for an OA use case
where user has passed rcs0 for ex, it will still iterate over the media
engines. Since we now support media engines, we should skip them in this
loop.
The other question on whether this should be class specific or span
multiple engines, I have to check that specifically for OAG. Ideally,
the PWR_CLK_STATE should be configured for all engines that support it
(render and compute where available), so the above check should be
if (!engine_supports_oa(ce->engine) ||
!engine_has_pwr_clk_state(ce->engine))
A jira will help track this and I can address that in a separate
patch/series if it turns out to be an issue.
Thanks,
Umesh
>
>Thanks.
>--
>Ashutosh
More information about the Intel-gfx
mailing list