[Intel-gfx] [PATCH v8 5/6] drm/i915/mtl: Add function to send command to GSC CS
Kandpal, Suraj
suraj.kandpal at intel.com
Mon Jan 30 07:12:24 UTC 2023
> -----Original Message-----
> From: Teres Alexis, Alan Previn <alan.previn.teres.alexis at intel.com>
> Sent: Friday, January 27, 2023 7:14 PM
> To: Kandpal, Suraj <suraj.kandpal at intel.com>; intel-
> gfx at lists.freedesktop.org
> Cc: Shankar, Uma <uma.shankar at intel.com>; Nautiyal, Ankit K
> <ankit.k.nautiyal at intel.com>; Ceraolo Spurio, Daniele
> <daniele.ceraolospurio at intel.com>; Gupta, Anshuman
> <anshuman.gupta at intel.com>
> Subject: Re: [PATCH v8 5/6] drm/i915/mtl: Add function to send command to
> GSC CS
>
> On Tue, 2023-01-24 at 15:12 +0530, Kandpal, Suraj wrote:
> > Add function that takes care of sending command to gsc cs. We start of
> > with allocation of memory for our command intel_hdcp_gsc_message that
> > contains gsc cs memory header as directed in specs followed by the
> > actual payload hdcp message that we want to send.
> > Spec states that we need to poll pending bit of response header around
> > 20 times each try being 50ms apart hence adding that to current
> > gsc_msg_send function Also we use the same function to take care of
> > both sending and receiving hence no separate function to get the
> > response.
> >
> > --v4
> > -Create common function to fill in gsc_mtl_header [Alan] -define host
> > session bitmask [Alan]
> >
> > --v5
> > -use i915 directly instead of gt->i915 [Alan] -No need to make fields
> > NULL as we are already using kzalloc [Alan]
> >
> > --v8
> > -change mechanism to reuse the same memory for one hdcp session[Alan]
> > -fix header ordering -add comments to explain flags and host session
> > mask [Alan]
> >
> > Cc: Ankit Nautiyal <ankit.k.nautiyal at intel.com>
> > Cc: Daniele Ceraolo Spurio <daniele.ceraolospurio at intel.com>
> > Cc: Alan Pervin Teres <alan.previn.teres.alexis at intel.com>
> > Cc: Uma Shankar <uma.shankar at intel.com>
> > Cc: Anshuman Gupta <anshuman.gupta at intel.com>
> > Signed-off-by: Suraj Kandpal <suraj.kandpal at intel.com>
> > ---
> > drivers/gpu/drm/i915/Makefile | 1 +
> > .../gpu/drm/i915/display/intel_display_core.h | 5 +
> > drivers/gpu/drm/i915/display/intel_hdcp.c | 20 ++
> > drivers/gpu/drm/i915/display/intel_hdcp_gsc.c | 185
> > ++++++++++++++++++
> > drivers/gpu/drm/i915/display/intel_hdcp_gsc.h | 27 +++
> > .../i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c | 15 ++
> > .../i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h | 16 ++
> > 7 files changed, 269 insertions(+)
> > create mode 100644 drivers/gpu/drm/i915/display/intel_hdcp_gsc.c
> > create mode 100644 drivers/gpu/drm/i915/display/intel_hdcp_gsc.h
> >
> > diff --git a/drivers/gpu/drm/i915/Makefile
> > b/drivers/gpu/drm/i915/Makefile index 461d6b40656d..194aae92c354
> > 100644
> > --- a/drivers/gpu/drm/i915/Makefile
> > +++ b/drivers/gpu/drm/i915/Makefile
> > @@ -254,6 +254,7 @@ i915-y += \
> > display/intel_frontbuffer.o \
> > display/intel_global_state.o \
> > display/intel_hdcp.o \
> > + display/intel_hdcp_gsc.o \
> > display/intel_hotplug.o \
> > display/intel_hti.o \
> > display/intel_lpe_audio.o \
> > diff --git a/drivers/gpu/drm/i915/display/intel_display_core.h
> > b/drivers/gpu/drm/i915/display/intel_display_core.h
> > index 132e9134ba05..7e9f3f87c767 100644
> > --- a/drivers/gpu/drm/i915/display/intel_display_core.h
> > +++ b/drivers/gpu/drm/i915/display/intel_display_core.h
> > @@ -372,6 +372,11 @@ struct intel_display {
> > struct i915_hdcp_master *master;
> > bool comp_added;
> >
> > + /*HDCP message struct for allocation of memory which
> > +can be reused
> > + * when sending message to gsc cs
> > + * this is only populated post Meteorlake
> > + */
> > + struct intel_hdcp_gsc_message *hdcp_message;
> > /* Mutex to protect the above hdcp component related
> > values. */
> > struct mutex comp_mutex;
> alan: apologies for these additional self-narration drizzled with questions
> after an 8th rev - please bear with me:
>
> 1. Firstly thanks for changing the codes from allocating 'struct
> intel_hdcp_gsc_message'
> dynamically-as-we-need-send-a-msg it to an init-time allocation that we
> can continue
> to use over life of a hdcp-enabled-connector (so we dont have to keep
> allocating and
> freeing every few seconds).
> 2. With this new code i was lead to look at "intel_display->hdcp" usage.
> intel_display
> is a global context i.e. (not per crtc) and the hdcp substruct in intel_display
> is
> holding global structures too (with comp_mutex protecting master).
> However
> 'struct intel_hdcp_gsc_message' is used at a per-connector level (but only
> for active
> connectors which cannot exist without an attached crtc).
> 3. That said, although we do use comp_mutex for all the connector level
> hdcp activity,
> those activities sit behind a global lock and component interface. To me this
> looks
> inefficient because we are forcing serialization of hdcp operations across all
> the
> connectors with active hdcp operatoin. Putting an instance of
> hdcp_message per connector
> will allow parallelism and reduce overall latencies. (perhaps attaching
> hdcp_message
> to crtc works better as we might have more logical connectors than active -
> capped
> strictly by no. of crtcs).
> 4. I am aware that in the past we wouldn't have gained anything by such
> paralellism since
> the hdcp interactions would have to be serialized behind the hdcp
> component interface
> anyway. Today, with the gsc engine, we could try to parallelize stuff a bit
> more?
> (but not completely since there is only one gsc firmware and I can't recall if
> the GSC fw
> can operate on hdcp requests in parallel and if they support different host-
> session-
> handles per display pipe). As such, I am not sure if this ask can gain any
> value effort
> (thinking about dealing with 2 3 or more crtcs all with HDCP enabled and
> doing longer
> operations like re-establishing the hdcp link). Do you think its worth it?
> But i am not sure about whether this will work with display team direction
> (i.e. keeping
> the hdcp instructions as part of the component driver).
> Thus, this would be your and display team's call, not mine.
> 5. Event if #4 is "lets not move hdcp_message to connector level", i believe it
> looks like
So I don’t see the any significant gain to parallelizing our approach to the amount of effort that would be needed
To put in to change the design at this point
> you can allocate and free the hdcp_message at connector level hdcp init
> and teardown without
> checking if its been allocated before if multiple connectors are currently
> being enabled or
> already active. Please correct me if i am mistaken.
Good point will try to get this done on my next revision
>
>
> > } hdcp;
> > diff --git a/drivers/gpu/drm/i915/display/intel_hdcp.c
> > b/drivers/gpu/drm/i915/display/intel_hdcp.c
> > index 0d6aed1eb171..c578fcc34bfd 100644
> > --- a/drivers/gpu/drm/i915/display/intel_hdcp.c
> > +++ b/drivers/gpu/drm/i915/display/intel_hdcp.c
> > @@ -23,6 +23,7 @@
> > #include "intel_display_power_well.h"
> > #include "intel_display_types.h"
> > #include "intel_hdcp.h"
> > +#include "intel_hdcp_gsc.h"
> > #include "intel_hdcp_regs.h"
> > #include "intel_pcode.h"
> >
> > @@ -1966,12 +1967,28 @@ static int _intel_hdcp2_enable(struct
> > intel_connector *connector)
> > {
> > struct drm_i915_private *i915 = to_i915(connector->base.dev);
> > struct intel_hdcp *hdcp = &connector->hdcp;
> > + struct intel_hdcp_gsc_message *hdcp_message;
> > int ret;
> >
> > drm_dbg_kms(&i915->drm, "[%s:%d] HDCP2.2 is being enabled.
> > Type: %d\n",
> > connector->base.name, connector->base.base.id,
> > hdcp->content_type);
> >
> > + if (DISPLAY_VER(i915) >= 14) {
> > + hdcp_message = kzalloc(sizeof(*hdcp_message),
> > +GFP_KERNEL);
> > +
> > + if (!hdcp_message)
> > + return -ENOMEM;
> > +
> > + i915->display.hdcp.hdcp_message = hdcp_message;
> alan: non-blocker: honestly, just reading this code looks a bit of an issue (we
> are overriding a global's member-ptr from inside a connector level function).
> Perhaps a comment that says something like : Note: any connector level hdcp
> function would have already taken display->hdcp->comp_mutex farther up
> the stack so we can never trample hdcp_message between connector level
> hdcp functions concurrently. this is related to the last question in my point #5
> above.
>
This wont be an issue anymore as I am planning to move the allocation and deallocation to init
And teardown as suggested by you
>
> > + ret = intel_hdcp_gsc_initialize_message(i915,
> > +hdcp_message);
> > +
> > + if (ret) {
> > + drm_err(&i915->drm, "Could not initialize
> > +hdcp_message\n");
> > + return ret;
> > + }
> > + }
> > +
> > ret = hdcp2_authenticate_and_encrypt(connector);
> > if (ret) {
> > drm_dbg_kms(&i915->drm, "HDCP2 Type%d Enabling
> > Failed. (%d)\n", @@ -2018,6 +2035,9 @@ _intel_hdcp2_disable(struct
> > intel_connector *connector, bool hdcp2_link_recovery
> > if (hdcp2_deauthenticate_port(connector) < 0)
> > drm_dbg_kms(&i915->drm, "Port deauth failed.\n");
> >
> > + if (DISPLAY_VER(i915) >= 14)
> > + intel_hdcp_gsc_free_message(i915);
> > +
> > connector->hdcp.hdcp2_encrypted = false;
> > dig_port->hdcp_auth_status = false;
> > data->k = 0;
> > diff --git a/drivers/gpu/drm/i915/display/intel_hdcp_gsc.c
> > b/drivers/gpu/drm/i915/display/intel_hdcp_gsc.c
> > new file mode 100644
> > index 000000000000..8da4faf9b10c
> > --- /dev/null
> > +++ b/drivers/gpu/drm/i915/display/intel_hdcp_gsc.c
> > @@ -0,0 +1,185 @@
> > +// SPDX-License-Identifier: MIT
> > +/*
> > + * Copyright 2023, Intel Corporation.
> > + */
> > +
> > +#include "display/intel_hdcp_gsc.h"
> > +#include "gem/i915_gem_region.h"
> > +#include "gt/uc/intel_gsc_uc_heci_cmd_submit.h"
> > +#include "i915_drv.h"
> > +#include "i915_utils.h"
> > +
> > +/*This function helps allocate memory for the command that we will
> > +send to gsc cs */ int intel_hdcp_gsc_initialize_message(struct
> > +drm_i915_private *i915,
> > + struct intel_hdcp_gsc_message
> > +*hdcp_message) {
> > + struct intel_gt *gt = i915->media_gt;
> > + struct drm_i915_gem_object *obj = NULL;
> > + struct i915_vma *vma = NULL;
> > + void *cmd;
> > + int err;
> > +
> > + /* allocate object of one page for HDCP command memory and
> > +store it */
> > + obj = i915_gem_object_create_shmem(i915, PAGE_SIZE);
> > +
> > + if (IS_ERR(obj)) {
> > + drm_err(&i915->drm, "Failed to allocate HDCP streaming
> > +command!\n");
> > + return PTR_ERR(obj);
> > + }
> > +
> > + cmd = i915_gem_object_pin_map_unlocked(obj,
> > +i915_coherent_map_type(i915, obj, true));
> > + if (IS_ERR(cmd)) {
> > + drm_err(&i915->drm, "Failed to map gsc message
> > +page!\n");
> > + err = PTR_ERR(cmd);
> > + goto out_unpin;
> > + }
> > +
> > + vma = i915_vma_instance(obj, >->ggtt->vm, NULL);
> > + if (IS_ERR(vma)) {
> > + err = PTR_ERR(vma);
> > + goto out_unmap;
> > + }
> > +
> > + err = i915_vma_pin(vma, 0, 0, PIN_GLOBAL);
> > + if (err)
> > + goto out_unmap;
> > +
> > + memset(cmd, 0, obj->base.size);
> > +
> > + hdcp_message->obj = obj;
> see below comment about possible option to eliminate keeping obj
> > + hdcp_message->hdcp_cmd = cmd;
> > + hdcp_message->vma = vma;
> > +
> > + return 0;
> > +
> > +out_unmap:
> > + i915_gem_object_unpin_map(obj);
> > +out_unpin:
> > + i915_gem_object_put(obj);
> > + return err;
> > +}
> > +
> > +void intel_hdcp_gsc_free_message(struct drm_i915_private *i915) {
> > + struct intel_hdcp_gsc_message *hdcp_message =
> > +
> > +i915->display.hdcp.hdcp_message;
> > + struct drm_i915_gem_object *obj =
> > +fetch_and_zero(&hdcp_message->obj);
> > +
> > + if (!obj)
> > + return;
> > +
> > + if (hdcp_message->vma)
> > + i915_vma_unpin(fetch_and_zero(&hdcp_message->vma));
> > +
> > + i915_gem_object_unpin_map(obj);
> > + i915_gem_object_put(obj);
> > + kfree(hdcp_message);
> >
> above cleanup code might be significantly minimized / cleaned up by using
> just i915_vma_unpin_and_release(vma, I915_VMA_RELEASE_MAP) which
> will also unmap, unpit and release the object internally and with that you
> don't need to carry "obj" in hdcp_message anymore.
Ahh sorry for making you repeat this review missed this one while floating will
Look into this
Regards,
Suraj Kandpal
>
> > +}
> > +
> > +static int intel_gsc_send_sync(struct drm_i915_private *i915,
> > + struct intel_gsc_mtl_header *header,
> > +u64 addr,
> > + size_t msg_out_len) {
> > + struct intel_gt *gt = i915->media_gt;
> > + int ret;
> > +
> > + header->flags = 0;
> > + ret = intel_gsc_uc_heci_cmd_submit_packet(>->uc.gsc, addr,
> > +
> > +header->message_size,
> > + addr,
> > + msg_out_len +
> > +sizeof(*header));
> > + if (ret) {
> > + drm_err(&i915->drm, "failed to send gsc HDCP msg
> > +(%d)\n", ret);
> > + return ret;
> > + }
> > + /*
> > + * Checking validity marker for memory sanity
> > + */
> > + if (header->validity_marker != GSC_HECI_VALIDITY_MARKER) {
> > + drm_err(&i915->drm, "invalid validity marker\n");
> > + return -EINVAL;
> > + }
> > +
> > + if (header->status != 0) {
> > + drm_err(&i915->drm, "header status indicates error
> > +%d\n",
> > + header->status);
> > + return -EINVAL;
> > + }
> > +
> > + if (header->flags & GSC_OUTFLAG_MSG_PENDING)
> > + return -EAGAIN;
> > +
> > + return 0;
> > +}
> > +
> > +/*
> > + * This function can now be used for sending requests and will also
> > +handle
> > + * receipt of reply messages hence no different function of message
> > +retrieval
> > + * is required. We will initialize intel_hdcp_gsc_message structure
> > +then add
> > + * gsc cs memory header as stated in specs after which the normal
> > +HDCP payload
> > + * will follow
> > + */
> > +ssize_t intel_hdcp_gsc_msg_send(struct drm_i915_private *i915, u8
> > +*msg_in,
> > + size_t msg_in_len, u8 *msg_out, size_t
> > +msg_out_len) {
> > + struct intel_gt *gt = i915->media_gt;
> > + struct intel_gsc_mtl_header *header;
> > + const size_t max_msg_size = PAGE_SIZE - sizeof(*header);
> > + struct intel_hdcp_gsc_message *hdcp_message;
> > + u64 addr, host_session_id;
> > + u32 reply_size, msg_size;
> > + int ret, tries = 0;
> > +
> > + if (!intel_uc_uses_gsc_uc(>->uc))
> > + return -ENODEV;
> > +
> > + if (msg_in_len > max_msg_size || msg_out_len > max_msg_size)
> > + return -ENOSPC;
> > +
> > + hdcp_message = i915->display.hdcp.hdcp_message;
> > + header = hdcp_message->hdcp_cmd;
> > + addr = i915_ggtt_offset(hdcp_message->vma);
> > +
> > + msg_size = msg_in_len + sizeof(*header);
> > + memset(header, 0, msg_size);
> > + get_random_bytes(&host_session_id, sizeof(u64));
> > + intel_gsc_uc_heci_cmd_emit_mtl_header(header,
> > +HECI_MEADDRESS_HDCP,
> > + msg_size,
> > +host_session_id);
> > + memcpy(hdcp_message->hdcp_cmd + sizeof(*header), msg_in,
> > +msg_in_len);
> > +
> > + /*
> > + * Keep sending request in case the pending bit is set no need
> > +to add
> > + * message handle as we are using same address hence loc. of
> > +header is
> > + * same and it will contain the message handle. we will send
> > +the message
> > + * 20 times each message 50 ms apart
> > + */
> > + do {
> > + ret = intel_gsc_send_sync(i915, header, addr,
> > +msg_out_len);
> > +
> > + /* Only try again if gsc says so */
> > + if (ret != -EAGAIN)
> > + break;
> > +
> > + msleep(50);
> > +
> > + } while (++tries < 20);
> > +
> > + if (ret)
> > + goto err;
> > +
> > + /* we use the same mem for the reply, so header is in the same
> > +loc */
> > + reply_size = header->message_size - sizeof(*header);
> > + if (reply_size > msg_out_len) {
> > + drm_warn(&i915->drm, "caller with insufficient HDCP
> > +reply size %u (%d)\n",
> > + reply_size, (u32)msg_out_len);
> > + reply_size = msg_out_len;
> > + } else if (reply_size != msg_out_len) {
> > + drm_dbg_kms(&i915->drm, "caller unexpected HCDP reply
> > +size %u (%d)\n",
> > + reply_size, (u32)msg_out_len);
> > + }
> > +
> > + memcpy(msg_out, hdcp_message->hdcp_cmd + sizeof(*header),
> > +msg_out_len);
> > +
> > +err:
> > + return ret;
> > +}
> > diff --git a/drivers/gpu/drm/i915/display/intel_hdcp_gsc.h
> > b/drivers/gpu/drm/i915/display/intel_hdcp_gsc.h
> > new file mode 100644
> > index 000000000000..8352b31a7e4a
> > --- /dev/null
> > +++ b/drivers/gpu/drm/i915/display/intel_hdcp_gsc.h
> > @@ -0,0 +1,27 @@
> > +/* SPDX-License-Identifier: MIT */
> > +/*
> > + * Copyright © 2023 Intel Corporation */
> > +
> > +#ifndef __INTEL_HDCP_GSC_H__
> > +#define __INTEL_HDCP_GSC_H__
> > +
> > +#include <linux/err.h>
> > +#include <linux/types.h>
> > +
> > +struct drm_i915_private;
> > +
> > +struct intel_hdcp_gsc_message {
> > + struct drm_i915_gem_object *obj;
> > + struct i915_vma *vma;
> > + void *hdcp_cmd;
> > +};
> > +
> > +int intel_hdcp_gsc_initialize_message(struct drm_i915_private *i915,
> > + struct intel_hdcp_gsc_message
> > +*hdcp_message); void intel_hdcp_gsc_free_message(struct
> > +drm_i915_private *i915); ssize_t intel_hdcp_gsc_msg_send(struct
> > +drm_i915_private *i915, u8 *msg_in,
> > + size_t msg_in_len, u8 *msg_out,
> > + size_t msg_out_len);
> > +
> > +#endif /* __INTEL_HDCP_GCS_H__ */
> > diff --git a/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c
> > b/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c
> > index be2424af521d..ea0da06e2f39 100644
> > --- a/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c
> > +++ b/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c
> > @@ -92,3 +92,18 @@ int intel_gsc_uc_heci_cmd_submit_packet(struct
> > intel_gsc_uc *gsc, u64 addr_in,
> >
> > return err;
> > }
> > +
> > +void intel_gsc_uc_heci_cmd_emit_mtl_header(struct
> > +intel_gsc_mtl_header *header,
> > + u8 heci_client_id, u32
> > +message_size,
> > + u64 host_session_id) {
> > + host_session_id &= ~HOST_SESSION_MASK;
> > + if (heci_client_id == HECI_MEADDRESS_PXP)
> > + host_session_id |= HOST_SESSION_PXP_SINGLE;
> > +
> > + header->validity_marker = GSC_HECI_VALIDITY_MARKER;
> > + header->heci_client_id = heci_client_id;
> > + header->host_session_handle = host_session_id;
> > + header->header_version = MTL_GSC_HEADER_VERSION;
> > + header->message_size = message_size; }
> > diff --git a/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h
> > b/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h
> > index cf610dfca7a5..3d56ae501991 100644
> > --- a/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h
> > +++ b/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h
> > @@ -22,7 +22,17 @@ struct intel_gsc_mtl_header {
> > u16 header_version;
> > #define MTL_GSC_HEADER_VERSION 1
> >
> > + /*
> > + * FW allows host to decide host_session handle
> > + * as it sees fit.
> > + * For intertracebility reserving select bits(60-63)
> > + * to differentiate caller-target subsystem
> > + * 0000 - HDCP
> > + * 0001 - PXP Single Session
> > + */
> > u64 host_session_handle;
> > +#define HOST_SESSION_MASK REG_GENMASK64(63, 60) #define
> > +HOST_SESSION_PXP_SINGLE BIT_ULL(60)
> > u64 gsc_message_handle;
> >
> > u32 message_size; /* lower 20 bits only, upper 12 are reserved
> > */ @@ -33,8 +43,11 @@ struct intel_gsc_mtl_header {
> > * Bit 1: Session Cleanup;
> > * Bits 2-15: Flags
> > * Bits 16-31: Extension Size
> > + * According to internal spec flags are either input or output
> > + * we distinguish the flags using OUTFLAG or INFLAG
> > */
> > u32 flags;
> > +#define GSC_OUTFLAG_MSG_PENDING 1
> >
> > u32 status;
> > } __packed;
> > @@ -42,4 +55,7 @@ struct intel_gsc_mtl_header {
> > int intel_gsc_uc_heci_cmd_submit_packet(struct intel_gsc_uc *gsc,
> > u64 addr_in, u32 size_in,
> > u64 addr_out, u32 size_out);
> > +void intel_gsc_uc_heci_cmd_emit_mtl_header(struct
> > +intel_gsc_mtl_header *header,
> > + u8 heci_client_id, u32
> > +message_size,
> > + u64 host_session_id);
> > #endif
> > --
> > 2.25.1
> >
More information about the Intel-gfx
mailing list