[Intel-gfx] [PATCH v8 5/6] drm/i915/mtl: Add function to send command to GSC CS

Mon Jan 30 07:12:24 UTC 2023


> -----Original Message-----
> From: Teres Alexis, Alan Previn <alan.previn.teres.alexis at intel.com>
> Sent: Friday, January 27, 2023 7:14 PM
> To: Kandpal, Suraj <suraj.kandpal at intel.com>; intel-
> gfx at lists.freedesktop.org
> Cc: Shankar, Uma <uma.shankar at intel.com>; Nautiyal, Ankit K
> <ankit.k.nautiyal at intel.com>; Ceraolo Spurio, Daniele
> <daniele.ceraolospurio at intel.com>; Gupta, Anshuman
> <anshuman.gupta at intel.com>
> Subject: Re: [PATCH v8 5/6] drm/i915/mtl: Add function to send command to
> GSC CS
> 
> On Tue, 2023-01-24 at 15:12 +0530, Kandpal, Suraj wrote:
> > Add function that takes care of sending command to gsc cs. We start of
> > with allocation of memory for our command intel_hdcp_gsc_message that
> > contains gsc cs memory header as directed in specs followed by the
> > actual payload hdcp message that we want to send.
> > Spec states that we need to poll pending bit of response header around
> > 20 times each try being 50ms apart hence adding that to current
> > gsc_msg_send function Also we use the same function to take care of
> > both sending and receiving hence no separate function to get the
> > response.
> >
> > --v4
> > -Create common function to fill in gsc_mtl_header [Alan] -define host
> > session bitmask [Alan]
> >
> > --v5
> > -use i915 directly instead of gt->i915 [Alan] -No need to make fields
> > NULL as we are already using kzalloc [Alan]
> >
> > --v8
> > -change mechanism to reuse the same memory for one hdcp session[Alan]
> > -fix header ordering -add comments to explain flags and host session
> > mask [Alan]
> >
> > Cc: Ankit Nautiyal <ankit.k.nautiyal at intel.com>
> > Cc: Daniele Ceraolo Spurio <daniele.ceraolospurio at intel.com>
> > Cc: Alan Pervin Teres <alan.previn.teres.alexis at intel.com>
> > Cc: Uma Shankar <uma.shankar at intel.com>
> > Cc: Anshuman Gupta <anshuman.gupta at intel.com>
> > Signed-off-by: Suraj Kandpal <suraj.kandpal at intel.com>
> > ---
> >  drivers/gpu/drm/i915/Makefile                 |   1 +
> >  .../gpu/drm/i915/display/intel_display_core.h |   5 +
> >  drivers/gpu/drm/i915/display/intel_hdcp.c     |  20 ++
> >  drivers/gpu/drm/i915/display/intel_hdcp_gsc.c | 185
> > ++++++++++++++++++
> >  drivers/gpu/drm/i915/display/intel_hdcp_gsc.h |  27 +++
> >  .../i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c |  15 ++
> >  .../i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h |  16 ++
> >  7 files changed, 269 insertions(+)
> >  create mode 100644 drivers/gpu/drm/i915/display/intel_hdcp_gsc.c
> >  create mode 100644 drivers/gpu/drm/i915/display/intel_hdcp_gsc.h
> >
> > diff --git a/drivers/gpu/drm/i915/Makefile
> > b/drivers/gpu/drm/i915/Makefile index 461d6b40656d..194aae92c354
> > 100644
> > --- a/drivers/gpu/drm/i915/Makefile
> > +++ b/drivers/gpu/drm/i915/Makefile
> > @@ -254,6 +254,7 @@ i915-y += \
> >         display/intel_frontbuffer.o \
> >         display/intel_global_state.o \
> >         display/intel_hdcp.o \
> > +       display/intel_hdcp_gsc.o \
> >         display/intel_hotplug.o \
> >         display/intel_hti.o \
> >         display/intel_lpe_audio.o \
> > diff --git a/drivers/gpu/drm/i915/display/intel_display_core.h
> > b/drivers/gpu/drm/i915/display/intel_display_core.h
> > index 132e9134ba05..7e9f3f87c767 100644
> > --- a/drivers/gpu/drm/i915/display/intel_display_core.h
> > +++ b/drivers/gpu/drm/i915/display/intel_display_core.h
> > @@ -372,6 +372,11 @@ struct intel_display {
> >                 struct i915_hdcp_master *master;
> >                 bool comp_added;
> >
> > +               /*HDCP message struct for allocation of memory which
> > +can be reused
> > +                * when sending message to gsc cs
> > +                * this is only populated post Meteorlake
> > +                */
> > +               struct intel_hdcp_gsc_message *hdcp_message;
> >                 /* Mutex to protect the above hdcp component related
> > values. */
> >                 struct mutex comp_mutex;
> alan: apologies for these additional self-narration drizzled with questions
> after an 8th rev - please bear with me:
> 
> 1. Firstly thanks for changing the codes from allocating 'struct
> intel_hdcp_gsc_message'
>    dynamically-as-we-need-send-a-msg it to an init-time allocation that we
> can continue
>    to use over life of a hdcp-enabled-connector (so we dont have to keep
> allocating and
>    freeing every few seconds).
> 2. With this new code i was lead to look at "intel_display->hdcp" usage.
> intel_display
>    is a global context i.e. (not per crtc) and the hdcp substruct in intel_display
> is
>    holding global structures too (with comp_mutex protecting master).
> However
>    'struct intel_hdcp_gsc_message' is used at a per-connector level (but only
> for active
>    connectors which cannot exist without an attached crtc).
> 3. That said, although we do use comp_mutex for all the connector level
> hdcp activity,
>    those activities sit behind a global lock and component interface. To me this
> looks
>    inefficient because we are forcing serialization of hdcp operations across all
> the
>    connectors with active hdcp operatoin. Putting an instance of
> hdcp_message per connector
>    will allow parallelism and reduce overall latencies. (perhaps attaching
> hdcp_message
>    to crtc works better as we might have more logical connectors than active -
> capped
>    strictly by no. of crtcs).
> 4. I am aware that in the past we wouldn't have gained anything by such
> paralellism since
>    the hdcp interactions would have to be serialized behind the hdcp
> component interface
>    anyway. Today, with the gsc engine, we could try to parallelize stuff a bit
> more?
>    (but not completely since there is only one gsc firmware and I can't recall if
> the GSC fw
>    can operate on hdcp requests in parallel and if they support different host-
> session-
>    handles per display pipe). As such, I am not sure if this ask can gain any
> value effort
>    (thinking about dealing with 2 3 or more crtcs all with HDCP enabled and
> doing longer
>    operations like re-establishing the hdcp link). Do you think its worth it?
>    But i am not sure about whether this will work with display team direction
> (i.e. keeping
>    the hdcp instructions as part of the component driver).
>    Thus, this would be your and display team's call, not mine.
> 5. Event if #4 is "lets not move hdcp_message to connector level", i believe it
> looks like

So I don’t see the any significant gain to parallelizing our approach to the amount of effort that would be needed
To put in to change the design at this point 

>    you can allocate and free the hdcp_message at connector level hdcp init
> and teardown without
>    checking if its been allocated before if multiple connectors are currently
> being enabled or
>    already active. Please correct me if i am mistaken.

Good point will try to get this done on my next revision

> 
> 
> >         } hdcp;
> > diff --git a/drivers/gpu/drm/i915/display/intel_hdcp.c
> > b/drivers/gpu/drm/i915/display/intel_hdcp.c
> > index 0d6aed1eb171..c578fcc34bfd 100644
> > --- a/drivers/gpu/drm/i915/display/intel_hdcp.c
> > +++ b/drivers/gpu/drm/i915/display/intel_hdcp.c
> > @@ -23,6 +23,7 @@
> >  #include "intel_display_power_well.h"
> >  #include "intel_display_types.h"
> >  #include "intel_hdcp.h"
> > +#include "intel_hdcp_gsc.h"
> >  #include "intel_hdcp_regs.h"
> >  #include "intel_pcode.h"
> >
> > @@ -1966,12 +1967,28 @@ static int _intel_hdcp2_enable(struct
> > intel_connector *connector)
> >  {
> >         struct drm_i915_private *i915 = to_i915(connector->base.dev);
> >         struct intel_hdcp *hdcp = &connector->hdcp;
> > +       struct intel_hdcp_gsc_message *hdcp_message;
> >         int ret;
> >
> >         drm_dbg_kms(&i915->drm, "[%s:%d] HDCP2.2 is being enabled.
> > Type: %d\n",
> >                     connector->base.name, connector->base.base.id,
> >                     hdcp->content_type);
> >
> > +       if (DISPLAY_VER(i915) >= 14) {
> > +               hdcp_message = kzalloc(sizeof(*hdcp_message),
> > +GFP_KERNEL);
> > +
> > +               if (!hdcp_message)
> > +                       return -ENOMEM;
> > +
> > +               i915->display.hdcp.hdcp_message = hdcp_message;
> alan: non-blocker: honestly, just reading this code looks a bit of an issue (we
> are overriding a global's member-ptr from inside a connector level function).
> Perhaps a comment that says something like : Note: any connector level hdcp
> function would have already taken display->hdcp->comp_mutex farther up
> the stack so we can never trample hdcp_message between connector level
> hdcp functions concurrently. this is related to the last question in my point #5
> above.
> 
This wont be an issue anymore as I am planning to move the allocation and deallocation to init
And teardown as suggested by you

> 
> > +               ret = intel_hdcp_gsc_initialize_message(i915,
> > +hdcp_message);
> > +
> > +               if (ret) {
> > +                       drm_err(&i915->drm, "Could not initialize
> > +hdcp_message\n");
> > +                       return ret;
> > +               }
> > +       }
> > +
> >         ret = hdcp2_authenticate_and_encrypt(connector);
> >         if (ret) {
> >                 drm_dbg_kms(&i915->drm, "HDCP2 Type%d  Enabling
> > Failed. (%d)\n", @@ -2018,6 +2035,9 @@ _intel_hdcp2_disable(struct
> > intel_connector *connector, bool hdcp2_link_recovery
> >         if (hdcp2_deauthenticate_port(connector) < 0)
> >                 drm_dbg_kms(&i915->drm, "Port deauth failed.\n");
> >
> > +       if (DISPLAY_VER(i915) >= 14)
> > +               intel_hdcp_gsc_free_message(i915);
> > +
> >         connector->hdcp.hdcp2_encrypted = false;
> >         dig_port->hdcp_auth_status = false;
> >         data->k = 0;
> > diff --git a/drivers/gpu/drm/i915/display/intel_hdcp_gsc.c
> > b/drivers/gpu/drm/i915/display/intel_hdcp_gsc.c
> > new file mode 100644
> > index 000000000000..8da4faf9b10c
> > --- /dev/null
> > +++ b/drivers/gpu/drm/i915/display/intel_hdcp_gsc.c
> > @@ -0,0 +1,185 @@
> > +// SPDX-License-Identifier: MIT
> > +/*
> > + * Copyright 2023, Intel Corporation.
> > + */
> > +
> > +#include "display/intel_hdcp_gsc.h"
> > +#include "gem/i915_gem_region.h"
> > +#include "gt/uc/intel_gsc_uc_heci_cmd_submit.h"
> > +#include "i915_drv.h"
> > +#include "i915_utils.h"
> > +
> > +/*This function helps allocate memory for the command that we will
> > +send to gsc cs */ int intel_hdcp_gsc_initialize_message(struct
> > +drm_i915_private *i915,
> > +                                     struct intel_hdcp_gsc_message
> > +*hdcp_message) {
> > +       struct intel_gt *gt = i915->media_gt;
> > +       struct drm_i915_gem_object *obj = NULL;
> > +       struct i915_vma *vma = NULL;
> > +       void *cmd;
> > +       int err;
> > +
> > +       /* allocate object of one page for HDCP command memory and
> > +store it */
> > +       obj = i915_gem_object_create_shmem(i915, PAGE_SIZE);
> > +
> > +       if (IS_ERR(obj)) {
> > +               drm_err(&i915->drm, "Failed to allocate HDCP streaming
> > +command!\n");
> > +               return PTR_ERR(obj);
> > +       }
> > +
> > +       cmd = i915_gem_object_pin_map_unlocked(obj,
> > +i915_coherent_map_type(i915, obj, true));
> > +       if (IS_ERR(cmd)) {
> > +               drm_err(&i915->drm, "Failed to map gsc message
> > +page!\n");
> > +               err = PTR_ERR(cmd);
> > +               goto out_unpin;
> > +       }
> > +
> > +       vma = i915_vma_instance(obj, &gt->ggtt->vm, NULL);
> > +       if (IS_ERR(vma)) {
> > +               err = PTR_ERR(vma);
> > +               goto out_unmap;
> > +       }
> > +
> > +       err = i915_vma_pin(vma, 0, 0, PIN_GLOBAL);
> > +       if (err)
> > +               goto out_unmap;
> > +
> > +       memset(cmd, 0, obj->base.size);
> > +
> > +       hdcp_message->obj = obj;
> see below comment about possible option to eliminate keeping obj
> > +       hdcp_message->hdcp_cmd = cmd;
> > +       hdcp_message->vma = vma;
> > +
> > +       return 0;
> > +
> > +out_unmap:
> > +       i915_gem_object_unpin_map(obj);
> > +out_unpin:
> > +       i915_gem_object_put(obj);
> > +       return err;
> > +}
> > +
> > +void intel_hdcp_gsc_free_message(struct drm_i915_private *i915) {
> > +       struct intel_hdcp_gsc_message *hdcp_message =
> > +
> > +i915->display.hdcp.hdcp_message;
> > +       struct drm_i915_gem_object *obj =
> > +fetch_and_zero(&hdcp_message->obj);
> > +
> > +       if (!obj)
> > +               return;
> > +
> > +       if (hdcp_message->vma)
> > +               i915_vma_unpin(fetch_and_zero(&hdcp_message->vma));
> > +
> > +       i915_gem_object_unpin_map(obj);
> > +       i915_gem_object_put(obj);
> > +       kfree(hdcp_message);
> >
> above cleanup code might be significantly minimized / cleaned up by using
> just i915_vma_unpin_and_release(vma, I915_VMA_RELEASE_MAP) which
> will also unmap, unpit and release the object internally and with that you
> don't need to carry "obj" in hdcp_message anymore.

Ahh sorry for making you repeat this review missed this one while floating will
Look into this

Regards,
Suraj Kandpal
> 
> > +}
> > +
> > +static int intel_gsc_send_sync(struct drm_i915_private *i915,
> > +                              struct intel_gsc_mtl_header *header,
> > +u64 addr,
> > +                              size_t msg_out_len) {
> > +       struct intel_gt *gt = i915->media_gt;
> > +       int ret;
> > +
> > +       header->flags = 0;
> > +       ret = intel_gsc_uc_heci_cmd_submit_packet(&gt->uc.gsc, addr,
> > +
> > +header->message_size,
> > +                                                 addr,
> > +                                                 msg_out_len +
> > +sizeof(*header));
> > +       if (ret) {
> > +               drm_err(&i915->drm, "failed to send gsc HDCP msg
> > +(%d)\n", ret);
> > +               return ret;
> > +       }
> > +       /*
> > +        * Checking validity marker for memory sanity
> > +        */
> > +       if (header->validity_marker != GSC_HECI_VALIDITY_MARKER) {
> > +               drm_err(&i915->drm, "invalid validity marker\n");
> > +               return -EINVAL;
> > +       }
> > +
> > +       if (header->status != 0) {
> > +               drm_err(&i915->drm, "header status indicates error
> > +%d\n",
> > +                       header->status);
> > +               return -EINVAL;
> > +       }
> > +
> > +       if (header->flags & GSC_OUTFLAG_MSG_PENDING)
> > +               return -EAGAIN;
> > +
> > +       return 0;
> > +}
> > +
> > +/*
> > + * This function can now be used for sending requests and will also
> > +handle
> > + * receipt of reply messages hence no different function of message
> > +retrieval
> > + * is required. We will initialize intel_hdcp_gsc_message structure
> > +then add
> > + * gsc cs memory header as stated in specs after which the normal
> > +HDCP payload
> > + * will follow
> > + */
> > +ssize_t intel_hdcp_gsc_msg_send(struct drm_i915_private *i915, u8
> > +*msg_in,
> > +                               size_t msg_in_len, u8 *msg_out, size_t
> > +msg_out_len) {
> > +       struct intel_gt *gt = i915->media_gt;
> > +       struct intel_gsc_mtl_header *header;
> > +       const size_t max_msg_size = PAGE_SIZE - sizeof(*header);
> > +       struct intel_hdcp_gsc_message *hdcp_message;
> > +       u64 addr, host_session_id;
> > +       u32 reply_size, msg_size;
> > +       int ret, tries = 0;
> > +
> > +       if (!intel_uc_uses_gsc_uc(&gt->uc))
> > +               return -ENODEV;
> > +
> > +       if (msg_in_len > max_msg_size || msg_out_len > max_msg_size)
> > +               return -ENOSPC;
> > +
> > +       hdcp_message = i915->display.hdcp.hdcp_message;
> > +       header = hdcp_message->hdcp_cmd;
> > +       addr = i915_ggtt_offset(hdcp_message->vma);
> > +
> > +       msg_size = msg_in_len + sizeof(*header);
> > +       memset(header, 0, msg_size);
> > +       get_random_bytes(&host_session_id, sizeof(u64));
> > +       intel_gsc_uc_heci_cmd_emit_mtl_header(header,
> > +HECI_MEADDRESS_HDCP,
> > +                                             msg_size,
> > +host_session_id);
> > +       memcpy(hdcp_message->hdcp_cmd + sizeof(*header), msg_in,
> > +msg_in_len);
> > +
> > +       /*
> > +        * Keep sending request in case the pending bit is set no need
> > +to add
> > +        * message handle as we are using same address hence loc. of
> > +header is
> > +        * same and it will contain the message handle. we will send
> > +the message
> > +        * 20 times each message 50 ms apart
> > +        */
> > +       do {
> > +               ret = intel_gsc_send_sync(i915, header, addr,
> > +msg_out_len);
> > +
> > +               /* Only try again if gsc says so */
> > +               if (ret != -EAGAIN)
> > +                       break;
> > +
> > +               msleep(50);
> > +
> > +       } while (++tries < 20);
> > +
> > +       if (ret)
> > +               goto err;
> > +
> > +       /* we use the same mem for the reply, so header is in the same
> > +loc */
> > +       reply_size = header->message_size - sizeof(*header);
> > +       if (reply_size > msg_out_len) {
> > +               drm_warn(&i915->drm, "caller with insufficient HDCP
> > +reply size %u (%d)\n",
> > +                        reply_size, (u32)msg_out_len);
> > +               reply_size = msg_out_len;
> > +       } else if (reply_size != msg_out_len) {
> > +               drm_dbg_kms(&i915->drm, "caller unexpected HCDP reply
> > +size %u (%d)\n",
> > +                           reply_size, (u32)msg_out_len);
> > +       }
> > +
> > +       memcpy(msg_out, hdcp_message->hdcp_cmd + sizeof(*header),
> > +msg_out_len);
> > +
> > +err:
> > +       return ret;
> > +}
> > diff --git a/drivers/gpu/drm/i915/display/intel_hdcp_gsc.h
> > b/drivers/gpu/drm/i915/display/intel_hdcp_gsc.h
> > new file mode 100644
> > index 000000000000..8352b31a7e4a
> > --- /dev/null
> > +++ b/drivers/gpu/drm/i915/display/intel_hdcp_gsc.h
> > @@ -0,0 +1,27 @@
> > +/* SPDX-License-Identifier: MIT */
> > +/*
> > + * Copyright © 2023 Intel Corporation  */
> > +
> > +#ifndef __INTEL_HDCP_GSC_H__
> > +#define __INTEL_HDCP_GSC_H__
> > +
> > +#include <linux/err.h>
> > +#include <linux/types.h>
> > +
> > +struct drm_i915_private;
> > +
> > +struct intel_hdcp_gsc_message {
> > +       struct drm_i915_gem_object *obj;
> > +       struct i915_vma *vma;
> > +       void *hdcp_cmd;
> > +};
> > +
> > +int intel_hdcp_gsc_initialize_message(struct drm_i915_private *i915,
> > +                                     struct intel_hdcp_gsc_message
> > +*hdcp_message); void intel_hdcp_gsc_free_message(struct
> > +drm_i915_private *i915); ssize_t intel_hdcp_gsc_msg_send(struct
> > +drm_i915_private *i915, u8 *msg_in,
> > +                               size_t msg_in_len, u8 *msg_out,
> > +                               size_t msg_out_len);
> > +
> > +#endif /* __INTEL_HDCP_GCS_H__ */
> > diff --git a/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c
> > b/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c
> > index be2424af521d..ea0da06e2f39 100644
> > --- a/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c
> > +++ b/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.c
> > @@ -92,3 +92,18 @@ int intel_gsc_uc_heci_cmd_submit_packet(struct
> > intel_gsc_uc *gsc, u64 addr_in,
> >
> >         return err;
> >  }
> > +
> > +void intel_gsc_uc_heci_cmd_emit_mtl_header(struct
> > +intel_gsc_mtl_header *header,
> > +                                          u8 heci_client_id, u32
> > +message_size,
> > +                                          u64 host_session_id) {
> > +       host_session_id &= ~HOST_SESSION_MASK;
> > +       if (heci_client_id == HECI_MEADDRESS_PXP)
> > +               host_session_id |= HOST_SESSION_PXP_SINGLE;
> > +
> > +       header->validity_marker = GSC_HECI_VALIDITY_MARKER;
> > +       header->heci_client_id = heci_client_id;
> > +       header->host_session_handle = host_session_id;
> > +       header->header_version = MTL_GSC_HEADER_VERSION;
> > +       header->message_size = message_size; }
> > diff --git a/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h
> > b/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h
> > index cf610dfca7a5..3d56ae501991 100644
> > --- a/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h
> > +++ b/drivers/gpu/drm/i915/gt/uc/intel_gsc_uc_heci_cmd_submit.h
> > @@ -22,7 +22,17 @@ struct intel_gsc_mtl_header {
> >         u16 header_version;
> >  #define MTL_GSC_HEADER_VERSION 1
> >
> > +       /*
> > +        * FW allows host to decide host_session handle
> > +        * as it sees fit.
> > +        * For intertracebility reserving select bits(60-63)
> > +        * to differentiate caller-target subsystem
> > +        * 0000 - HDCP
> > +        * 0001 - PXP Single Session
> > +        */
> >         u64 host_session_handle;
> > +#define HOST_SESSION_MASK      REG_GENMASK64(63, 60) #define
> > +HOST_SESSION_PXP_SINGLE BIT_ULL(60)
> >         u64 gsc_message_handle;
> >
> >         u32 message_size; /* lower 20 bits only, upper 12 are reserved
> > */ @@ -33,8 +43,11 @@ struct intel_gsc_mtl_header {
> >          * Bit 1: Session Cleanup;
> >          * Bits 2-15: Flags
> >          * Bits 16-31: Extension Size
> > +        * According to internal spec flags are either input or output
> > +        * we distinguish the flags using OUTFLAG or INFLAG
> >          */
> >         u32 flags;
> > +#define GSC_OUTFLAG_MSG_PENDING        1
> >
> >         u32 status;
> >  } __packed;
> > @@ -42,4 +55,7 @@ struct intel_gsc_mtl_header {
> >  int intel_gsc_uc_heci_cmd_submit_packet(struct intel_gsc_uc *gsc,
> >                                         u64 addr_in, u32 size_in,
> >                                         u64 addr_out, u32 size_out);
> > +void intel_gsc_uc_heci_cmd_emit_mtl_header(struct
> > +intel_gsc_mtl_header *header,
> > +                                          u8 heci_client_id, u32
> > +message_size,
> > +                                          u64 host_session_id);
> >  #endif
> > --
> > 2.25.1
> >