[Spice-devel] [PATCH v2 1/1] virtio-video: Add virtio video device specification
Keiichi Watanabe
keiichiw at chromium.org
Wed Jan 8 12:14:48 UTC 2020
Hi Dmitry,
On Wed, Jan 8, 2020 at 7:00 PM Dmitry Sepp <dmitry.sepp at opensynergy.com> wrote:
>
> Hi Keiichi,
>
> On Mittwoch, 8. Januar 2020 07:59:22 CET Keiichi Watanabe wrote:
> > Hi Dmitry,
> >
> > On Wed, Jan 8, 2020 at 1:50 AM Dmitry Sepp <dmitry.sepp at opensynergy.com>
> wrote:
> > > Hi Keiichi,
> > >
> > > thanks for the updates, please see my comments below.
> > >
> > > On Dienstag, 7. Januar 2020 14:24:31 CET Keiichi Watanabe wrote:
> > > > Hi Dmitry,
> > > >
> > > > On Mon, Jan 6, 2020 at 11:59 PM Dmitry Sepp
> > > > <dmitry.sepp at opensynergy.com>
> > >
> > > wrote:
> > > > > Hi,
> > > > >
> > > > > a couple of new comments:
> > > > >
> > > > > On Mittwoch, 18. Dezember 2019 14:02:14 CET Keiichi Watanabe wrote:
> > > > > > From: Dmitry Sepp <dmitry.sepp at opensynergy.com>
> > > > > >
> > > > > > The virtio video encoder device and decoder device provide
> > > > > > functionalities
> > > > > > to encode and decode video stream respectively.
> > > > > > Though video encoder and decoder are provided as different devices,
> > > > > > they
> > > > > > use a same protocol.
> > > > > >
> > > > > > Signed-off-by: Dmitry Sepp <dmitry.sepp at opensynergy.com>
> > > > > > Signed-off-by: Keiichi Watanabe <keiichiw at chromium.org>
> > > > > > ---
> > > > > >
> > > > > > content.tex | 1 +
> > > > > > virtio-video.tex | 579
> > > > > > +++++++++++++++++++++++++++++++++++++++++++++++
> > > > > > 2 files changed, 580 insertions(+)
> > > > > > create mode 100644 virtio-video.tex
> > > > > >
> > > > > > diff --git a/content.tex b/content.tex
> > > > > > index 556b373..9e56839 100644
> > > > > > --- a/content.tex
> > > > > > +++ b/content.tex
> > > > > > @@ -5743,6 +5743,7 @@ \subsubsection{Legacy Interface: Framing
> > > > > > Requirements}\label{sec:Device \input{virtio-vsock.tex}
> > > > > >
> > > > > > \input{virtio-fs.tex}
> > > > > > \input{virtio-rpmb.tex}
> > > > > >
> > > > > > +\input{virtio-video.tex}
> > > > > >
> > > > > > \chapter{Reserved Feature Bits}\label{sec:Reserved Feature Bits}
> > > > > >
> > > > > > diff --git a/virtio-video.tex b/virtio-video.tex
> > > > > > new file mode 100644
> > > > > > index 0000000..30e728d
> > > > > > --- /dev/null
> > > > > > +++ b/virtio-video.tex
> > > > > > @@ -0,0 +1,579 @@
> > > > > > +\section{Video Device}\label{sec:Device Types / Video Device}
> > > > > > +
> > > > > > +The virtio video encoder device and decoder device are virtual
> > > > > > devices
> > > > > > that +supports encoding and decoding respectively. Though the
> > > > > > encoder
> > > > > > and the decoder +are different devices, they use the same protocol.
> > > > > > +
> > > > > > +\subsection{Device ID}\label{sec:Device Types / Video Device /
> > > > > > Device
> > > > > > ID}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[30] encoder device
> > > > > > +\item[31] decoder device
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\subsection{Virtqueues}\label{sec:Device Types / Video Device /
> > > > > > Virtqueues} +
> > > > > > +\begin{description}
> > > > > > +\item[0] controlq - queue for sending control commands.
> > > > > > +\item[1] eventq - queue for sending events happened in the device.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\subsection{Feature bits}\label{sec:Device Types / Video Device /
> > > > > > Feature
> > > > > > bits} +
> > > > > > +\begin{description}
> > > > > > +\item[VIRTIO_VIDEO_F_RESOURCE_GUEST_PAGES (0)] Guest pages can be
> > > > > > used
> > > > > > for
> > > > > > video + buffers.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\devicenormative{\subsubsection}{Feature bits}{Device Types / Video
> > > > > > Device
> > > > > > / Feature bits} +
> > > > > > +The device MUST offer at least one of feature bits.
> > > > > > +
> > > > > > +\subsection{Device configuration layout}\label{sec:Device Types /
> > > > > > Video
> > > > > > Device / Device configuration layout} +
> > > > > > +Video device configuration uses the following layout structure:
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_config {
> > > > > > + le32 max_cap_len;
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{max_cap_len}] defines the maximum length of a
> > > > > > descriptor
> > > > > > + required to call VIRTIO_VIDEO_GET_CAPABILITY in bytes. The device
> > > > > > + MUST set this value.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\subsection{Device Initialization}\label{sec:Device Types / Video
> > > > > > Device /
> > > > > > Device Initialization} +
> > > > > > +\devicenormative{\subsubsection}{Device Initialization}{Device
> > > > > > Types /
> > > > > > Video Device / Device Initialization} +
> > > > > > +The driver SHOULD query device capability by using the
> > > > > > +VIRTIO_VIDEO_T_GET_CAPABILITY and use that information for the
> > > > > > initial
> > > > > > +setup.
> > > > > > +
> > > > > > +\subsection{Device Operation}\label{sec:Device Types / Video Device
> > > > > > /
> > > > > > Device Operation} +
> > > > > > +The driver allocates input and output buffers and queues the
> > > > > > buffers
> > > > > > +to the device. The device performs operations on the buffers
> > > > > > according
> > > > > > +to the function in question.
> > > > > > +
> > > > > > +\subsubsection{Device Operation: Create stream}
> > > > > > +
> > > > > > +To process buffers, the device needs to associate them with a
> > > > > > certain
> > > > > > +video stream (essentially, a context). Streams are created by
> > > > > > +VIRTIO_VIDEO_T_STREAM_CREATE with a default set of parameters
> > > > > > +determined by the device.
> > > > > > +
> > > > > > +\subsubsection{Device Operation: Create buffers}
> > > > > > +
> > > > > > +Buffers are used to store the actual data as well as the relevant
> > > > > > +metadata. Scatter lists are supported, so the buffer doesn't need
> > > > > > to
> > > > > > +be contiguous in guest physical memory.
> > > > > > +
> > > > > > +\begin{itemize*}
> > > > > > +\item Use VIRTIO_VIDEO_T_RESOURCE_CREATE to create a virtio video
> > > > > > + resource that is backed by a buffer allocated from the driver's
> > > > > > + memory.
> > > > > > +\item Use VIRTIO_VIDEO_T_RESOURCE_DESTROY to destroy a resource
> > > > > > that
> > > > > > + is no longer needed.
> > > > > > +\end{itemize*}
> > > > > > +
> > > > > > +\subsubsection{Device Operation: Stream parameter control}
> > > > > > +
> > > > > > +\begin{itemize*}
> > > > > > +\item Use VIRTIO_VIDEO_T_GET_PARAMS to get the current stream
> > > > > > parameters
> > > > > > for + input and output streams from the device.
> > > > > > +\item Use VIRTIO_VIDEO_T_SET_PARAMS to provide new stream
> > > > > > parameters to
> > > > > > the + device.
> > > > > > +\item After setting stream parameters, the driver may issue
> > > > > > + VIRTIO_VIDEO_T_GET_PARAMS as some parameters of both input and
> > > > > > output
> > > > > > can be + changed implicitly by the device during the set operation.
> > > > > > +\end{itemize*}
> > > > > > +
> > > > > > +\subsubsection{Device Operation: Process buffers}
> > > > > > +
> > > > > > +\begin{itemize*}
> > > > > > +\item If the function and the buffer type require so, write data to
> > > > > > +the buffer memory.
> > > > > > +\item Use VIRTIO_VIDEO_T_RESOURCE_QUEUE to queue the buffer for
> > > > > > +processing in the device.
> > > > > > +\item The request completes asynchronously when the device has
> > > > > > +finished with the buffer.
> > > > > > +\end{itemize*}
> > > > > > +
> > > > > > +\subsubsection{Device Operation: Buffer processing control}
> > > > > > +
> > > > > > +\begin{itemize*}
> > > > > > +\item Use VIRTIO_VIDEO_T_STREAM_DRAIN to ask the device to process
> > > > > > and
> > > > > > + return all of the already queued buffers.
> > > > > > +\item Use VIRTIO_VIDEO_T_QUEUE_CLEAR to ask the device to return
> > > > > > back
> > > > > > + already queued buffers from the input or the output queue. This
> > > > > > also
> > > > > > + includes input or output buffers that can be currently owned by
> > > > > > the
> > > > > > + device's processing pipeline.
> > > > > > +\end{itemize*}
> > > > > > +
> > > > > > +\subsubsection{Device Operation: Asynchronous events}
> > > > > > +
> > > > > > +While processing buffers, the device can send asynchronous event
> > > > > > +notifications to the driver. The behaviour depends on the exact
> > > > > > +stream. For example, the decoder device sends a resolution change
> > > > > > +event when it encounters new resolution metadata in the stream.
> > > > > > +
> > > > > > +\subsubsection{Device Operation: Request header}
> > > > > > +
> > > > > > +All requests and responses on the control virt queue have a fixed
> > > > > > +header using the following layout structure and definitions:
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +enum virtio_video_ctrl_type {
> > > > > > + VIRTIO_VIDEO_CTRL_UNDEFINED = 0,
> > > > > > +
> > > > > > + /* request */
> > > > > > + VIRTIO_VIDEO_T_GET_CAPABILITY = 0x0100,
> > > > > > + VIRTIO_VIDEO_T_STREAM_CREATE,
> > > > > > + VIRTIO_VIDEO_T_STREAM_DESTROY,
> > > > > > + VIRTIO_VIDEO_T_STREAM_DRAIN,
> > > > > > + VIRTIO_VIDEO_T_RESOURCE_CREATE,
> > > > > > + VIRTIO_VIDEO_T_RESOURCE_DESTROY,
> > > > > > + VIRTIO_VIDEO_T_RESOURCE_QUEUE,
> > > > > > + VIRTIO_VIDEO_T_QUEUE_CLEAR,
> > > > > > + VIRTIO_VIDEO_T_SET_PARAMS,
> > > > > > + VIRTIO_VIDEO_T_GET_PARAMS,
> > > > > > +
> > > > > > + /* response */
> > > > > > + VIRTIO_VIDEO_S_OK = 0x0200,
> > > > > > + VIRTIO_VIDEO_S_OK_RESOURCE_QUEUE,
> > > > > > + VIRTIO_VIDEO_S_OK_GET_PARAMS,
> > > > > > +
> > > > > > + VIRTIO_VIDEO_S_ERR_UNSPEC = 0x0300,
> > > > > > + VIRTIO_VIDEO_S_ERR_OUT_OF_MEMORY,
> > > > > > + VIRTIO_VIDEO_S_ERR_INVALID_RESOURCE_ID,
> > > > > > + VIRTIO_VIDEO_S_ERR_INVALID_STREAM_ID,
> > > > > > + VIRTIO_VIDEO_S_ERR_INVALID_PARAMETER,
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_ctrl_hdr {
> > > > > > + le32 type;
> > > > > > + le32 stream_id;
> > > > > > + le32 len; /* Length of the structure in bytes. */
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{type}] is the type of the driver request or the device
> > > > > > +response.
> > > > > > +\item[\field{stream_id}] specifies a target stream.
> > > > > > +\item[\field{len}] is the length of data in bytes, which includes
> > > > > > +length of the header.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\subsubsection{Device Operation: controlq}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_GET_CAPABILITY] Retrieve information about
> > > > > > +supported formats.
> > > > > > +
> > > > > > +The driver uses \field{struct virtio_video_get_capability} to send
> > > > > > a
> > > > > > +query request.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +enum virtio_video_buf_type {
> > > > > > + VIRTIO_VIDEO_BUF_TYPE_INPUT,
> > > > > > + VIRTIO_VIDEO_BUF_TYPE_OUTPUT,
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_get_capability {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + enum virtio_video_buf_type buf_type;
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +\begin{description}
> > > > > > +\item[\field{buf_type}] is the buffer type that the driver asks
> > > > > > +information about. The driver MUST set either
> > > > > > +\field{VIRTIO_VIDEO_BUF_TYPE_INPUT} or
> > > > > > \field{VIRTIO_VIDEO_BUF_TYPE_OUTPUT}. +\end{description}
> > > > > > +
> > > > > > +The device responds a capability by using \field{struct
> > > > > > +virtio_video_get_capability_resp}.
> > > > > > +\begin{lstlisting}
> > > > > > +enum virtio_video_format {
> > > > > > + VIRTIO_VIDEO_FORMAT_UNDEFINED = 0,
> > > > > > + /* Raw formats */
> > > > > > + VIRTIO_VIDEO_FORMAT_NV12 = 1,
> > > > > > + VIRTIO_VIDEO_FORMAT_YUV420,
> > > > > > + VIRTIO_VIDEO_FORMAT_YVU420,
> > > > > > +
> > > > > > + /* Compressed formats */
> > > > > > + VIRTIO_VIDEO_FORMAT_H264 = 0x1001,
> > > > > > + VIRTIO_VIDEO_FORMAT_VP8 = 0x1002,
> > > > > > + VIRTIO_VIDEO_FORMAT_VP9 = 0x1003,
> > > > > > +};
> > > > > > +
> > > > > > +enum virtio_video_profile {
> > > > > > + VIRTIO_VIDEO_PROFILE_UNDEFINED = 0,
> > > > > > +
> > > > > > + /* H.264 */
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_MIN = 0x100,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_BASELINE =
> > > > > > VIRTIO_VIDEO_PROFILE_H264_BASELINE, +
> > > > > > VIRTIO_VIDEO_PROFILE_H264_MAIN,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_EXTENDED,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_HIGH,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_HIGH10PROFILE,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_HIGH422PROFILE,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_HIGH444PREDICTIVEPROFILE,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_SCALABLEBASELINE,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_SCALABLEHIGH,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_STEREOHIGH,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_MULTIVIEWHIGH,
> > > > > > + VIRTIO_VIDEO_PROFILE_H264_MAX =
> > > > > > VIRTIO_VIDEO_PROFILE_H264_MULTIVIEWHIGH, +
> > > > > > + /* VP8 */
> > > > > > + VIRTIO_VIDEO_PROFILE_VP8_MIN = 0x200,
> > > > > > + VIRTIO_VIDEO_PROFILE_VP8_ANY =
> > > > > > VIRTIO_VIDEO_PROFILE_VP8_MIN,
> > > > > > + VIRTIO_VIDEO_PROFILE_VP8_MAX =
> > > > > > VIRTIO_VIDEO_PROFILE_VP8_ANY,
> > > > > > +
> > > > > > + /* VP9 */
> > > > > > + VIRTIO_VIDEO_PROFILE_VP9_MIN = 0x300,
> > > > > > + VIRTIO_VIDEO_PROFILE_VP9_PROFILE0 =
> > > > > > VIRTIO_VIDEO_PROFILE_VP9_MIN,
> > > > > > + VIRTIO_VIDEO_PROFILE_VP9_PROFILE1,
> > > > > > + VIRTIO_VIDEO_PROFILE_VP9_PROFILE2,
> > > > > > + VIRTIO_VIDEO_PROFILE_VP9_PROFILE3,
> > > > > > + VIRTIO_VIDEO_PROFILE_VP9_MAX =
> > > > > > VIRTIO_VIDEO_PROFILE_VP9_PROFILE3,
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_format_range {
> > > > > > + le32 min;
> > > > > > + le32 max;
> > > > > > + le32 step;
> > > > > > + u8 paddings[4];
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_format_desc {
> > > > > > + le32 format; /* One of VIRTIO_VIDEO_FORMAT_* types */
> > > > > > + le32 profile; /* One of VIRTIO_VIDEO_PROFILE_* types */
> > > > > > + le64 mask;
> > > > > > + struct virtio_video_format_range width;
> > > > > > + struct virtio_video_format_range height;
> > > > > > + le32 num_rates;
> > > > > > + u8 padding[4];
> > > > > > + /* Followed by struct virtio_video_frame_rate frame_rates[]
> > > > > > */
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_get_capability_resp {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + le32 num_descs;
> > > > > > + /* Followed by struct virtio_video_format_desc desc[] */
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +The format description \field{struct virtio_video_format_desc}
> > > > > > +includes the following fields:
> > > > > > +\begin{description}
> > > > > > +\item[\field{format}] specifies an image format. The device MUST
> > > > > > set
> > > > > > one
> > > > > > + of \field{enum virtio_video_format}.
> > > > > > +\item[\field{profile}] specifies a profile of the compressed image
> > > > > > format
> > > > > > + specified in \field{format}. The driver SHOULD ignore this value
> > > > > > if
> > > > > > + \field{format} is a raw format.
> > > > >
> > > > > So how should this be used? The spec does not define any way to set
> > > > > profile for the device. It is very important for encoder.
> > > >
> > > > Thank you for pointing this.
> > > > These points are overlooked, as I didn't care about encoder usage
> > > > enough.
> > > >
> > > > After thinking it again, I think it's not a very good idea to include
> > > > supported profiles and levels in a struct for capability.
> > > > This is because these values are available only for limited number of
> > > > formats. Also, it's true that we need to have a way to set these values
> > > > as
> > > > Dmitry pointed.
> > >
> > > Yes, you are right. In fact, the approach of the v1 spec to keep controls
> > > separately was not correct.
> > >
> > > > Instead, it would make more sense to have additional three types of
> > > > controls for profiles, levels, and bitrates:
> > > > (1) QUERY_CONTROL: Query values supported by the device
> > > > (2) GET_CONTROL: Read a value that is set in the device
> > > > (3) SET_CONTROL: Set a value in the device
> > > >
> > > > These operations are similar to V4L2 controls.
> > > > (1), (2) and (3) would correspond VIDIOC_QUERY{CTRL,MENU}, S_CTRL, and
> > > > G_CTRL in V4L2, respectively.
> > > > Also, (3) would be similar to enum virtio_video_control_type in the
> > > > virtio-video v1 driver implementation in
> > > > https://markmail.org/message/dwghwdqsbl3gsjxu .
> > > >
> > > > For QUERY_CONTROL, my idea is like this:
> > > >
> > > > enum virtio_video_control_type {
> > > >
> > > > VIRTIO_VIDEO_CONTROL_UNDEFINED = 0,
> > > >
> > > > VIRTIO_VIDEO_CONTROL_BITRATE = 0x100,
> > > > VIRTIO_VIDEO_CONTROL_PROFILE,
> > > > VIRTIO_VIDEO_CONTROL_LEVEL,
> > > >
> > > > };
> > > >
> > > > struct virtio_video_query_control {
> > > >
> > > > struct virtio_video_ctrl_hdr hdr;
> > > > le32 control; /* One of VIRTIO_VIDEO_CONTROL_* types */
> > > > le32 length;
> > > > /* Followed by additional data.
> > > >
> > > > * If |control| is VIRTIO_VIDEO_CONTROL_PROFILE,
> > > > * the device must pass a codec format like H264 or VP9.
> > > > * The requred data must be defined in the specification.
> > > > */
> > > >
> > > > };
> > > >
> > > > struct virtio_video_query_control_resp {
> > > >
> > > > struct virtio_video_ctrl_hdr hdr;
> > > > le32 length;
> > > > u8 padding[4];
> > > > /* Followed by data corresponds to the specified control.
> > > >
> > > > * The type of data must be defined in the spec.
> > > > * For example, if the driver queries profiles, this part should be
> > > > * an array of supported profiles of a given format.
> > > > */
> > > >
> > > > };
> > > >
> > > > WDYT?
> > >
> > > I think virtio_video_control_type should make sense. But I would disagree
> > > with the need to have new QUERY_CONTROL and GET_CONTROL.
> > >
> > > I assume the set of supported controls is fixed for some particular format
> > > on a given IP. So we'd propose to include controls into format
> > > descriptors, so we don't need to QUERY_CONTROL. This way (with
> > > 'virtio_video_format_list') we can define not-contiguos ranges, e.g. for
> > > profiles.
> > >
> > > struct virtio_video_format_frame {
> > >
> > > /* As proposed in Keiichi's prev email */
> > >
> > > };
> > >
> > > virtio_video_format_list {
> > >
> > > le32 num_entries;
> > > u8 padding[4];
> > > /* Followed by le64 entries[] */
> > >
> > > };
> > >
> > > struct virtio_video_format_control {
> > >
> > > le32 type;
> > > u8 padding[4];
> > > struct virtio_video_format_list values;
> > >
> > > };
> > >
> > > struct virtio_video_format_desc {
> > >
> > > le64 mask;
> > > le32 format; /* One of VIRTIO_VIDEO_FORMAT_* types */
> > > le32 planes_layout; /* See the thread [v2 0/1] */
> > > le32 num_frames;
> > > le32 num_controls;
> > > /* Followed by struct virtio_video_format_frame frames[] */
> > > /* Followed by struct virtio_video_format_control controls[] */
> > >
> > > };
> >
> > In my understanding, a set of supported levels depends on a profile.
> > H.264 spec says that levels are specified within each profile.
> > cf. "0.5 Profiles and levels" in
> > https://www.itu.int/rec/T-REC-H.264-201906-I/en (V4L2's QUERYMENU doesn't
> > seem to provide a way to query levels for each profile properly, though)
> >
> > So, if we want to have supported profiles and levels in format_desc as
> > your idea, each supported profile should have a list of supported
> > levels.
> > I suppose it makes the structure of video_format_desc too complicated.
> > I'd like to avoid this complexity caused by some specific formats.
> >
> > Instead, I'd like to keep virtio_video_format_desc minimal and have
> > QUERY_CONTROL to query format-specific values like profiles and
> > levels.
> > This design would make it easy to extend controls when we want to
> > support new types of formats. We will just need to define new
> > VIRTIO_VIDEO_CONTROL_*.
> > What do you think?
>
> Yes, I think we can keep it this way, this indeed provides more flexibility and
> this way we won't need to modify the 'format' parsing logic when more controls
> are added.
Thanks! I'm preparing the next version of the patch.
I have one concern about terminology.
We have two different things called "controls" now:
1) Messages passed via controlq. i.e. "ctrl" in 'enum
virtio_video_ctrl_type' and 'virtio_video_ctrl_hdr'.
2) Commands for profiles, levels, and bitrates similar to V4L2
controls. e.g.., "CONTROL" in VIRTIO_VIDEO_CONTROL_*
I feel calling both "controls" is somewhat confusing and I am
wondering if we can't rename either of them.
For example, how about renaming 2) to "command"? Then, we'll have
{QUERY ,GET, SET}_COMMAND.
It may sound strange, though.
Do you have any idea?
Best regards,
Keiichi
>
> Best regards,
> Dmitry.
>
> >
> > Best regards,
> > Keiichi
> >
> > > SET_CONTROL seems to be mandatory. If it succeeds, we can store the
> > > current
> > > value locally, so there is no need to have GET_CONTROL.
> > >
> > > The only exception is the initial (default) control value in the device.
> > > But with the removal of 'function' and with addition of 'caps' instead,
> > > the way to provide defaults is gone. So I suppose for formats we'll be
> > > just using GET on driver start to get the 'defaults'. But the thing is
> > > that for formats there are other uses for GET, but for controls GET
> > > apparently does not make to much sense at runtime.
> > >
> > > > > Also, shouldn't the profile come together with level? Would make sense
> > > > > for
> > > > > encoders.
> > > >
> > > > Yeah. So, in the above idea of QUERY_CONTROL, profile should be
> > > > required when querying supported levels.
> > >
> > > So probably should be enumerated together, not queried, as per the comment
> > > above.
> > >
> > > > > > +\item[\field{mask}] is a bitset that represents the supported
> > > > > > + combination of input and output format. If \textit{i}-th bit is
> > > > > > set
> > > > > > + in \field{mask} of \textit{j}-th \field{struct
> > > > > > + virtio_video_format_desc} for input, the device supports encoding
> > > > > > or
> > > > > > + decoding from the \textit{j}-th input format to \textit{i}-th
> > > > > > output
> > > > > > + format.
> > > > > > +\item[\field{width, height}] represents a range of resolutions
> > > > > > + supported by the device. If its \field{step} is not applicable,
> > > > > > its
> > > > > > + \field{min} is equal to its \field{max}.
> > > > > > +\item[\field{num_rates}] is the length of an array
> > > > > > \field{frame_rates}.
> > > > > > In
> > > > > > case of decoder, the driver SHOULD ignore this value.
> > > > > > +\item[\field{frame_rates}] is an array of supported frame rates.
> > > > > > +\end{description}
> > > > > > +
> > > > >
> > > > > I'd guess frame rates depend on the resolution as well. This
> > > > > dependency
> > > > > was
> > > > > clear in the v1 spec, but in the v2 there is no dependency anymore. I
> > > > > think we need to update this.
> > > >
> > > > That's a good point. I missed that dependency when updating the
> > > > structures.
> > > > So, let me update the structs like the following:
> > > >
> > > > struct virtio_video_format_frame {
> > > >
> > > > struct virtio_video_format_range width;
> > > > struct virtio_video_format_range height;
> > > > le32 num_rates;
> > > > u8 padding[4];
> > > > /* Followed by struct virtio_video_format_range frame_rates[] */
> > > >
> > > > };
> > > >
> > > > struct virtio_video_format_desc {
> > > >
> > > > le64 mask;
> > > > le32 format; /* One of VIRTIO_VIDEO_FORMAT_* types */
> > > > le32 planes_layout; /* See the thread [v2 0/1] */
> > > > le32 num_frames;
> > > > u8 padding[4];
> > > > /* Followed by struct virtio_video_format_frame frames[] */
> > > >
> > > > };
> > >
> > > Yes, I do agree with this approach.
> > >
> > > Best regards,
> > > Dmitry.
> > >
> > > > Best regards,
> > > > Keiichi.
> > > >
> > > > > Best regards,
> > > > > Dmitry.
> > > > >
> > > > > > +\item[VIRTIO_VIDEO_T_STREAM_CREATE] create a video stream (context)
> > > > > > + within the device.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +enum virtio_video_mem_type {
> > > > > > + VIRTIO_VIDEO_MEM_TYPE_GUEST_PAGES,
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_stream_create {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + le32 in_mem_type; /* One of VIRTIO_VIDEO_MEM_TYPE_* types
> > > > > > */
> > > > > > + le32 out_mem_type; /* One of VIRTIO_VIDEO_MEM_TYPE_* types
> > > > > > */
> > > > > > + char debug_name[64];
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{in_mem_type}] is a type of buffer management for input
> > > > > > +buffers. The driver MUST set a value in \field{enum
> > > > > > +virtio_video_mem_type}.
> > > > > > +\item[\field{out_mem_type}] is a type of buffer management for
> > > > > > output
> > > > > > +buffers. The driver MUST set a value in \field{enum
> > > > > > +virtio_video_mem_type}.
> > > > > > +\item[\field{debug_name}] is a text string for a debug purpose.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_STREAM_DESTROY] destroy a video stream
> > > > > > (context)
> > > > > > + within the device.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_stream_destroy {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_STREAM_DRAIN] ask the device to push all the
> > > > > > + queued buffers through the pipeline.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_stream_drain {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_RESOURCE_CREATE] create a resource descriptor
> > > > > > + within the device.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_mem_entry {
> > > > > > + le64 addr;
> > > > > > + le32 length;
> > > > > > + u8 padding[4];
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_resource_create {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + le32 resource_id;
> > > > > > + le32 nr_entries;
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{resource_id}] internal id of the resource.
> > > > > > +\item[\field{nr_entries}] number of \field{struct
> > > > > > + virtio_video_mem_entry} memory entries.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_RESOURCE_DESTROY] destroy a resource
> > > > > > descriptor
> > > > > > + within the device.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_resource_destroy {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + le32 resource_id;
> > > > > > + u8 padding[4];
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{resource_id}] internal id of the resource.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_RESOURCE_QUEUE] Add a buffer to the device's
> > > > > > +queue.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +#define VIRTIO_VIDEO_MAX_PLANES 8
> > > > > > +
> > > > > > +struct virtio_video_resource_queue {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + le32 buf_type;
> > > > > > + le32 resource_id;
> > > > > > + le64 timestamp;
> > > > > > + le32 nr_data_size;
> > > > > > + le32 data_size[VIRTIO_VIDEO_MAX_PLANES];
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{buf_type}] buf_type of the .
> > > > > > +\item[\field{resource_id}] internal id of the resource.
> > > > > > +\item[\field{timestamp}] an abstract sequence counter that can be
> > > > > > used
> > > > > > + for synchronisation.
> > > > > > +\item[\field{nr_data_size}] number of \field{data_size} entries.
> > > > > > +\item[\field{data_size}] number of data bytes within a plane.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +enum virtio_video_buffer_flag {
> > > > > > + VIRTIO_VIDEO_BUFFER_F_ERR = 0x0001,
> > > > > > + VIRTIO_VIDEO_BUFFER_F_EOS = 0x0002,
> > > > > > + /* Encoder only */
> > > > > > + VIRTIO_VIDEO_BUFFER_IFRAME = 0x0004,
> > > > > > + VIRTIO_VIDEO_BUFFER_PFRAME = 0x0008,
> > > > > > + VIRTIO_VIDEO_BUFFER_BFRAME = 0x0010,
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_resource_queue_resp {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + le64 timestamp;
> > > > > > + le32 flags; /* One of VIRTIO_VIDEO_BUFFER_* flags */
> > > > > > + le32 size; /* Encoded size */
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{timestamp}] an abstract sequence counter that can be
> > > > > > used
> > > > > > + for synchronisation.
> > > > > > +\item[\field{flags}] mark specific buffers in the sequence.
> > > > > > +\item[\field{size}] data size in the buffer (encoder only).
> > > > > > +\end{description}
> > > > > > +
> > > > > > +The device sends a response to the queue request asynchronously
> > > > > > when
> > > > > > +it has finished processing the buffer.
> > > > > > +
> > > > > > +The device SHOULD mark a buffer that triggered a processing error
> > > > > > with
> > > > > > +the VIRTIO_VIDEO_BUFFER_F_ERR flag.
> > > > > > +
> > > > > > +The device MUST mark the last buffer with the
> > > > > > +VIRTIO_VIDEO_BUFFER_F_EOS flag to denote completion of the drain
> > > > > > +sequence.
> > > > > > +
> > > > > > +In case of encoder, to denote a particular frame type the devie
> > > > > > MUST
> > > > > > +mark the respective buffer with VIRTIO_VIDEO_BUFFER_IFRAME,
> > > > > > +VIRTIO_VIDEO_BUFFER_PFRAME, VIRTIO_VIDEO_BUFFER_BFRAME.
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_RESOURCE_QUEUE_CLEAR] Return already queued
> > > > > > + buffers back from the input or the output queue of the device.
> > > > > > The
> > > > > > + device SHOULD return all of the buffers from the respective queue
> > > > > > as
> > > > > > + soon as possible without pushing the buffers through the
> > > > > > processing
> > > > > > + pipeline.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_queue_clear {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + le32 buf_type;
> > > > > > + u8 padding[4];
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{buf_type}] buffer type.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_GET_PARAMS] Get parameters of the input or the
> > > > > > + output of a stream.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_plane_format {
> > > > > > + le32 plane_size;
> > > > > > + le32 stride;
> > > > > > + u8 padding[4];
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{plane_size}] size of the plane in bytes.
> > > > > > +\item[\field{stride}] stride used for the plane in bytes.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_params {
> > > > > > + le32 buf_type; /* One of VIRTIO_VIDEO_BUF_TYPE_* types */
> > > > > > + le32 fourcc; /* One of VIRTIO_VIDEO_FOURCC_* types */
> > > > > > + le32 frame_width;
> > > > > > + le32 frame_height;
> > > > > > + le32 min_buffers;
> > > > > > + le32 max_buffers;
> > > > > > + le32 frame_rate;
> > > > > > + struct virtio_video_crop {
> > > > > > + le32 left;
> > > > > > + le32 top;
> > > > > > + le32 width;
> > > > > > + le32 height;
> > > > > > + } crop;
> > > > > > + le32 num_planes;
> > > > > > + struct virtio_video_plane_format
> > > > > > plane_formats[VIRTIO_VIDEO_MAX_PLANES]; +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{frame_width}] the value to get/set.
> > > > > > +\item[\field{frame_height}] the value to get/set.
> > > > > > +\item[\field{pixel_format}] the value to get/set.
> > > > > > +\item[\field{min_buffers}] minimum buffers required to handle the
> > > > > > + format (r/o).
> > > > > > +\item[\field{max_buffers}] maximum buffers required to handle the
> > > > > > + format (r/o).
> > > > > > +\item[\field{frame_rate}] the value to get/set.
> > > > > > +\item[\field{crop}] cropping (composing) rectangle.
> > > > > > +\item[\field{num_planes}] number of planes used to store pixel data
> > > > > > +(r/o).
> > > > > > +\item[\field{plane_formats}] description of each plane.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_get_params {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + le32 buf_type; /* One of VIRTIO_VIDEO_BUF_TYPE_* types */
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_get_params_resp {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + struct virtio_video_params params;
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{buf_type}] buffer type.
> > > > > > +\item[\field{params}] parameter values.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\item[VIRTIO_VIDEO_T_SET_PARAMS] Change parameters of a stream.
> > > > > > +
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +struct virtio_video_set_params {
> > > > > > + struct virtio_video_ctrl_hdr hdr;
> > > > > > + struct virtio_video_params params;
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{params}] parameters to set.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +Setting stream parameters might have side effects within the
> > > > > > device.
> > > > > > +For example, the device MAY perform alignment of width and height,
> > > > > > +change the number of planes it uses for the format, or do whatever
> > > > > > +changes that are required to continue normal operation using the
> > > > > > +updated parameters. It is up to the driver to check the parameter
> > > > > > set
> > > > > > +after the VIRTIO_VIDEO_T_SET_PARAMS request has been issued.
> > > > > > +
> > > > > > +\end{description}
> > > > > > +
> > > > > > +\subsubsection{Device Operation: eventq}
> > > > > > +
> > > > > > +The device can report events on the event queue. The driver
> > > > > > initially
> > > > > > +populates the queue with device-writeable buffers. When the device
> > > > > > +needs to report an event, it fills a buffer and notifies the
> > > > > > driver.
> > > > > > +The driver consumes the report and adds a new buffer to the
> > > > > > virtqueue.
> > > > > > +
> > > > > > +\begin{lstlisting}
> > > > > > +enum virtio_video_event_type {
> > > > > > + VIRTIO_VIDEO_EVENT_T_UNDEFINED = 0,
> > > > > > + /* For all functions */
> > > > > > + VIRTIO_VIDEO_EVENT_T_ERROR_UNSPEC = 0x0100,
> > > > > > + /* For decoder only */
> > > > > > + VIRTIO_VIDEO_EVENT_T_DECODER_RESOLUTION_CHANGED = 0x0200,
> > > > > > +};
> > > > > > +
> > > > > > +struct virtio_video_event {
> > > > > > + le32 event_type; /* One of VIRTIO_VIDEO_EVENT_T_* types */
> > > > > > + le32 stream_id;
> > > > > > + u8 padding[4];
> > > > > > +};
> > > > > > +\end{lstlisting}
> > > > > > +
> > > > > > +\begin{description}
> > > > > > +\item[\field{event_type}] type of the triggered event .
> > > > > > +\item[\field{stream_id}] id of the source stream.
> > > > > > +\end{description}
> > > > > > +
> > > > > > +The device MUST send
> > > > > > VIRTIO_VIDEO_EVENT_T_DECODER_RESOLUTION_CHANGED
> > > > > > +whenever it encounters new resolution data in the stream. This
> > > > > > +includes the case of the initial device configuration after
> > > > > > metadata
> > > > > > +has been parsed and the case of dynamic resolution change.
>
>
More information about the Spice-devel
mailing list