[PATCH v8 00/14] drm/tegra: Introduce a modern UABI
Mikko Perttunen
cyndis at kapsi.fi
Wed Jul 14 15:26:37 UTC 2021
On 7/14/21 5:50 PM, Dmitry Osipenko wrote:
> 14.07.2021 11:30, Thierry Reding пишет:
>> On Sat, Jul 10, 2021 at 12:16:28AM +0300, Dmitry Osipenko wrote:
>>> Hello Thierry,
>>>
>>> 09.07.2021 22:31, Thierry Reding пишет:
>>>> From: Thierry Reding <treding at nvidia.com>
>>>>
>>>> Hi all,
>>>>
>>>> Mikko has been away for a few weeks, so I've been testing and revising
>>>> the new UABI patches in the meantime. There are very minor changes to
>>>> the naming of some of the UABI fields, but other than that it's mostly
>>>> unchanged from v7.
>>>
>>> Why you haven't addressed any of the previous review comments? There
>>> were some obvious problems in v7 and v8 still has them.
>>>
>>>> One notable change is that mappings can now be read-only, write-only,
>>>> read-write or none of them (rather than just read-only or read-write),
>>>> since those combinations are all supported by the IOMMUs and it might
>>>> be useful to make some mappings write-only.
>>>>
>>>> For a full list of changes in v8, see the changelog in patch 6.
>>>>
>>>> I've also updated the libdrm_tegra library to work against this version
>>>> of the UABI. A branch can be found here:
>>>>
>>>> https://gitlab.freedesktop.org/tagr/drm/-/commits/drm-tegra-uabi-v8
>>>>
>>>> That contains helper APIs for the concepts introduced in this series and
>>>> shows how they can be used in various tests that can be run for sanity
>>>> checking.
>>>>
>>>> In addition, Mikko has made updates to the following projects, though
>>>> they may need to be updated for the minor changes in v8:
>>>>
>>>> * vaapi-tegra-driver - https://github.com/cyndis/vaapi-tegra-driver
>>>> Experimental support for MPEG2 and H264 decoding on T210, T186
>>>> and T194.
>>>>
>>>> * xf86-video-opentegra - https://github.com/grate-driver/xf86-video-opentegra
>>>> X11 userspace acceleration driver for Tegra20, Tegra30, and Tegra114.
>>>>
>>>> * grate - https://github.com/grate-driver/grate
>>>> 3D rendering testbed for Tegra20, Tegra30, and Tegra114
>>>>
>>>> I plan on putting this into linux-next soon after v5.14-rc1 so that this
>>>> can get some soak time.
>>>
>>> It should be a bit too early to push it into kernel. The UAPI is not
>>> ready because it's missing essential features. We can't call this a
>>> 'modern UABI' until it's fully implemented. The design decisions are
>>> still questionable because this UAPI is built around the proprietary
>>> firmware (and based on UAPI of downstream driver) which doesn't fit well
>>> into DRM world. I haven't got all the answers to my previous questions,
>>> should I repeat them?
>>
>> I don't know what you means by "built around the proprietary firmware".
>> Yes, this ends up using proprietary firmware for some of the hardware
>> engines that host1x drives, but that's completely orthogonal to the
>> UABI. No matter what UABI we'd be introducing, we'd be using that same
>> firmware.
>>
>> And yes, this is based on the UABI of the downstream drivers. The design
>> is guided by what we've learned over the last decade working with this
>> hardware in use-cases that customers need. It'd be dumb not to use that
>> knowledge to our advantage. This is the only way to ensure we can
>> deliver an upstream driver that's on par with our downstream drivers and
>> therefore make it possible to eventually adopt the upstream driver.
>>
>> And frankly, you did get answers to previous questions, though perhaps
>> not all, but I'm out of patience. We've been going in circles and at
>> some point we have to make a decision so we can make progress.
>
> By firmware I was referring to the supervisor OS and inter-VM
> integration, sorry for not making it clear. My rough understanding is
> that it's all software defined and technically it's possible to avoid
> going though the trouble of supporting the firmware convention defined
> by downstream, and thus, making driver less optimal than it could be.
> It's still not clear to me how much that firmware is relevant to
> upstream in practice.
As mentioned in discussion elsewhere, there is no 'firmware convention'.
The view I've formed so far is that the model of ephemeral syncpoint
allocations and value resets, which I believe you are talking about
here, is fundamentally opposed to the design intent of the hardware, and
would result in a less efficient system regardless of inter-VM
integration convention.
>
>> I made several attempts over the years to get something usable merged
>> upstream so that we can finally make use of this hardware and get it
>> supported upstream and each time I made the mistake of trying to make it
>> perfect and accomodate all wishlist items. The result is that I wasted a
>> lot of time and have nothing to show for it.
>
> It's a problem that you try to do everything on your own and not
> collaborating as much as you could. Writing code isn't a problem, the
> problem is that there is no clear understanding of what needs to be
> done, IMO. I have a vision of whats need to be done from a perspective
> of older SoCs, but I never could start implementing it for upstream
> because it requires yours feedback and preliminary agreement since
> you're the only maintainer of the driver who could merge patches I don't
> want to waste my time too.
>
>> I've also been very hard Mikko with his work on this and I think we've
>> stretched this as far as we can without compromising too much on what we
>> are going to need from this UABI in the future.
>>
>> We've gone through the process of making sure all existing userspace can
>> and does work with this new UABI and even left the old UABI in place in
>> case we need it.
>
> That is great, but it is not enough. So far we are enabling a minimal
> support for newer hardware here, but we also need to solve the older
> problems that are relevant to all SoCs and may well affect the UABI
> decisions.
>
> It became apparent to me now that yours only goal here is to enable
> newer hardware, which is fine. It's also apparent to me that you still
> don't have a clear vision of what needs to be done overall, but this is
> fine too since nothing is removed here and it's in a staging phase yet.
While my goal of course is to enable proper use of Host1x on the newer
SoCs, there is absolutely no intention to forget about the older SoCs.
Observably, to me at least, GR2D and GR3D are working -- the test suites
are passing (though I did not port/try mesa). We are also not regressing
anything, and I do not think after this series we are worse (at least in
any fundamental manner) than the downstream software stack that these
chips were originally productized with. As such I have a hard time
understanding the doom and gloom about the driver's state and needing
grand overarching re-architectures.
>
>> I'm reasonably satisfied with what we have now and I don't see any
>> reason to hold this back any further. We always have the option of
>> adding UABI if we need it for something, or extend functionality of
>> existing UABI where it makes sense. But we also do have to start
>> somewhere, otherwise we're just not going to get anywhere, as the last
>> 10 years have shown.
>
> We're starting in a bit wrong direction by extending the old code base,
> adding unnecessary burden to userspace of older SoCs and adding sync
> point UAPI that may not integrate with the proper jobs scheduling. It
> probably shouldn't be a big problem to rework it all later on if will be
> needed, but we could avoid this extra work beforehand if we could put
> more effort into implementing more features from the start.
>
As mentioned before, in my opinion, redoing the internals while
supporting the legacy UAPI would be more difficult. Having the new UAPI
in place will also guide the improvements to be made - and at least on
new SoCs where hardware scheduling works fine we want to take advantage
of that in the UAPI. And I don't know of any issue in the UAPI that
would prevent use of a software scheduler (depending on implementation
some fields could be unused in some cases but it's already the case that
you don't always use every field).
>>> UAPI is not the only problem that we have. The performance and stability
>>> of the driver are in a very bad shape too. The modern UAPI can't be
>>> built on top of the old code. It's clear now that this is a very serious
>>> problem that must be addressed along with the UAPI work and I'm getting
>>> silence from you guys.
>>
>> We've been over this multiple times before, though perhaps never over
>> email. So let me make this clear here again and for future reference: we
>> will *not* be rewriting the driver from scratch.
>
> This was unnecessary, we agreed on the incremental updates long time
> ago. My point is that we shouldn't be building the new UAPI on top of
> the old code, instead we should prioritize the upgrading to a modern
> APIs and fixing the lame abstractions of the legacy driver.
>
> Mikko assures that this effort won't stop after merging this first bits
> and we will start upgrading driver soon. I trust Mikko. You guys want to
> light up the newer hardware so much that I don't know how to stop you :)
> Even if this will become a mistake later on, likely it won't be a
> critical mistake, so let's not hold it.
>
I have a long TODO list of improvements to work on. Admittedly, I won't
have as much time to work on it as I would have before since I need to
start working on other projects in parallel as well. And things will
need to be agreed on (e.g. as alluded to earlier I still don't know of
any concrete reason why we would need to add a software scheduler. I can
only make guesses. It probably makes sense for the old SoCs, but I don't
know why we now need one and downstream never did)
Mikko
More information about the dri-devel
mailing list