[PATCH v2 1/1] drm/virtio: Implement device_attach

Zhang, Julia Julia.Zhang at amd.com
Fri Dec 6 06:43:54 UTC 2024


Hi Christian,

Thanks for your reply.
I get your point.  Actually I have a design to send a command from guest with pci notations of amdgpu and virtio-gpu to host side to calculate physical p2pdma_distance. I will send out those patches soon.
Best regards,
Julia


On 2024/12/4 17:32, Christian König wrote:
Hi Julia,

sorry I totally missed your mail.

The basic problem for P2P is what I already described in my previous mail:


Well the problem is the virtualized environment. pci_p2pdma_distance() checks if two physical PCI devices can communicate with each other (and returns how many hops are in between).

But inside a VM you don't see the physical devices, you can only see passed through devices plus your virtual device and a bunch of virtual bridges.

So what pci_p2pdma_distance() returns inside the VM is actually completely meaningless. It can be that P2P works, but it can also be P2P doesn't work because on the physical system you have a bridge, root complex or whatever which is blacklisted and won't work for some reason.

So the basic problem is that you can't figure out inside the VM if P2P is possible or not.

As long as you don't fix this it's irrelevant if you have get_sg_table implemented or not, you first need to figure out the basic and not try to implement some detail.

Regards,
Christian.
Am 04.12.24 um 04:46 schrieb Zhang, Julia:

[AMD Official Use Only - AMD Internal Distribution Only]

Hi Sima, Christian,

I would like to rediscuss about p2p in guest VM, can you please take a look. Thanks.

Best regards,
Julia

From: Zhang, Julia <Julia.Zhang at amd.com><mailto:Julia.Zhang at amd.com>
Sent: Friday, November 29, 2024 3:52 PM
To: Koenig, Christian <Christian.Koenig at amd.com><mailto:Christian.Koenig at amd.com>; Zhang, Julia <Julia.Zhang at amd.com><mailto:Julia.Zhang at amd.com>; Gurchetan Singh <gurchetansingh at chromium.org><mailto:gurchetansingh at chromium.org>; Chia-I Wu <olvaffe at gmail.com><mailto:olvaffe at gmail.com>; David Airlie <airlied at redhat.com><mailto:airlied at redhat.com>; Gerd Hoffmann <kraxel at redhat.com><mailto:kraxel at redhat.com>; linux-kernel at vger.kernel.org<mailto:linux-kernel at vger.kernel.org>; dri-devel at lists.freedesktop.org<mailto:dri-devel at lists.freedesktop.org>; amd-gfx at lists.freedesktop.org<mailto:amd-gfx at lists.freedesktop.org>; virtualization at lists.linux-foundation.org<mailto:virtualization at lists.linux-foundation.org>; Deucher, Alexander <Alexander.Deucher at amd.com><mailto:Alexander.Deucher at amd.com>; David Airlie <airlied at gmail.com><mailto:airlied at gmail.com>; Erik Faye-Lund <kusmabite at gmail.com><mailto:kusmabite at gmail.com>; Olsak, Marek <Marek.Olsak at amd.com><mailto:Marek.Olsak at amd.com>; Pelloux-Prayer, Pierre-Eric <Pierre-eric.Pelloux-prayer at amd.com><mailto:Pierre-eric.Pelloux-prayer at amd.com>; Huang, Honglei1 <Honglei1.Huang at amd.com><mailto:Honglei1.Huang at amd.com>; Chen, Jiqian <Jiqian.Chen at amd.com><mailto:Jiqian.Chen at amd.com>; Huang, Ray <Ray.Huang at amd.com><mailto:Ray.Huang at amd.com>; David Stevens <stevensd at chromium.org><mailto:stevensd at chromium.org>
Cc: Huang, Ray <Ray.Huang at amd.com><mailto:Ray.Huang at amd.com>; Zhu, Lingshan <Lingshan.Zhu at amd.com><mailto:Lingshan.Zhu at amd.com>; robdclark at chromium.org<mailto:robdclark at chromium.org>
Subject: Re: [PATCH v2 1/1] drm/virtio: Implement device_attach


Hi all,

Sorry for my late reply. I don't know if you still remember this thread, let me give a quick summary:

  1.  We want to implement the dGPU prime feature in guest VM. But we encountered this issue: virtio-gpu doesn’t have ->get_sg_table implemented which is required by drm_gem_map_attach(). This is modified by: 207395da5a97 (“drm/prime: reject DMA-BUF attach when get_sg_table is missing”).
  2.  To fix this, I override the function virtgpu_gem_device_attach() to not call drm_gem_map_attach() for vram object so drm_gem_map_attach() will not return -ENOSYS for not having ->get_sg_table.
  3.  Then you think this is incorrect and drm_gem_map_attach() requires get_sg_table to be implemented is intentional. I should either implement ->attach or ->get_sg_table for virtio-gpu.
  4.  As discussed, I implemented ->attach for virtio-gpu, but you suggested that I should check peer2peer flag first.
  5.  Now I have the implementation to get p2p_distance and check the p2p flag already, but I found that Rob Clark merged a patch to fix above patch: 207395da5a97 (“drm/prime: reject DMA-BUF attach when get_sg_table is missing”)

     *   Rob’s patch: https://patchwork.freedesktop.org/patch/584318/

  1.  With Rob’s patch, ->get_sg_table isn’t required for virtio-gpu anymore and  it seems p2p flag also doesn’t need to be checked anymore.

So I want to rediscuss if we still need to do p2p checking now?

If so, I will send out my implementation soon.

Best regards,

Julia


On 2024/1/31 22:32, Christian König wrote:
Am 31.01.24 um 11:20 schrieb Zhang, Julia:

On 2024/1/30 22:23, Christian König wrote:

Am 30.01.24 um 12:16 schrieb Daniel Vetter:

On Tue, Jan 30, 2024 at 12:10:31PM +0100, Daniel Vetter wrote:
[SNIP]

Hi Sima, Christian,



Yeah, that is really just speculative. All importers need to set the peer2peer flag just in case.

I see, I will modify this.



What happens under the hood is that IOMMU redirects the "VRAM" memory access to whatever address the DMA-buf on the host is pointing to (system, VRAM, doorbell, IOMMU, whatever).



I'm also not 100% sure if all the cache snooping is done correctly in all cases, but for now it seems to work.

Frankly the more I look at the original patch that added vram export

support the more this just looks like a "pls revert, this is just too

broken".

The commit I mean is this one: ea5ea3d8a117 ("drm/virtio: support mapping

exported vram"). The commit message definitely needs to cite that one, and

also needs a cc: stable because not rejecting invalid imports is a pretty

big deal.

Yeah, I've pointed out that commit in an internal discussion as well. I was just not aware that it's that severely broken.



Yeah we have mentioned this patch before, but I don't totally understand why this is too broken. Without exporting vram objects, dGPU prime feature would not be realized.

Would you mind to explain more about it. Thanks!

One reason is that using sg tables without struct pages is actually a hack we came up with because we couldn't hope to clean up the sg table structure any time soon to not include struct page pointers.

Another reason is that using this with devices which don't expect a DMA address pointing into a virtual PCI BAR. So doing this without checking the peer2peer flag can most likely cause quite a bit of trouble.

Regards,
Christian.



Best regards,

Julia



Regards,

Christian.



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/amd-gfx/attachments/20241206/58eb1325/attachment-0001.htm>


More information about the amd-gfx mailing list