[Intel-gfx] [PATCH v6 12/24] vfio/pci: Allow passing zero-length fd array in VFIO_DEVICE_PCI_HOT_RESET
Alex Williamson
alex.williamson at redhat.com
Wed Mar 22 12:23:25 UTC 2023
On Wed, 22 Mar 2023 04:42:16 +0000
"Liu, Yi L" <yi.l.liu at intel.com> wrote:
> > From: Alex Williamson <alex.williamson at redhat.com>
> > Sent: Wednesday, March 22, 2023 6:48 AM
> >
> > On Tue, 21 Mar 2023 19:20:37 -0300
> > Jason Gunthorpe <jgg at nvidia.com> wrote:
> >
> > > On Tue, Mar 21, 2023 at 03:01:12PM -0600, Alex Williamson wrote:
> > >
> > > > > Though it would be nice if qemu didn't need two implementations so Yi
> > > > > I'd rather see a new info in this series as well and qemu can just
> > > > > consistently use dev_id and never bdf in iommufd mode.
> > > >
> > > > We also need to consider how libvirt determines if QEMU has the kernel
> > > > support it needs to pass file descriptors. It'd be a lot cleaner if
> > > > this aligned with the introduction of vfio cdevs.
> > >
> > > Yes, that would be much better if it was one package.
> > >
> > > But this is starting to grow and we have so many threads that need to
> > > progress blocked on this cdev enablement :(
> > >
> > > Could we go forward with the cdev main patches and kconfig it to
> > > experimental or something while the rest of the parts are completed
> > > and tested through qemu? ie move the vfio-pci reset enablment to after
> > > the cdev patches?
> >
> > We need to be able to guarantee that there cannot be any significant
> > builds of the kernel with vfio cdev support if our intention is to stage
> > it for libvirt. We don't have a global EXPERIMENTAL config option any
> > more. Adding new code under BROKEN seems wrong. Fedora ships with
> > STAGING enabled. A sternly worded Kconfig entry is toothless. What is
> > the proposed mechanism to make this not look like a big uncompiled code
> > dump? Thanks,
>
> Just out of curious, is the BDF mapping gap only for cdev or it also
> exists in the traditional group path?
The group path doesn't support passing file descriptors, getting access
to the device files requires a full container configuration, which
implies significant policy decisions in libvirt. Even if groups were
passed, QEMU would need to know the device name, ie. BDF in string
format, to get the device from the group.
> IMHO, if it is only a gap for cdev, maybe
> we can use CONFIG_VFIO_DEVICE_CDEV to stage it. This kconfig is N by
> default. I think it won't change until one day the whole ecosystem is
> updated.
See the "toothless" comment above, disabling vfio cdev support by
default because we don't have feature parity in reset support does not
provide any guarantees to libvirt that it can effectively take
advantage of passing cdev fds to QEMU. Thanks,
Alex
More information about the Intel-gfx
mailing list