[RFC] drm/i915: Allow dmabuf mmap forwarding
Tvrtko Ursulin
tvrtko.ursulin at linux.intel.com
Tue Dec 19 14:52:15 UTC 2023
On 13/12/2023 14:13, Christian König wrote:
> On 13.12.23 at 12:46, Tvrtko Ursulin wrote:
>>
>> Hi,
>>
>> On 12/12/2023 14:10, Christian König wrote:
>>> Hi Tvrtko,
>>>
>>> Thanks for pointing this mail out once more, I've totally missed it.
>>
>> That's okay; if it had been really urgent I would have re-raised the
>> thread earlier. :) As it stands, it is only about acceptance test
>> suites failing, with no known real use cases affected.
>>
>>> On 12.12.23 at 11:37, Tvrtko Ursulin wrote:
>>>>
>>>> On 25/09/2023 14:16, Tvrtko Ursulin wrote:
>>>>> From: Tvrtko Ursulin <tvrtko.ursulin at intel.com>
>>>>>
>>>>> Allow mmap forwarding for imported buffers in order to allow minigbm
>>>>> mmap to work on aperture-less platforms such as Meteorlake.
>>>>>
>>>>> So far i915 did not allow mmap on imported buffers, but from the
>>>>> minigbm perspective that worked because the DRM_IOCTL_I915_GEM_MMAP_GTT
>>>>> fallback would then be attempted, and would be successful.
>>>>>
>>>>> This stops working on Meteorlake since there is no aperture.
>>>>>
>>>>> Allow i915 to mmap imported buffers using forwarding via
>>>>> dma_buf_mmap(), which allows the primary minigbm path of
>>>>> DRM_IOCTL_I915_GEM_MMAP_OFFSET / I915_MMAP_OFFSET_WB to work.
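>>>>>
>>>>> For reference, the primary minigbm path is roughly the following from
>>>>> userspace (an illustrative sketch, not actual minigbm code; map_wb()
>>>>> and the omitted error handling are made up for brevity):
>>>>>
>>>>>   #include <stdint.h>
>>>>>   #include <sys/ioctl.h>
>>>>>   #include <sys/mman.h>
>>>>>   #include <drm/i915_drm.h>
>>>>>
>>>>>   static void *map_wb(int drm_fd, uint32_t handle, size_t size)
>>>>>   {
>>>>>           struct drm_i915_gem_mmap_offset arg = {
>>>>>                   .handle = handle,
>>>>>                   .flags = I915_MMAP_OFFSET_WB,
>>>>>           };
>>>>>
>>>>>           /* Ask for the fake offset which identifies the object. */
>>>>>           if (ioctl(drm_fd, DRM_IOCTL_I915_GEM_MMAP_OFFSET, &arg))
>>>>>                   return NULL;
>>>>>
>>>>>           /* mmap() of the DRM fd at that offset maps the object. */
>>>>>           return mmap(NULL, size, PROT_READ | PROT_WRITE, MAP_SHARED,
>>>>>                       drm_fd, arg.offset);
>>>>>   }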
>>>>>
>>>>> Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin at intel.com>
>>>>> Cc: Daniel Vetter <daniel.vetter at ffwll.ch>
>>>>> Cc: Christian König <christian.koenig at amd.com>
>>>>> Cc: Matthew Auld <matthew.auld at intel.com>
>>>>> Cc: Nirmoy Das <nirmoy.das at intel.com>
>>>>> ---
>>>>> 1)
>>>>> It is unclear to me if any real userspace depends on this, but there
>>>>> are certainly compliance suites which fail.
>>>
>>> Well that is actually intentional, but see below.
>>>
>>>>>
>>>>> 2)
>>>>> It is also a bit unclear to me if dma_buf_mmap() is exactly intended
>>>>> for this kind of use. It seems that it is, but I also found some old
>>>>> mailing list discussions suggesting there might be some unresolved
>>>>> questions around VMA revocation.
>>>
>>> I actually solved those a few years back by introducing the
>>> vma_set_file() function which standardized the dance necessary for
>>> the dma_buf_mmap() function.
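>>>
>>> From memory the helper is essentially just the following, so the VMA's
>>> backing file is swapped for the dma-buf's file with the reference
>>> counts kept balanced, which is what the old revocation concerns were
>>> about (a sketch, not verbatim kernel code):
>>>
>>>   /*
>>>    * Change the backing file of a VMA; only valid during initial VMA
>>>    * setup. Takes a reference on the new file before dropping the
>>>    * reference on the old one.
>>>    */
>>>   void vma_set_file(struct vm_area_struct *vma, struct file *file)
>>>   {
>>>           get_file(file);
>>>           swap(vma->vm_file, file);
>>>           fput(file);
>>>   }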
>>>
>>>>>
>>>>> 1 + 2 = RFC for now.
>>>>>
>>>>> Daniel and Christian were involved in 2) in the past so comments would
>>>>> be appreciated.
>>>>
>>>> Any comments on this one? I don't have all the historical knowledge
>>>> of whether this was attempted before and what problems were hit. So
>>>> would there be downsides, or is it fine to forward?
>>>
>>> It works technically inside the kernel, and Thomas Zimmermann suggested
>>> a patch set which made it possible to use for all DRM drivers.
>>>
>>> But IIRC this patch set was rejected with the rationale that, while
>>> doing an mmap() on an imported DMA-buf technically works, userspace
>>> actually doing so indicates a bug in userspace. The UMD doesn't seem
>>> to be aware of the fact that the buffer is imported and that it would,
>>> for example, need to call dma_buf_begin_cpu_access() and
>>> dma_buf_end_cpu_access().
>>>
>>> UMDs can trivially work around this by doing the mmap() on the DMA-buf
>>> file descriptor instead (potentially after re-exporting it), but the
>>> kernel really shouldn't help hide userspace bugs.
>>
>> Hm right, but then why does drm_gem_shmem_mmap do this:
>>
>>         if (obj->import_attach) {
>>                 ret = dma_buf_mmap(obj->dma_buf, vma, 0);
>
> Honestly I have absolutely no idea.
>
>> Isn't that allowing drivers which use the helper to forward to
>> dma_buf_mmap()?
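>>
>> For context, on a current kernel the whole branch looks roughly like
>> this (quoting from memory, so the details may be off):
>>
>>         if (obj->import_attach) {
>>                 /* Reset vm_ops and vm_private_data, so we don't end up
>>                  * with vm_ops pointing at our implementation if the
>>                  * dma-buf backend doesn't set those fields.
>>                  */
>>                 vma->vm_private_data = NULL;
>>                 vma->vm_ops = NULL;
>>
>>                 ret = dma_buf_mmap(obj->dma_buf, vma, 0);
>>
>>                 /* Drop the reference drm_gem_mmap_obj() acquired. */
>>                 if (!ret)
>>                         drm_gem_object_put(obj);
>>
>>                 return ret;
>>         }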
>
> Yes, Daniel mentioned that some drivers did this before we found that
> it's actually not a good idea. It could be that this code path stems
> from that, and we only keep it to avoid breaking UAPI.
>
> Nevertheless, I think we should add documentation for this.
>
>> Maybe I am getting lost in the forest of callbacks in this area...
>> Because it is supposed to be about shmem objects, but drivers which use
>> the helper, rely on the common prime import logic, and also use
>> drm_gem_shmem_prime_import_sg_table, can get there.
>
> I don't fully understand it either, off hand.
Right, I will leave untangling the jungle of callbacks and helpers in my
mind for later too. For now I think the rationale that the API does not
allow correctness via dma_buf_begin_cpu_access/end_cpu_access is good
enough to pass on as will-not-implement. Thanks for the clarifications!
Regards,
Tvrtko
>
> Regards,
> Christian.
>
>>
>> Regards,
>>
>> Tvrtko
>>
>>>>>
>>>>> Test-with: 20230925131539.32743-1-tvrtko.ursulin at linux.intel.com
>>>>>
>>>>> ---
>>>>> drivers/gpu/drm/i915/gem/i915_gem_mman.c | 78 +++++++++++++++----
>>>>> .../gpu/drm/i915/gem/i915_gem_object_types.h | 1 +
>>>>> 2 files changed, 65 insertions(+), 14 deletions(-)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_mman.c b/drivers/gpu/drm/i915/gem/i915_gem_mman.c
>>>>> index aa4d842d4c5a..78c84c0a8b08 100644
>>>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_mman.c
>>>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_mman.c
>>>>> @@ -5,6 +5,7 @@
>>>>> */
>>>>> #include <linux/anon_inodes.h>
>>>>> +#include <linux/dma-buf.h>
>>>>> #include <linux/mman.h>
>>>>> #include <linux/pfn_t.h>
>>>>> #include <linux/sizes.h>
>>>>> @@ -664,6 +665,7 @@ insert_mmo(struct drm_i915_gem_object *obj, struct i915_mmap_offset *mmo)
>>>>> static struct i915_mmap_offset *
>>>>> mmap_offset_attach(struct drm_i915_gem_object *obj,
>>>>> enum i915_mmap_type mmap_type,
>>>>> + bool forward_mmap,
>>>>> struct drm_file *file)
>>>>> {
>>>>> struct drm_i915_private *i915 = to_i915(obj->base.dev);
>>>>> @@ -682,6 +684,7 @@ mmap_offset_attach(struct drm_i915_gem_object *obj,
>>>>> mmo->obj = obj;
>>>>> mmo->mmap_type = mmap_type;
>>>>> + mmo->forward_mmap = forward_mmap;
>>>>> drm_vma_node_reset(&mmo->vma_node);
>>>>> err = drm_vma_offset_add(obj->base.dev->vma_offset_manager,
>>>>> @@ -714,12 +717,25 @@ mmap_offset_attach(struct drm_i915_gem_object *obj,
>>>>> return ERR_PTR(err);
>>>>> }
>>>>> +static bool
>>>>> +should_forward_mmap(struct drm_i915_gem_object *obj,
>>>>> + enum i915_mmap_type mmap_type)
>>>>> +{
>>>>> + if (!obj->base.import_attach)
>>>>> + return false;
>>>>> +
>>>>> + return mmap_type == I915_MMAP_TYPE_WB ||
>>>>> + mmap_type == I915_MMAP_TYPE_WC ||
>>>>> + mmap_type == I915_MMAP_TYPE_UC;
>>>>> +}
>>>>> +
>>>>> static int
>>>>> __assign_mmap_offset(struct drm_i915_gem_object *obj,
>>>>> enum i915_mmap_type mmap_type,
>>>>> u64 *offset, struct drm_file *file)
>>>>> {
>>>>> struct i915_mmap_offset *mmo;
>>>>> + bool should_forward;
>>>>> if (i915_gem_object_never_mmap(obj))
>>>>> return -ENODEV;
>>>>> @@ -735,12 +751,15 @@ __assign_mmap_offset(struct drm_i915_gem_object *obj,
>>>>> if (mmap_type == I915_MMAP_TYPE_FIXED)
>>>>> return -ENODEV;
>>>>> + should_forward = should_forward_mmap(obj, mmap_type);
>>>>> +
>>>>> if (mmap_type != I915_MMAP_TYPE_GTT &&
>>>>> !i915_gem_object_has_struct_page(obj) &&
>>>>> - !i915_gem_object_has_iomem(obj))
>>>>> + !i915_gem_object_has_iomem(obj) &&
>>>>> + !should_forward)
>>>>> return -ENODEV;
>>>>> - mmo = mmap_offset_attach(obj, mmap_type, file);
>>>>> + mmo = mmap_offset_attach(obj, mmap_type, should_forward, file);
>>>>> if (IS_ERR(mmo))
>>>>> return PTR_ERR(mmo);
>>>>> @@ -936,6 +955,32 @@ static struct file *mmap_singleton(struct drm_i915_private *i915)
>>>>> return file;
>>>>> }
>>>>> +static void
>>>>> +__vma_mmap_pgprot(struct vm_area_struct *vma, enum i915_mmap_type mmap_type)
>>>>> +{
>>>>> + const pgprot_t pgprot = vm_get_page_prot(vma->vm_flags);
>>>>> +
>>>>> + switch (mmap_type) {
>>>>> + case I915_MMAP_TYPE_WC:
>>>>> + vma->vm_page_prot = pgprot_writecombine(pgprot);
>>>>> + break;
>>>>> + case I915_MMAP_TYPE_FIXED:
>>>>> + GEM_WARN_ON(1);
>>>>> + fallthrough;
>>>>> + case I915_MMAP_TYPE_WB:
>>>>> + vma->vm_page_prot = pgprot;
>>>>> + break;
>>>>> + case I915_MMAP_TYPE_UC:
>>>>> + vma->vm_page_prot = pgprot_noncached(pgprot);
>>>>> + break;
>>>>> + case I915_MMAP_TYPE_GTT:
>>>>> + vma->vm_page_prot = pgprot_writecombine(pgprot);
>>>>> + break;
>>>>> + }
>>>>> +
>>>>> + vma->vm_page_prot = pgprot_decrypted(vma->vm_page_prot);
>>>>> +}
>>>>> +
>>>>> static int
>>>>> i915_gem_object_mmap(struct drm_i915_gem_object *obj,
>>>>> struct i915_mmap_offset *mmo,
>>>>> @@ -953,6 +998,20 @@ i915_gem_object_mmap(struct drm_i915_gem_object *obj,
>>>>> vm_flags_clear(vma, VM_MAYWRITE);
>>>>> }
>>>>> + /* dma-buf import */
>>>>> + if (mmo && mmo->forward_mmap) {
>>>>> + __vma_mmap_pgprot(vma, mmo->mmap_type);
>>>>> + vm_flags_set(vma, VM_PFNMAP | VM_DONTEXPAND | VM_DONTDUMP | VM_IO);
>>>>> +
>>>>> + /*
>>>>> + * We don't have our vm_ops to drop the reference in this case, so
>>>>> + * drop it now; if the object goes away userspace will fault.
>>>>> + */
>>>>> + i915_gem_object_put(mmo->obj);
>>>>> +
>>>>> + return dma_buf_mmap(obj->base.dma_buf, vma, 0);
>>>>> + }
>>>>> +
>>>>> anon = mmap_singleton(to_i915(dev));
>>>>> if (IS_ERR(anon)) {
>>>>> i915_gem_object_put(obj);
>>>>> @@ -982,34 +1041,25 @@ i915_gem_object_mmap(struct drm_i915_gem_object *obj,
>>>>> vma->vm_private_data = mmo;
>>>>> + __vma_mmap_pgprot(vma, mmo->mmap_type);
>>>>> +
>>>>> switch (mmo->mmap_type) {
>>>>> case I915_MMAP_TYPE_WC:
>>>>> - vma->vm_page_prot =
>>>>> - pgprot_writecombine(vm_get_page_prot(vma->vm_flags));
>>>>> vma->vm_ops = &vm_ops_cpu;
>>>>> break;
>>>>> -
>>>>> case I915_MMAP_TYPE_FIXED:
>>>>> GEM_WARN_ON(1);
>>>>> fallthrough;
>>>>> case I915_MMAP_TYPE_WB:
>>>>> - vma->vm_page_prot = vm_get_page_prot(vma->vm_flags);
>>>>> vma->vm_ops = &vm_ops_cpu;
>>>>> break;
>>>>> -
>>>>> case I915_MMAP_TYPE_UC:
>>>>> - vma->vm_page_prot =
>>>>> - pgprot_noncached(vm_get_page_prot(vma->vm_flags));
>>>>> vma->vm_ops = &vm_ops_cpu;
>>>>> break;
>>>>> -
>>>>> case I915_MMAP_TYPE_GTT:
>>>>> - vma->vm_page_prot =
>>>>> - pgprot_writecombine(vm_get_page_prot(vma->vm_flags));
>>>>> vma->vm_ops = &vm_ops_gtt;
>>>>> break;
>>>>> }
>>>>> - vma->vm_page_prot = pgprot_decrypted(vma->vm_page_prot);
>>>>> return 0;
>>>>> }
>>>>> @@ -1084,7 +1134,7 @@ int i915_gem_fb_mmap(struct drm_i915_gem_object *obj, struct vm_area_struct *vma
>>>>> } else {
>>>>> /* handle stolen and smem objects */
>>>>> mmap_type = i915_ggtt_has_aperture(ggtt) ? I915_MMAP_TYPE_GTT : I915_MMAP_TYPE_WC;
>>>>> - mmo = mmap_offset_attach(obj, mmap_type, NULL);
>>>>> + mmo = mmap_offset_attach(obj, mmap_type, false, NULL);
>>>>> if (IS_ERR(mmo))
>>>>> return PTR_ERR(mmo);
>>>>> }
>>>>> diff --git a/drivers/gpu/drm/i915/gem/i915_gem_object_types.h b/drivers/gpu/drm/i915/gem/i915_gem_object_types.h
>>>>> index 0c5cdab278b6..b4f86fa020aa 100644
>>>>> --- a/drivers/gpu/drm/i915/gem/i915_gem_object_types.h
>>>>> +++ b/drivers/gpu/drm/i915/gem/i915_gem_object_types.h
>>>>> @@ -225,6 +225,7 @@ struct i915_mmap_offset {
>>>>> struct drm_vma_offset_node vma_node;
>>>>> struct drm_i915_gem_object *obj;
>>>>> enum i915_mmap_type mmap_type;
>>>>> + bool forward_mmap;
>>>>> struct rb_node offset;
>>>>> };
>>>
>