[PATCH v4 07/14] drm/panthor: Add the MMU/VM logical block
Boris Brezillon
boris.brezillon at collabora.com
Sat Feb 10 08:02:06 UTC 2024
Hi Steve,
On Fri, 9 Feb 2024 16:51:46 +0000
Steven Price <steven.price at arm.com> wrote:
> On 22/01/2024 16:30, Boris Brezillon wrote:
> > MMU and VM management is related and placed in the same source file.
> >
> > Page table updates are delegated to the io-pgtable-arm driver that's in
> > the iommu subsystem.
> >
> > The VM management logic is based on drm_gpuva_mgr, and is assuming the
> > VA space is mostly managed by the usermode driver, except for a reserved
> > portion of this VA-space that's used for kernel objects (like the heap
> > contexts/chunks).
> >
> > Both asynchronous and synchronous VM operations are supported, and
> > internal helpers are exposed to allow other logical blocks to map their
> > buffers in the GPU VA space.
> >
> > There's one VM_BIND queue per-VM (meaning the Vulkan driver can only
> > expose one sparse-binding queue), and this bind queue is managed with
> > a 1:1 drm_sched_entity:drm_gpu_scheduler, such that each VM gets its own
> > independent execution queue, avoiding VM operation serialization at the
> > device level (things are still serialized at the VM level).
> >
> > The rest is just implementation details that are hopefully well explained
> > in the documentation.
>
> panthor_vm_map_pages() looks a bit suspect (see below) and there's a
> kerneldoc header which needs updating. But generally it looks fine.
>
> >
> > v4:
> > - Add an helper to return the VM state
> > - Check drmm_mutex_init() return code
> > - Remove the VM from the AS reclaim list when panthor_vm_active() is
> > called
> > - Count the number of active VM users instead of considering there's
> > at most one user (several scheduling groups can point to the same
> > vM)
> > - Pre-allocate a VMA object for unmap operations (unmaps can trigger
> > a sm_step_remap() call)
> > - Check vm->root_page_table instead of vm->pgtbl_ops to detect if
> > the io-pgtable is trying to allocate the root page table
> > - Don't memset() the va_node in panthor_vm_alloc_va(), make it a
> > caller requirement
> > - Fix the kernel doc in a few places
> > - Drop the panthor_vm::base offset constraint and modify
> > panthor_vm_put() to explicitly check for a NULL value
> > - Fix unbalanced vm_bo refcount in panthor_gpuva_sm_step_remap()
> > - Drop stale comments about the shared_bos list
> > - Patch mmu_features::va_bits on 32-bit builds to reflect the
> > io_pgtable limitation and let the UMD know about it
> >
> > v3:
> > - Add acks for the MIT/GPL2 relicensing
> > - Propagate MMU faults to the scheduler
> > - Move pages pinning/unpinning out of the dma_signalling path
> > - Fix 32-bit support
> > - Rework the user/kernel VA range calculation
> > - Make the auto-VA range explicit (auto-VA range doesn't cover the full
> > kernel-VA range on the MCU VM)
> > - Let callers of panthor_vm_alloc_va() allocate the drm_mm_node
> > (embedded in panthor_kernel_bo now)
> > - Adjust things to match the latest drm_gpuvm changes (extobj tracking,
> > resv prep and more)
> > - Drop the per-AS lock and use slots_lock (fixes a race on vm->as.id)
> > - Set as.id to -1 when reusing an address space from the LRU list
> > - Drop misleading comment about page faults
> > - Remove check for irq being assigned in panthor_mmu_unplug()
> >
> > Co-developed-by: Steven Price <steven.price at arm.com>
> > Signed-off-by: Steven Price <steven.price at arm.com>
> > Signed-off-by: Boris Brezillon <boris.brezillon at collabora.com>
> > Acked-by: Steven Price <steven.price at arm.com> # MIT+GPL2 relicensing,Arm
> > Acked-by: Grant Likely <grant.likely at linaro.org> # MIT+GPL2 relicensing,Linaro
> > Acked-by: Boris Brezillon <boris.brezillon at collabora.com> # MIT+GPL2 relicensing,Collabora
> > ---
> > drivers/gpu/drm/panthor/panthor_mmu.c | 2760 +++++++++++++++++++++++++
> > drivers/gpu/drm/panthor/panthor_mmu.h | 102 +
> > 2 files changed, 2862 insertions(+)
> > create mode 100644 drivers/gpu/drm/panthor/panthor_mmu.c
> > create mode 100644 drivers/gpu/drm/panthor/panthor_mmu.h
> >
> > diff --git a/drivers/gpu/drm/panthor/panthor_mmu.c b/drivers/gpu/drm/panthor/panthor_mmu.c
> > new file mode 100644
> > index 000000000000..d3ce29cd0662
>
> <snip>
>
> > +static int
> > +panthor_vm_map_pages(struct panthor_vm *vm, u64 iova, int prot,
> > + struct sg_table *sgt, u64 offset, u64 size)
> > +{
> > + struct panthor_device *ptdev = vm->ptdev;
> > + unsigned int count;
> > + struct scatterlist *sgl;
> > + struct io_pgtable_ops *ops = vm->pgtbl_ops;
> > + u64 start_iova = iova;
> > + int ret;
> > +
> > + if (!size)
> > + return 0;
> > +
> > + for_each_sgtable_dma_sg(sgt, sgl, count) {
> > + dma_addr_t paddr = sg_dma_address(sgl);
> > + size_t len = sg_dma_len(sgl);
> > +
> > + if (len <= offset) {
> > + offset -= len;
> > + continue;
> > + }
> > +
> > + paddr -= offset;
>
> Shouldn't that be "+="?
It should definitely be +=. Will fix that in v5.
>
> > + len -= offset;
> > +
> > + if (size >= 0) {
>
> size is unsigned... so this is always true!
I'll drop the condition.
>
> > + len = min_t(size_t, len, size);
> > + size -= len;
> > + }
> > +
> > + drm_dbg(&ptdev->base, "map: as=%d, iova=%llx, paddr=%pad, len=%zx",
> > + vm->as.id, iova, &paddr, len);
> > +
> > + while (len) {
> > + size_t pgcount, mapped = 0;
> > + size_t pgsize = get_pgsize(iova | paddr, len, &pgcount);
> > +
> > + ret = ops->map_pages(ops, iova, paddr, pgsize, pgcount, prot,
> > + GFP_KERNEL, &mapped);
> > + iova += mapped;
> > + paddr += mapped;
> > + len -= mapped;
> > +
> > + if (drm_WARN_ON(&ptdev->base, !ret && !mapped))
> > + ret = -ENOMEM;
> > +
> > + if (ret) {
> > + /* If something failed, unmap what we've already mapped before
> > + * returning. The unmap call is not supposed to fail.
> > + */
> > + drm_WARN_ON(&ptdev->base,
> > + panthor_vm_unmap_pages(vm, start_iova,
> > + iova - start_iova));
> > + return ret;
> > + }
> > + }
> > +
> > + if (!size)
> > + break;
> > + }
> > +
> > + return panthor_vm_flush_range(vm, start_iova, iova - start_iova);
> > +}
> > +
>
> <snip>
>
> > +static void panthor_vm_destroy(struct panthor_vm *vm)
> > +{
> > + if (!vm)
> > + return;
> > +
> > + vm->destroyed = true;
> > +
> > + mutex_lock(&vm->heaps.lock);
> > + panthor_heap_pool_destroy(vm->heaps.pool);
> > + vm->heaps.pool = NULL;
> > + mutex_unlock(&vm->heaps.lock);
> > +
> > + drm_WARN_ON(&vm->ptdev->base,
> > + panthor_vm_unmap_range(vm, vm->base.mm_start, vm->base.mm_range));
> > + panthor_vm_put(vm);
> > +}
> > +
> > +/**
> > + * panthor_vm_destroy() - Destroy a VM.
> ^^^^^^^^^^^^^^^^^^ needs updating to the new function name.
>
And I'll fix the kernel doc.
Thanks,
Boris
More information about the dri-devel
mailing list