[PATCH v2 00/10] Implement multi-GPU DMA mappings for KFD

Zeng, Oak Oak.Zeng at amd.com
Tue Apr 27 15:16:12 UTC 2021


This series is Acked-by: Oak Zeng <Oak.Zeng at amd.com> 

Regards,
Oak 

 

On 2021-04-21, 9:31 PM, "dri-devel on behalf of Felix Kuehling" <dri-devel-bounces at lists.freedesktop.org on behalf of Felix.Kuehling at amd.com> wrote:

    This patch series fixes DMA-mappings of system memory (GTT and userptr)
    for KFD running on multi-GPU systems with IOMMU enabled. One SG-BO per
    GPU is needed to maintain the DMA mappings of each BO.

    Changes in v2:
    - Made the original BO parent of the SG BO to fix bo destruction order
    - Removed individualiation hack that is, not needed with parent BO
    - Removed resv locking hace in amdgpu_ttm_unpopulate, not needed without
      the individualization hack
    - Added a patch to enable the Intel IOMMU driver in rock-dbg_defconfig
    - Added a patch to move dmabuf attach/detach into backend_(un)bind

    I'm still seeing some IOMMU access faults in the eviction test. They seem
    to be related to userptr handling. They happen even without this patch
    series on a single-GPU system, where this patch series is not needed. I
    believe this is an old problem in KFD or amdgpu that is being exposed by
    device isolation from the IOMMU. I'm debugging it, but it should not hold
    up this patch series.

    "drm/ttm: Don't count pages in SG BOs against pages_limit" was already
    applied to drm-misc (I think). I'm still including it here because my
    patches depend on it. Without that, the SG BOs created for DMA mappings
    cause many tests fail because TTM incorrectly thinks it's out of memory.

    Felix Kuehling (10):
      rock-dbg_defconfig: Enable Intel IOMMU
      drm/amdgpu: Rename kfd_bo_va_list to kfd_mem_attachment
      drm/amdgpu: Keep a bo-reference per-attachment
      drm/amdgpu: Simplify AQL queue mapping
      drm/amdgpu: Add multi-GPU DMA mapping helpers
      drm/amdgpu: DMA map/unmap when updating GPU mappings
      drm/amdgpu: Move kfd_mem_attach outside reservation
      drm/amdgpu: Add DMA mapping of GTT BOs
      drm/ttm: Don't count pages in SG BOs against pages_limit
      drm/amdgpu: Move dmabuf attach/detach to backend_(un)bind

     arch/x86/configs/rock-dbg_defconfig           |  11 +-
     drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.h    |  18 +-
     .../gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c  | 530 ++++++++++++------
     drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c       |  51 +-
     drivers/gpu/drm/ttm/ttm_tt.c                  |  27 +-
     5 files changed, 437 insertions(+), 200 deletions(-)

    -- 
    2.31.1

    _______________________________________________
    dri-devel mailing list
    dri-devel at lists.freedesktop.org
    https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Fdri-devel&data=04%7C01%7Coak.zeng%40amd.com%7Cfb31922bd50846641e9508d9052e635d%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637546519058204046%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=yxNesWxDmM5H8ObiNmeaa0DBIEyptiBpjUKSUqS%2B52M%3D&reserved=0



More information about the dri-devel mailing list