[PATCH v5 5/5] drm/xe: Refactor default device atomic settings
Nirmoy Das
nirmoy.das at linux.intel.com
Mon Apr 29 09:05:50 UTC 2024
Hi Jose,
On 4/26/2024 11:04 PM, Souza, Jose wrote:
> On Fri, 2024-04-26 at 12:56 +0200, Nirmoy Das wrote:
>> The default behavior of device atomics depends on the
>> VM type and buffer allocation types. Device atomics are
>> expected to function with all types of allocations for
>> traditional applications/APIs. Additionally, in compute/SVM
>> API scenarios with fault mode or LR mode VMs, device atomics
>> must work with single-region allocations. In all other cases
>> device atomics should be disabled by default also on platforms
>> where we know device atomics doesn't on work on particular
>> allocations types.
>>
>> v2: Fix platform checks to correct atomics behaviour on PVC.
>>
>> Signed-off-by: Nirmoy Das <nirmoy.das at intel.com>
>> ---
>> drivers/gpu/drm/xe/xe_pt.c | 27 ++++++++++++++++++++++++---
>> drivers/gpu/drm/xe/xe_vm.c | 2 +-
>> 2 files changed, 25 insertions(+), 4 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/xe/xe_pt.c b/drivers/gpu/drm/xe/xe_pt.c
>> index 5b7930f46cf3..237e4a4985a4 100644
>> --- a/drivers/gpu/drm/xe/xe_pt.c
>> +++ b/drivers/gpu/drm/xe/xe_pt.c
>> @@ -619,9 +619,30 @@ xe_pt_stage_bind(struct xe_tile *tile, struct xe_vma *vma,
>> struct xe_pt *pt = xe_vma_vm(vma)->pt_root[tile->id];
>> int ret;
>>
>> - if ((vma->gpuva.flags & XE_VMA_ATOMIC_PTE_BIT) &&
>> - (is_devmem || !IS_DGFX(xe)))
>> - xe_walk.default_pte |= XE_USM_PPGTT_PTE_AE;
>> + /**
>> + * Default atomic expectations for different allocation scenarios are as follows:
>> + *
>> + * 1. Traditional API: When the VM is not in fault mode or LR mode:
>> + * - Device atomics are expected to function with all allocations.
>> + *
>> + * 2. Compute/SVM API: When the VM is either in fault mode or LR mode:
>> + * - Device atomics are the default behavior when the bo is placed in a single region.
>> + * - In all other cases device atomics will be disabled with AE=0 until an application
>> + * request differently using a ioctl like madvise.
>> + */
>> + if (vma->gpuva.flags & XE_VMA_ATOMIC_PTE_BIT) {
>> + if (xe_vm_in_fault_mode(xe_vma_vm(vma)) ||
>> + xe_vm_in_lr_mode(xe_vma_vm(vma))) {
> nit: xe_vm_in_fault_mode requires xe_vm_in_lr_mode, so you can just check for the later.
Will do that. I think scratch page is also part of non-traditional API,
I will confirm that and add that if needed.
Thanks,
Nirmoy
>
>> + if (bo && xe_bo_has_single_placement(bo))
>> + xe_walk.default_pte |= XE_USM_PPGTT_PTE_AE;
>> + } else {
>> + xe_walk.default_pte |= XE_USM_PPGTT_PTE_AE;
>> + }
>> +
>> + /* Unset AE if the platform(PVC) doesn't support it */
>> + if (!xe->info.has_device_atomics_on_smem && !is_devmem)
>> + xe_walk.default_pte &= ~XE_USM_PPGTT_PTE_AE;
>> + }
>>
>> if (is_devmem) {
>> xe_walk.default_pte |= XE_PPGTT_PTE_DM;
>> diff --git a/drivers/gpu/drm/xe/xe_vm.c b/drivers/gpu/drm/xe/xe_vm.c
>> index 8fc37c5a0196..f795016a80d5 100644
>> --- a/drivers/gpu/drm/xe/xe_vm.c
>> +++ b/drivers/gpu/drm/xe/xe_vm.c
>> @@ -805,7 +805,7 @@ static struct xe_vma *xe_vma_create(struct xe_vm *vm,
>> for_each_tile(tile, vm->xe, id)
>> vma->tile_mask |= 0x1 << id;
>>
>> - if (GRAPHICS_VER(vm->xe) >= 20 || vm->xe->info.platform == XE_PVC)
>> + if (vm->xe->info.has_atomic_enable_pte_bit)
>> vma->gpuva.flags |= XE_VMA_ATOMIC_PTE_BIT;
>>
>> vma->pat_index = pat_index;
More information about the Intel-xe
mailing list