[i-g-t, v2, 1/1] tests/intel/xe_exec_threads: wait for all submissions to complete
fei.yang at intel.com
fei.yang at intel.com
Mon Oct 28 22:53:49 UTC 2024
From: Fei Yang <fei.yang at intel.com>
In test_compute_mode, there is an one second sleep waiting for all
the submissions to complete, but that is not reliable especially
on pre-si platforms where the GPU could be a lot slower. Instead we
should wait for the ufence to make sure the GPU is inactive before
unbinding the BO.
Signed-off-by: Fei Yang <fei.yang at intel.com>
---
tests/intel/xe_exec_threads.c | 29 ++++++++++++++++++++++-------
1 file changed, 22 insertions(+), 7 deletions(-)
diff --git a/tests/intel/xe_exec_threads.c b/tests/intel/xe_exec_threads.c
index 413d6626b..03043c53e 100644
--- a/tests/intel/xe_exec_threads.c
+++ b/tests/intel/xe_exec_threads.c
@@ -340,7 +340,7 @@ test_compute_mode(int fd, uint32_t vm, uint64_t addr, uint64_t userptr,
xe_exec(fd, &exec);
if (flags & REBIND && i && !(i & 0x1f)) {
- for (j = i - 0x20; j <= i; ++j)
+ for (j = i == 0x20 ? 0 : i - 0x1f; j <= i; ++j)
xe_wait_ufence(fd, &data[j].exec_sync,
USER_FENCE_VALUE,
exec_queues[e], fence_timeout);
@@ -404,16 +404,31 @@ test_compute_mode(int fd, uint32_t vm, uint64_t addr, uint64_t userptr,
}
}
- j = flags & INVALIDATE ?
- (flags & RACE ? n_execs / 2 + 1 : n_execs - 1) : 0;
+ j = 0; /* wait for all submissions to complete */
+ if (flags & INVALIDATE)
+ /*
+ * For !RACE cases xe_wait_ufence has been called in above for-loop
+ * except the last batch of submissions. For RACE cases we will need
+ * to wait for the second half of the submissions to complete. There
+ * is a potential race here because the first half submissions might
+ * have updated the fence in the old physical location while the test
+ * is remapping the buffer from a different physical location, but the
+ * wait_ufence only checks the fence from the new location which would
+ * never be updated. We have to assume the first half of the submissions
+ * complete before the second half.
+ */
+ j = (flags & RACE) ? (n_execs / 2 + 1) : (((n_execs - 1) & ~0x1f) + 1);
+ else if (flags & REBIND)
+ /*
+ * For REBIND cases xe_wait_ufence has been called in above for-loop
+ * except the last batch of submissions.
+ */
+ j = ((n_execs - 1) & ~0x1f) + 1;
+
for (i = j; i < n_execs; i++)
xe_wait_ufence(fd, &data[i].exec_sync, USER_FENCE_VALUE,
exec_queues[i % n_exec_queues], fence_timeout);
- /* Wait for all execs to complete */
- if (flags & INVALIDATE)
- sleep(1);
-
sync[0].addr = to_user_pointer(&data[0].vm_sync);
xe_vm_unbind_async(fd, vm, 0, 0, addr, bo_size, sync, 1);
xe_wait_ufence(fd, &data[0].vm_sync, USER_FENCE_VALUE, 0, fence_timeout);
--
2.25.1
More information about the igt-dev
mailing list