[Mesa-dev] [PATCH 1/3] anv: Stop racing relocation offsets
Chris Wilson
chris at chris-wilson.co.uk
Thu May 11 07:41:13 UTC 2017
On Wed, May 10, 2017 at 04:08:53PM -0700, Jason Ekstrand wrote:
> One of the key invariants of the relocation system is the
> presumed_offset field. The assumption is made that the value currently
> in the address to be relocated agrees with the presumed_offset field.
> If presumed_offset is equal to the offset of the BO, the kernel will
> skip the relocation assuming that the value is already correct.
>
> Our initial implementation of relocation handling had a race where we
> would read bo->offset once when we wrote the relocation entry and again
> when we filled out actual address.
>
> Found with helgrind
>
> Cc: "17.0 17.1" <mesa-stable at lists.freedesktop.org>
> ---
> src/intel/vulkan/anv_batch_chain.c | 21 +++++++++++++++++----
> src/intel/vulkan/anv_private.h | 2 +-
> src/intel/vulkan/genX_blorp_exec.c | 5 ++++-
> src/intel/vulkan/genX_cmd_buffer.c | 7 +++++--
> 4 files changed, 27 insertions(+), 8 deletions(-)
>
> diff --git a/src/intel/vulkan/anv_batch_chain.c b/src/intel/vulkan/anv_batch_chain.c
> index 9def174..13303b1 100644
> --- a/src/intel/vulkan/anv_batch_chain.c
> +++ b/src/intel/vulkan/anv_batch_chain.c
> @@ -143,7 +143,8 @@ anv_reloc_list_grow(struct anv_reloc_list *list,
> VkResult
> anv_reloc_list_add(struct anv_reloc_list *list,
> const VkAllocationCallbacks *alloc,
> - uint32_t offset, struct anv_bo *target_bo, uint32_t delta)
> + uint32_t offset, struct anv_bo *target_bo, uint32_t delta,
> + uint64_t *bo_offset_out)
> {
> struct drm_i915_gem_relocation_entry *entry;
> int index;
> @@ -155,6 +156,14 @@ anv_reloc_list_add(struct anv_reloc_list *list,
> if (result != VK_SUCCESS)
> return result;
>
> + /* Read the BO offset once. This same value will be used in the relocation
> + * entry and passed back to the caller for it to use when it writes the
> + * actual value. This guarantees that the two values match even if there
> + * is a data race between now and when the caller gets around to writing
> + * the address into the BO.
> + */
> + uint64_t presumed_offset = target_bo->offset;
If it really can be written in parallel and if helgrind says it is, it
is, then you also need a compiler barrier to prevent a reload.
#define READ_ONCE(x) (*(volatile typeof(x) *)&(x))
uint64_t presumed_offset = READ_ONCE(target_bo->offset);
-Chris
--
Chris Wilson, Intel Open Source Technology Centre
More information about the mesa-dev
mailing list