[Intel-gfx] [PATCH v2] drm/i915: Force wmb() on using GTT io_mapping_map_wc
Daniel Vetter
daniel at ffwll.ch
Tue May 26 01:01:24 PDT 2015
On Mon, May 25, 2015 at 06:48:44PM +0100, Chris Wilson wrote:
> Since the advent of mmap(wc), where we reused the same cache domain for
> WC and GTT paths (oh, how I regret that double-edged advice), we need to
> be extra cautious when using GTT iomap_wc internally. Since userspace maybe
> modifying through the mmap(wc) and we then modify modifying through an
> aliased WC path through the GTT, those writes may overlap and not be
> visible to the other path. Easiest to trigger appears to be write the
> batch through mmap(wc) and then attempt to perform reloc the GTT,
> corruption quickly ensues.
>
> v2: Be stricter and do a full mb() in case we are reading through one
> path and about to write through the second path. Also, apply the
> barriers on transitioning into the GTT domain as well to cover the white
> lie asychronous cases.
>
> Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
> Cc: Akash Goel <akash.goel at intel.com>
> Cc: stable at vger.kernel.org
There are a lot more mb() and wmb() in the gem code. I count execlist,
legacy rings, pwrite, set_domain and finish_gtt.
Time to add a flags parameter to the set_domain ioctl so that we can
untangle the domain tracking from the waiting? Add mb() all over the place
in userspace around unsychronized access? Something else?
-Daniel
> ---
> drivers/gpu/drm/i915/i915_gem.c | 26 ++++++++++++++++++--------
> 1 file changed, 18 insertions(+), 8 deletions(-)
>
> diff --git a/drivers/gpu/drm/i915/i915_gem.c b/drivers/gpu/drm/i915/i915_gem.c
> index 517c5b8100d1..f17168b2cd37 100644
> --- a/drivers/gpu/drm/i915/i915_gem.c
> +++ b/drivers/gpu/drm/i915/i915_gem.c
> @@ -3923,7 +3923,7 @@ i915_gem_object_set_to_gtt_domain(struct drm_i915_gem_object *obj, bool write)
> int ret;
>
> if (obj->base.write_domain == I915_GEM_DOMAIN_GTT)
> - return 0;
> + goto out;
>
> ret = i915_gem_object_wait_rendering(obj, !write);
> if (ret)
> @@ -3943,13 +3943,6 @@ i915_gem_object_set_to_gtt_domain(struct drm_i915_gem_object *obj, bool write)
>
> i915_gem_object_flush_cpu_write_domain(obj);
>
> - /* Serialise direct access to this object with the barriers for
> - * coherent writes from the GPU, by effectively invalidating the
> - * GTT domain upon first access.
> - */
> - if ((obj->base.read_domains & I915_GEM_DOMAIN_GTT) == 0)
> - mb();
> -
> old_write_domain = obj->base.write_domain;
> old_read_domains = obj->base.read_domains;
>
> @@ -3977,6 +3970,23 @@ i915_gem_object_set_to_gtt_domain(struct drm_i915_gem_object *obj, bool write)
> list_move_tail(&vma->mm_list,
> &to_i915(obj->base.dev)->gtt.base.inactive_list);
>
> +out:
> + /*
> + * Userspace may be writing through mmap(wc) with
> + * write_domain=GTT (or even with a white lie unsynchronized write),
> + * so we need to explicitly flush before transitioning to writing
> + * through the GTT, or vice versa. As we reused the GTT cache domain
> + * for both paths, we must always have a memory barrier just in case.
> + *
> + * We need a full memory barrier in case we are reading through
> + * the GTT and before we starting through the WC (or vice versa).
> + * If we are only about to read through this access, we need only
> + * wait for any pending writes on the other path.
> + */
> + if (write)
> + mb();
> + else
> + wmb();
> return 0;
> }
>
> --
> 2.1.4
>
> _______________________________________________
> Intel-gfx mailing list
> Intel-gfx at lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/intel-gfx
--
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch
More information about the Intel-gfx
mailing list