[Intel-gfx] [PATCH v2] drm/i915: Force wmb() on using GTT io_mapping_map_wc

Chris Wilson chris at chris-wilson.co.uk
Mon May 25 10:48:44 PDT 2015


Since the advent of mmap(wc), where we reused the same cache domain for
WC and GTT paths (oh, how I regret that double-edged advice), we need to
be extra cautious when using GTT iomap_wc internally. Since userspace maybe
modifying through the mmap(wc) and we then modify modifying through an
aliased WC path through the GTT, those writes may overlap and not be
visible to the other path. Easiest to trigger appears to be write the
batch through mmap(wc) and then attempt to perform reloc the GTT,
corruption quickly ensues.

v2: Be stricter and do a full mb() in case we are reading through one
path and about to write through the second path. Also, apply the
barriers on transitioning into the GTT domain as well to cover the white
lie asychronous cases.

Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
Cc: Akash Goel <akash.goel at intel.com>
Cc: stable at vger.kernel.org
---
 drivers/gpu/drm/i915/i915_gem.c | 26 ++++++++++++++++++--------
 1 file changed, 18 insertions(+), 8 deletions(-)

diff --git a/drivers/gpu/drm/i915/i915_gem.c b/drivers/gpu/drm/i915/i915_gem.c
index 517c5b8100d1..f17168b2cd37 100644
--- a/drivers/gpu/drm/i915/i915_gem.c
+++ b/drivers/gpu/drm/i915/i915_gem.c
@@ -3923,7 +3923,7 @@ i915_gem_object_set_to_gtt_domain(struct drm_i915_gem_object *obj, bool write)
 	int ret;
 
 	if (obj->base.write_domain == I915_GEM_DOMAIN_GTT)
-		return 0;
+		goto out;
 
 	ret = i915_gem_object_wait_rendering(obj, !write);
 	if (ret)
@@ -3943,13 +3943,6 @@ i915_gem_object_set_to_gtt_domain(struct drm_i915_gem_object *obj, bool write)
 
 	i915_gem_object_flush_cpu_write_domain(obj);
 
-	/* Serialise direct access to this object with the barriers for
-	 * coherent writes from the GPU, by effectively invalidating the
-	 * GTT domain upon first access.
-	 */
-	if ((obj->base.read_domains & I915_GEM_DOMAIN_GTT) == 0)
-		mb();
-
 	old_write_domain = obj->base.write_domain;
 	old_read_domains = obj->base.read_domains;
 
@@ -3977,6 +3970,23 @@ i915_gem_object_set_to_gtt_domain(struct drm_i915_gem_object *obj, bool write)
 		list_move_tail(&vma->mm_list,
 			       &to_i915(obj->base.dev)->gtt.base.inactive_list);
 
+out:
+	/*
+	 * Userspace may be writing through mmap(wc) with
+	 * write_domain=GTT (or even with a white lie unsynchronized write),
+	 * so we need to explicitly flush before transitioning to writing
+	 * through the GTT, or vice versa. As we reused the GTT cache domain
+	 * for both paths, we must always have a memory barrier just in case.
+	 *
+	 * We need a full memory barrier in case we are reading through
+	 * the GTT and before we starting through the WC (or vice versa).
+	 * If we are only about to read through this access, we need only
+	 * wait for any pending writes on the other path.
+	 */
+	if (write)
+		mb();
+	else
+		wmb();
 	return 0;
 }
 
-- 
2.1.4



More information about the Intel-gfx mailing list