Mesa (master): aco: be more careful about using SMEM for load_global

GitLab Mirror gitlab-mirror at kemper.freedesktop.org
Fri Apr 24 19:01:30 UTC 2020


Module: Mesa
Branch: master
Commit: 69b92db131b00fcfb64d77f876e51770d2f0aa5b
URL:    http://cgit.freedesktop.org/mesa/mesa/commit/?id=69b92db131b00fcfb64d77f876e51770d2f0aa5b

Author: Rhys Perry <pendingchaos02 at gmail.com>
Date:   Mon Apr  6 20:15:36 2020 +0100

aco: be more careful about using SMEM for load_global

Signed-off-by: Rhys Perry <pendingchaos02 at gmail.com>
Reviewed-by: Daniel Schürmann <daniel at schuermann.dev>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4639>

---

 src/amd/compiler/aco_instruction_selection.cpp | 7 ++++---
 1 file changed, 4 insertions(+), 3 deletions(-)

diff --git a/src/amd/compiler/aco_instruction_selection.cpp b/src/amd/compiler/aco_instruction_selection.cpp
index c83cf00f01a..c86d69baae9 100644
--- a/src/amd/compiler/aco_instruction_selection.cpp
+++ b/src/amd/compiler/aco_instruction_selection.cpp
@@ -6082,12 +6082,14 @@ void visit_load_global(isel_context *ctx, nir_intrinsic_instr *instr)
 
    bool glc = nir_intrinsic_access(instr) & (ACCESS_VOLATILE | ACCESS_COHERENT);
    bool dlc = glc && ctx->options->chip_class >= GFX10;
+   /* VMEM stores don't update the SMEM cache and it's difficult to prove that
+    * it's safe to use SMEM */
+   bool can_use_smem = nir_intrinsic_access(instr) & ACCESS_NON_WRITEABLE;
    aco_opcode op;
-   if (dst.type() == RegType::vgpr || (glc && ctx->options->chip_class < GFX8)) {
+   if (dst.type() == RegType::vgpr || (glc && ctx->options->chip_class < GFX8) || !can_use_smem) {
       bool global = ctx->options->chip_class >= GFX9;
 
       if (ctx->options->chip_class >= GFX7) {
-         aco_opcode op;
          switch (num_bytes) {
          case 4:
             op = global ? aco_opcode::global_load_dword : aco_opcode::flat_load_dword;
@@ -6128,7 +6130,6 @@ void visit_load_global(isel_context *ctx, nir_intrinsic_instr *instr)
          /* GFX6 doesn't support loading vec3, expand to vec4. */
          num_bytes = num_bytes == 12 ? 16 : num_bytes;
 
-         aco_opcode op;
          switch (num_bytes) {
          case 4:
             op = aco_opcode::buffer_load_dword;



More information about the mesa-commit mailing list