<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html;
      charset=windows-1252">
  </head>
  <body text="#000000" bgcolor="#FFFFFF">
    <div class="moz-cite-prefix">Oh, I know where this is coming from.<br>
      <br>
      The problem is that we remove the BOs from the LRU during CS and
      so we can't wait for the CS to finish up.<br>
      <br>
      Already working on this problem for Marek's similar issue,<br>
      Christian.<br>
      <br>
      Am 09.05.19 um 16:46 schrieb Zhou, David(ChunMing):<br>
    </div>
    <blockquote type="cite"
cite="mid:-yd30ko-7yy2hg-rmre4orxyza7-xndome2qdx81-wtzk8a-eccw2rc52fs2-grqgt7bajdlk-5lgjxo-xu3otx-5m9i78-2xa184-z2lana96yogmoisk45t18dyy9462b5f86c1cg5hlpw-xorlqm-lev8gs.1557413178321@email.android.com">
      <meta http-equiv="Content-Type" content="text/html;
        charset=windows-1252">
      <meta name="Generator" content="Microsoft Exchange Server">
      <!-- converted from text -->
      <style><!-- .EmailQuote { margin-left: 1pt; padding-left: 4pt; border-left: #800000 2px solid; } --></style>
      <div>I know that before, it will issue warning only when debug
        option is enabled. Removing that is ok to me.<br>
        I only help Prike draft your idea, and Prike is trying this
        patch on his side. The latest feedback he gave me is first_bo is
        always null, code doesn't run into busy path, which is very
        confusing me, and he said  he is debugging  that.<br>
        <br>
        -David<br>
        <br>
        <br>
        -------- Original Message --------<br>
        Subject: Re: [PATCH 1/2] drm/ttm: fix busy memory to fail other
        user v7<br>
        From: "Koenig, Christian" <br>
        To: "Zhou, David(ChunMing)" ,"Liang, Prike"
        ,<a class="moz-txt-link-abbreviated" href="mailto:dri-devel@lists.freedesktop.org">dri-devel@lists.freedesktop.org</a><br>
        CC: <br>
        <br>
      </div>
      <font size="2"><span style="font-size:11pt;">
          <div class="PlainText">I've foudn one more problem with this.<br>
            <br>
            With lockdep enabled I get a warning because
            ttm_eu_reserve_buffers() <br>
            has called ww_acquire_done() on the ticket (which
            essentially means we <br>
            are done, no more locking with that ticket).<br>
            <br>
            The simplest solution is probably to just remove the call to
            <br>
            ww_acquire_done() from ttm_eu_reserve_buffers().<br>
            <br>
            Christian.<br>
            <br>
            Am 07.05.19 um 13:45 schrieb Chunming Zhou:<br>
            > heavy gpu job could occupy memory long time, which lead
            other user fail to get memory.<br>
            ><br>
            > basically pick up Christian idea:<br>
            ><br>
            > 1. Reserve the BO in DC using a ww_mutex ticket
            (trivial).<br>
            > 2. If we then run into this EBUSY condition in TTM
            check if the BO we need memory for (or rather the ww_mutex
            of its reservation object) has a ticket assigned.<br>
            > 3. If we have a ticket we grab a reference to the first
            BO on the LRU, drop the LRU lock and try to grab the
            reservation lock with the ticket.<br>
            > 4. If getting the reservation lock with the ticket
            succeeded we check if the BO is still the first one on the
            LRU in question (the BO could have moved).<br>
            > 5. If the BO is still the first one on the LRU in
            question we try to evict it as we would evict any other BO.<br>
            > 6. If any of the "If's" above fail we just back off and
            return -EBUSY.<br>
            ><br>
            > v2: fix some minor check<br>
            > v3: address Christian v2 comments.<br>
            > v4: fix some missing<br>
            > v5: handle first_bo unlock and bo_get/put<br>
            > v6: abstract unified iterate function, and handle all
            possible usecase not only pinned bo.<br>
            > v7: pass request bo->resv to ttm_bo_evict_first<br>
            ><br>
            > Change-Id: I21423fb922f885465f13833c41df1e134364a8e7<br>
            > Signed-off-by: Chunming Zhou
            <a class="moz-txt-link-rfc2396E" href="mailto:david1.zhou@amd.com"><david1.zhou@amd.com></a><br>
            > ---<br>
            >   drivers/gpu/drm/ttm/ttm_bo.c | 111
            +++++++++++++++++++++++++++++------<br>
            >   1 file changed, 94 insertions(+), 17 deletions(-)<br>
            ><br>
            > diff --git a/drivers/gpu/drm/ttm/ttm_bo.c
            b/drivers/gpu/drm/ttm/ttm_bo.c<br>
            > index 8502b3ed2d88..f5e6328e4a57 100644<br>
            > --- a/drivers/gpu/drm/ttm/ttm_bo.c<br>
            > +++ b/drivers/gpu/drm/ttm/ttm_bo.c<br>
            > @@ -766,11 +766,13 @@
            EXPORT_SYMBOL(ttm_bo_eviction_valuable);<br>
            >    * b. Otherwise, trylock it.<br>
            >    */<br>
            >   static bool ttm_bo_evict_swapout_allowable(struct
            ttm_buffer_object *bo,<br>
            > -                     struct ttm_operation_ctx *ctx,
            bool *locked)<br>
            > +                     struct ttm_operation_ctx *ctx,
            bool *locked, bool *busy)<br>
            >   {<br>
            >        bool ret = false;<br>
            >   <br>
            >        *locked = false;<br>
            > +     if (busy)<br>
            > +             *busy = false;<br>
            >        if (bo->resv == ctx->resv) {<br>
            >               
            reservation_object_assert_held(bo->resv);<br>
            >                if (ctx->flags &
            TTM_OPT_FLAG_ALLOW_RES_EVICT<br>
            > @@ -779,35 +781,46 @@ static bool
            ttm_bo_evict_swapout_allowable(struct ttm_buffer_object *bo,<br>
            >        } else {<br>
            >                *locked =
            reservation_object_trylock(bo->resv);<br>
            >                ret = *locked;<br>
            > +             if (!ret && busy)<br>
            > +                     *busy = true;<br>
            >        }<br>
            >   <br>
            >        return ret;<br>
            >   }<br>
            >   <br>
            > -static int ttm_mem_evict_first(struct ttm_bo_device
            *bdev,<br>
            > -                            uint32_t mem_type,<br>
            > -                            const struct ttm_place
            *place,<br>
            > -                            struct ttm_operation_ctx
            *ctx)<br>
            > +static struct ttm_buffer_object*<br>
            > +ttm_mem_find_evitable_bo(struct ttm_bo_device *bdev,<br>
            > +                      struct ttm_mem_type_manager
            *man,<br>
            > +                      const struct ttm_place *place,<br>
            > +                      struct ttm_operation_ctx *ctx,<br>
            > +                      struct ttm_buffer_object
            **first_bo,<br>
            > +                      bool *locked)<br>
            >   {<br>
            > -     struct ttm_bo_global *glob = bdev->glob;<br>
            > -     struct ttm_mem_type_manager *man =
            &bdev->man[mem_type];<br>
            >        struct ttm_buffer_object *bo = NULL;<br>
            > -     bool locked = false;<br>
            > -     unsigned i;<br>
            > -     int ret;<br>
            > +     int i;<br>
            >   <br>
            > -     spin_lock(&glob->lru_lock);<br>
            > +     if (first_bo)<br>
            > +             *first_bo = NULL;<br>
            >        for (i = 0; i < TTM_MAX_BO_PRIORITY; ++i) {<br>
            >                list_for_each_entry(bo,
            &man->lru[i], lru) {<br>
            > -                     if
            (!ttm_bo_evict_swapout_allowable(bo, ctx, &locked))<br>
            > +                     bool busy = false;<br>
            > +<br>
            > +                     if
            (!ttm_bo_evict_swapout_allowable(bo, ctx, locked,<br>
            >
            +                                                        
            &busy)) {<br>
            > +                             if (first_bo &&
            !(*first_bo) && busy) {<br>
            > +                                     ttm_bo_get(bo);<br>
            > +                                     *first_bo = bo;<br>
            > +                             }<br>
            >                                continue;<br>
            > +                     }<br>
            >   <br>
            >                        if (place &&
            !bdev->driver->eviction_valuable(bo,<br>
>                                                                     
            place)) {<br>
            > -                             if (locked)<br>
            > +                             if (*locked)<br>
            >                                       
            reservation_object_unlock(bo->resv);<br>
            >                                continue;<br>
            >                        }<br>
            > +<br>
            >                        break;<br>
            >                }<br>
            >   <br>
            > @@ -818,9 +831,67 @@ static int
            ttm_mem_evict_first(struct ttm_bo_device *bdev,<br>
            >                bo = NULL;<br>
            >        }<br>
            >   <br>
            > +     return bo;<br>
            > +}<br>
            > +<br>
            > +static int ttm_mem_evict_first(struct ttm_bo_device
            *bdev,<br>
            > +                            uint32_t mem_type,<br>
            > +                            const struct ttm_place
            *place,<br>
            > +                            struct ttm_operation_ctx
            *ctx,<br>
            > +                            struct reservation_object
            *request_resv)<br>
            > +{<br>
            > +     struct ttm_bo_global *glob = bdev->glob;<br>
            > +     struct ttm_mem_type_manager *man =
            &bdev->man[mem_type];<br>
            > +     struct ttm_buffer_object *bo = NULL, *first_bo =
            NULL;<br>
            > +     bool locked = false;<br>
            > +     int ret;<br>
            > +<br>
            > +     spin_lock(&glob->lru_lock);<br>
            > +     bo = ttm_mem_find_evitable_bo(bdev, man, place,
            ctx, &first_bo,<br>
            > +                                   &locked);<br>
            >        if (!bo) {<br>
            > +             struct ttm_operation_ctx busy_ctx;<br>
            > +<br>
            >                spin_unlock(&glob->lru_lock);<br>
            > -             return -EBUSY;<br>
            > +             /* check if other user occupy memory too
            long time */<br>
            > +             if (!first_bo || !request_resv ||
            !request_resv->lock.ctx) {<br>
            > +                     if (first_bo)<br>
            > +                             ttm_bo_put(first_bo);<br>
            > +                     return -EBUSY;<br>
            > +             }<br>
            > +             if (first_bo->resv == request_resv) {<br>
            > +                     ttm_bo_put(first_bo);<br>
            > +                     return -EBUSY;<br>
            > +             }<br>
            > +             if (ctx->interruptible)<br>
            > +                     ret =
            ww_mutex_lock_interruptible(&first_bo->resv->lock,<br>
            > +                                                      
            request_resv->lock.ctx);<br>
            > +             else<br>
            > +                     ret =
            ww_mutex_lock(&first_bo->resv->lock,
            request_resv->lock.ctx);<br>
            > +             if (ret) {<br>
            > +                     ttm_bo_put(first_bo);<br>
            > +                     return ret;<br>
            > +             }<br>
            > +             spin_lock(&glob->lru_lock);<br>
            > +             /* previous busy resv lock is held by
            above, idle now,<br>
            > +              * so let them evictable.<br>
            > +              */<br>
            > +             busy_ctx.interruptible =
            ctx->interruptible;<br>
            > +             busy_ctx.no_wait_gpu   =
            ctx->no_wait_gpu;<br>
            > +             busy_ctx.resv          =
            first_bo->resv;<br>
            > +             busy_ctx.flags         =
            TTM_OPT_FLAG_ALLOW_RES_EVICT;<br>
            > +<br>
            > +             bo = ttm_mem_find_evitable_bo(bdev, man,
            place, &busy_ctx, NULL,<br>
            > +                                          
            &locked);<br>
            > +             if (bo && (bo->resv ==
            first_bo->resv))<br>
            > +                     locked = true;<br>
            > +             else if (bo)<br>
            > +                    
            ww_mutex_unlock(&first_bo->resv->lock);<br>
            > +             if (!bo) {<br>
            > +                    
            spin_unlock(&glob->lru_lock);<br>
            > +                     ttm_bo_put(first_bo);<br>
            > +                     return -EBUSY;<br>
            > +             }<br>
            >        }<br>
            >   <br>
            >        kref_get(&bo->list_kref);<br>
            > @@ -829,11 +900,15 @@ static int
            ttm_mem_evict_first(struct ttm_bo_device *bdev,<br>
            >                ret = ttm_bo_cleanup_refs(bo,
            ctx->interruptible,<br>
            >                                         
            ctx->no_wait_gpu, locked);<br>
            >                kref_put(&bo->list_kref,
            ttm_bo_release_list);<br>
            > +             if (first_bo)<br>
            > +                     ttm_bo_put(first_bo);<br>
            >                return ret;<br>
            >        }<br>
            >   <br>
            >        ttm_bo_del_from_lru(bo);<br>
            >        spin_unlock(&glob->lru_lock);<br>
            > +     if (first_bo)<br>
            > +             ttm_bo_put(first_bo);<br>
            >   <br>
            >        ret = ttm_bo_evict(bo, ctx);<br>
            >        if (locked) {<br>
            > @@ -907,7 +982,7 @@ static int
            ttm_bo_mem_force_space(struct ttm_buffer_object *bo,<br>
            >                        return ret;<br>
            >                if (mem->mm_node)<br>
            >                        break;<br>
            > -             ret = ttm_mem_evict_first(bdev, mem_type,
            place, ctx);<br>
            > +             ret = ttm_mem_evict_first(bdev, mem_type,
            place, ctx, bo->resv);<br>
            >                if (unlikely(ret != 0))<br>
            >                        return ret;<br>
            >        } while (1);<br>
            > @@ -1413,7 +1488,8 @@ static int
            ttm_bo_force_list_clean(struct ttm_bo_device *bdev,<br>
            >        for (i = 0; i < TTM_MAX_BO_PRIORITY; ++i) {<br>
            >                while (!list_empty(&man->lru[i]))
            {<br>
            >                       
            spin_unlock(&glob->lru_lock);<br>
            > -                     ret = ttm_mem_evict_first(bdev,
            mem_type, NULL, &ctx);<br>
            > +                     ret = ttm_mem_evict_first(bdev,
            mem_type, NULL, &ctx,<br>
            > +                                               NULL);<br>
            >                        if (ret)<br>
            >                                return ret;<br>
            >                       
            spin_lock(&glob->lru_lock);<br>
            > @@ -1784,7 +1860,8 @@ int ttm_bo_swapout(struct
            ttm_bo_global *glob, struct ttm_operation_ctx *ctx)<br>
            >        spin_lock(&glob->lru_lock);<br>
            >        for (i = 0; i < TTM_MAX_BO_PRIORITY; ++i) {<br>
            >                list_for_each_entry(bo,
            &glob->swap_lru[i], swap) {<br>
            > -                     if
            (ttm_bo_evict_swapout_allowable(bo, ctx, &locked)) {<br>
            > +                     if
            (ttm_bo_evict_swapout_allowable(bo, ctx, &locked,<br>
            >
            +                                                       
            NULL)) {<br>
            >                                ret = 0;<br>
            >                                break;<br>
            >                        }<br>
            <br>
          </div>
        </span></font>
      <br>
      <fieldset class="mimeAttachmentHeader"></fieldset>
      <pre class="moz-quote-pre" wrap="">_______________________________________________
dri-devel mailing list
<a class="moz-txt-link-abbreviated" href="mailto:dri-devel@lists.freedesktop.org">dri-devel@lists.freedesktop.org</a>
<a class="moz-txt-link-freetext" href="https://lists.freedesktop.org/mailman/listinfo/dri-devel">https://lists.freedesktop.org/mailman/listinfo/dri-devel</a></pre>
    </blockquote>
    <br>
  </body>
</html>