✓ CI.checkpatch: success for series starting with [1/2] drm/xe: Skip CCS clear for WB type BOs

Tue Aug 27 16:28:18 UTC 2024

== Series Details ==

Series: series starting with [1/2] drm/xe: Skip CCS clear for WB type BOs
URL   : https://patchwork.freedesktop.org/series/137855/
State : success

== Summary ==

+ KERNEL=/kernel
+ git clone https://gitlab.freedesktop.org/drm/maintainer-tools mt
Cloning into 'mt'...
warning: redirecting to https://gitlab.freedesktop.org/drm/maintainer-tools.git/
+ git -C mt rev-list -n1 origin/master
9fe5037901cabbcdf27a6fe0dfb047ca1474d363
+ cd /kernel
+ git config --global --add safe.directory /kernel
+ git log -n1
commit 8048d986adda7fa6a82d5f4925ccf5fb465d4632
Author: Nirmoy Das <nirmoy.das at intel.com>
Date:   Tue Aug 27 17:49:10 2024 +0200

    Revert "drm/xe/lnl: Offload system clear page activity to GPU"
    
    This optimization relied on having to clear CCS on allocations.
    If there is no need to clear CCS on allocations then this would mostly
    help in reducing CPU utilization.
    
    Revert this patch because of:
    1. Currently Xe can't do clear on free, and it uses an invalid TTM flag,
    TTM_TT_FLAG_CLEARED_ON_FREE, which could poison the global TTM pool on
    multi-device setups.
    
    2. BOs with WB caching type can't have compression enabled, so there
    is no need to clear CCS for such allocations. Reducing CPU
    utilization is a plus, but latency is more important.
    
    Without this patch:
    UsmMemoryAllocation(api=l0 type=Host size=4KB)  113.1 us
    UsmMemoryAllocation(api=l0 type=Host size=1GB)  91452.6 us
    13.93% api_overhead_be  [kernel.kallsyms] [k] pages_are_mergeable
    7.59%  api_overhead_be  [kernel.kallsyms] [k] __lock_acquire
    3.86%  api_overhead_be  [kernel.kallsyms] [k] lock_is_held_type
    3.39%  api_overhead_be  [kernel.kallsyms] [k] rcu_is_watching
    3.37%  api_overhead_be  [xe]              [k] emit_pte
    
    With this patch:
    UsmMemoryAllocation(api=l0 type=Host size=4KB)  91.7 us
    UsmMemoryAllocation(api=l0 type=Host size=1GB)  89486.5 us
    91.86% api_overhead_be  [kernel.kallsyms] [k] clear_page_erms
    1.81%  api_overhead_be  [kernel.kallsyms] [k] pages_are_mergeable
    0.99%  api_overhead_be  [kernel.kallsyms] [k] get_page_from_freelist
    0.63%  api_overhead_be  [kernel.kallsyms] [k] __free_pages_ok
    0.50%  api_overhead_be  [kernel.kallsyms] [k] __lock_acquire
    
    For larger allocations the delta is very low, so GPU-based system
    page clearing is likely still worthwhile and can be reconsidered in
    the future.
    
    This reverts commit 23683061805be368c8d1c7e7ff52abc470cac275 with
    minor conflict fixes.
    
    Cc: Matthew Auld <matthew.auld at intel.com>
    Cc: Matthew Brost <matthew.brost at intel.com>
    Cc: Thomas Hellström <thomas.hellstrom at linux.intel.com>
    Signed-off-by: Nirmoy Das <nirmoy.das at intel.com>
+ /mt/dim checkpatch 2c3fbd94c8b2eda2275211dfa8ead639a4e3e161 drm-intel
c76bf288ff03 drm/xe: Skip CCS clear for WB type BOs
8048d986adda Revert "drm/xe/lnl: Offload system clear page activity to GPU"
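
The latency figures quoted in the revert's commit message can be compared with a little arithmetic. The sketch below is only an illustration: the helper name `relative_reduction` is hypothetical and not part of any tool in this log, and the four numbers are copied from the "Without this patch" and "With this patch" blocks above. It gives a reduction of roughly 19% for the 4KB case and roughly 2% for the 1GB case, which matches the commit's remark that the delta is small for larger allocations.

```python
# Hypothetical helper, not part of dim or checkpatch: computes the
# fractional latency reduction between two measurements.
def relative_reduction(before_us: float, after_us: float) -> float:
    """Return (before - after) / before for two latencies in microseconds."""
    if before_us <= 0:
        raise ValueError("baseline latency must be positive")
    return (before_us - after_us) / before_us


# Values taken from the commit message: (without the revert, with the revert).
measurements = {
    "Host 4KB": (113.1, 91.7),
    "Host 1GB": (91452.6, 89486.5),
}

for name, (before, after) in measurements.items():
    pct = 100.0 * relative_reduction(before, after)
    print(f"{name}: {before} us -> {after} us ({pct:.2f}% lower)")
```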