[Intel-gfx] [PATCH rdma-next v5 0/4] Dynamically allocate SG table from the pages
Leon Romanovsky
leon at kernel.org
Sun Oct 4 15:43:36 UTC 2020
From: Leon Romanovsky <leonro at nvidia.com>
Changelog:
v5:
* Use sg_init_table to allocate table and avoid changes in __sg_alloc_table
* Fix offset issue
v4: https://lore.kernel.org/lkml/20200927064647.3106737-1-leon@kernel.org
* Fixed formatting in first patch.
* Added fix (clear tmp_nents) in first patch to fix i915 failure.
* Added test patches
v3: https://lore.kernel.org/linux-rdma/20200922083958.2150803-1-leon@kernel.org/
* Squashed Christopher's suggestion to avoid introducing a new API and extend the existing one instead.
v2: https://lore.kernel.org/linux-rdma/20200916140726.839377-1-leon@kernel.org
* Fixed indentations and comments
* Deleted sg_alloc_next()
* Squashed lib/scatterlist patches into one
v1: https://lore.kernel.org/lkml/20200910134259.1304543-1-leon@kernel.org
* Changed _sg_chain to be __sg_chain
* Added dependency on ARCH_NO_SG_CHAIN
* Removed struct sg_append
v0:
* https://lore.kernel.org/lkml/20200903121853.1145976-1-leon@kernel.org
--------------------------------------------------------------------------
From Maor:
This series extends __sg_alloc_table_from_pages to allow chaining of
new pages to an already initialized SG table.
This lets drivers use the optimization of merging contiguous
pages without having to preallocate all the pages and hold them in
a very large temporary buffer before the SG table is initialized.
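The idea of appending pages in batches while still merging contiguous ones can be illustrated with a small user-space sketch. This is not the kernel implementation: struct seg, append_pages() and PAGE_SZ are hypothetical names invented for illustration, pages are represented by plain page frame numbers, and a 4KB base page size is assumed.

```c
#include <assert.h>
#include <stddef.h>

#define PAGE_SZ 4096UL /* assumed base page size */

/* Hypothetical stand-in for one SG entry: a run of contiguous pages. */
struct seg {
	unsigned long first_pfn; /* page frame number of the first page */
	unsigned long len;       /* length of the run in bytes */
};

/*
 * Append n pages (given as page frame numbers) to segs[], which already
 * holds *nsegs entries from earlier calls. A page that directly follows
 * the last entry is merged into it, as long as the entry stays within
 * max_seg bytes. Otherwise a new entry is started.
 * Returns 0 on success, -1 if segs[] has no room for a new entry.
 */
int append_pages(struct seg *segs, size_t cap, size_t *nsegs,
		 const unsigned long *pfns, size_t n, unsigned long max_seg)
{
	assert(max_seg >= PAGE_SZ);

	for (size_t i = 0; i < n; i++) {
		if (*nsegs > 0) {
			struct seg *last = &segs[*nsegs - 1];

			if (last->first_pfn + last->len / PAGE_SZ == pfns[i] &&
			    last->len + PAGE_SZ <= max_seg) {
				last->len += PAGE_SZ; /* extend the current run */
				continue;
			}
		}
		if (*nsegs == cap)
			return -1;
		segs[*nsegs].first_pfn = pfns[i];
		segs[*nsegs].len = PAGE_SZ;
		(*nsegs)++;
	}
	return 0;
}
```

Because the last entry carries over between calls, a caller can feed pages in small batches and still end up with the same merged runs it would get from one large array, which is the point of not needing the temporary buffer.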
The second patch changes the Infiniband driver to use the new API. It
removes duplicate functionality from the code and benefits from the
optimization of dynamically allocating the SG table from pages.
On a system with 2MB huge pages, without this change, the SG table
would contain 512 times as many SG entries.
E.g. for 100GB memory registration:
          Number of entries    Size
Before    26214400             600.0MB
After     51200                1.2MB
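The figures in the table can be reproduced with a short calculation. The sketch below rests on assumptions not stated in the text: "Before" uses one entry per 4KB page, "After" uses one entry per 2MB huge page, sizes are binary (100GB = 100 * 2^30 bytes), and each SG entry takes 24 bytes. The 24-byte figure is inferred from 600.0MB / 26214400. The real sizeof(struct scatterlist) depends on architecture and kernel configuration. The function names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Assumption: bytes per SG entry, inferred from the table above. */
#define SG_ENTRY_BYTES 24ULL

/* Number of entries needed when each entry covers seg_bytes of memory. */
uint64_t sg_entries(uint64_t region_bytes, uint64_t seg_bytes)
{
	assert(seg_bytes != 0);
	return (region_bytes + seg_bytes - 1) / seg_bytes; /* round up */
}

/* Memory consumed by a table with the given number of entries. */
uint64_t sg_table_bytes(uint64_t entries)
{
	return entries * SG_ENTRY_BYTES;
}
```

With these assumptions, sg_entries(100ULL << 30, 4096) gives 26214400 entries (629145600 bytes, i.e. 600MB), and sg_entries(100ULL << 30, 2ULL << 20) gives 51200 entries (1228800 bytes, roughly 1.2MB), a factor of 512.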
Thanks
Maor Gottlieb (2):
  lib/scatterlist: Add support in dynamic allocation of SG table from
    pages
  RDMA/umem: Move to allocate SG table from pages

Tvrtko Ursulin (2):
  tools/testing/scatterlist: Rejuvenate bit-rotten test
  tools/testing/scatterlist: Show errors in human readable form
drivers/gpu/drm/i915/gem/i915_gem_userptr.c | 12 +-
drivers/gpu/drm/vmwgfx/vmwgfx_ttm_buffer.c | 15 ++-
drivers/infiniband/core/umem.c | 94 ++-------------
include/linux/scatterlist.h | 38 +++---
lib/scatterlist.c | 125 ++++++++++++++++----
tools/testing/scatterlist/Makefile | 3 +-
tools/testing/scatterlist/linux/mm.h | 35 ++++++
tools/testing/scatterlist/main.c | 53 ++++++---
8 files changed, 225 insertions(+), 150 deletions(-)
--
2.26.2