[Mesa-dev] [PATCH 1/2] i965: Remove hand-rolled memcpy implementation.

Kenneth Graunke kenneth at whitecape.org
Mon Feb 23 00:14:48 PST 2015


On Sunday, February 22, 2015 08:40:17 PM Matt Turner wrote:
> ---
> I can't believe this is beneficial, especially since it came from the
> i915tex import.
> 
>  src/mesa/drivers/dri/i965/brw_context.h       | 27 ---------------------------
>  src/mesa/drivers/dri/i965/intel_batchbuffer.c |  2 +-
>  2 files changed, 1 insertion(+), 28 deletions(-)
> 
> diff --git a/src/mesa/drivers/dri/i965/brw_context.h b/src/mesa/drivers/dri/i965/brw_context.h
> index 197ed3e..7e86c1b 100644
> --- a/src/mesa/drivers/dri/i965/brw_context.h
> +++ b/src/mesa/drivers/dri/i965/brw_context.h
> @@ -1920,33 +1920,6 @@ gen6_upload_push_constants(struct brw_context *brw,
>                             struct brw_stage_state *stage_state,
>                             enum aub_state_struct_type type);
>  
> -/* ================================================================
> - * From linux kernel i386 header files, copes with odd sizes better
> - * than COPY_DWORDS would:
> - * XXX Put this in src/mesa/main/imports.h ???
> - */
> -#if defined(i386) || defined(__i386__)
> -static inline void * __memcpy(void * to, const void * from, size_t n)
> -{
> -   int d0, d1, d2;
> -   __asm__ __volatile__(
> -      "rep ; movsl\n\t"
> -      "testb $2,%b4\n\t"
> -      "je 1f\n\t"
> -      "movsw\n"
> -      "1:\ttestb $1,%b4\n\t"
> -      "je 2f\n\t"
> -      "movsb\n"
> -      "2:"
> -      : "=&c" (d0), "=&D" (d1), "=&S" (d2)
> -      :"0" (n/4), "q" (n),"1" ((long) to),"2" ((long) from)
> -      : "memory");
> -   return (to);
> -}
> -#else
> -#define __memcpy(a,b,c) memcpy(a,b,c)
> -#endif
> -
>  #ifdef __cplusplus
>  }
>  #endif
> diff --git a/src/mesa/drivers/dri/i965/intel_batchbuffer.c b/src/mesa/drivers/dri/i965/intel_batchbuffer.c
> index 7b96005..45c7493 100644
> --- a/src/mesa/drivers/dri/i965/intel_batchbuffer.c
> +++ b/src/mesa/drivers/dri/i965/intel_batchbuffer.c
> @@ -390,7 +390,7 @@ intel_batchbuffer_data(struct brw_context *brw,
>  {
>     assert((bytes & 3) == 0);
>     intel_batchbuffer_require_space(brw, bytes, ring);
> -   __memcpy(brw->batch.map + brw->batch.used, data, bytes);
> +   memcpy(brw->batch.map + brw->batch.used, data, bytes);
>     brw->batch.used += bytes >> 2;
>  }
>  
> 

I submitted an identical patch in September 2013:
http://lists.freedesktop.org/archives/mesa-dev/2013-September/045360.html

Ian seemed to think it was for atom systems, and suggested it might be
useful to use on 64-bit atoms as well.  He also requested benchmarking
proving it wasn't useful rather than my handwaving:

http://lists.freedesktop.org/archives/mesa-dev/2013-September/045516.html

However, in the i965 driver, it appears to only be used for:
- glBitmap BLT commands
- Gen4 URB fences
So, unless I'm wrong, this is so irrelevant that I don't have any qualms
about nuking it.

In i915c, it's apparently used for a lot more.  I really can't get
motivated about benchmarking CPU overhead on 8xx/9xx, though...
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: This is a digitally signed message part.
URL: <http://lists.freedesktop.org/archives/mesa-dev/attachments/20150223/574eb325/attachment.sig>


More information about the mesa-dev mailing list