[Mesa-dev] [PATCH] intel: Fix glCopyTexSubImage on buffers whose width >= 32kbytes
Kenneth Graunke
kenneth at whitecape.org
Thu Jan 24 17:29:19 PST 2013
On 01/24/2013 04:54 PM, Paul Berry wrote:
> When possible, glCopyTexSubImage calls are performed using the
> hardware blitter. However, according to the Ivy Bridge PRM, Vol1
> Part4, section 1.2.1.2 (Graphics Data Size Limitations):
>
> The BLT engine is capable of transferring very large quantities of
> graphics data. Any graphics data read from and written to the
> destination is permitted to represent a number of pixels that
> occupies up to 65,536 scan lines and up to 32,768 bytes per scan
> line at the destination. The maximum number of pixels that may be
> represented per scan line’s worth of graphics data depends on the
> color depth.
>
> With an RGBA32F color buffer (which has 16 bytes per pixel) this
> imposes a maximum width of 2048 pixels. Other pixel formats have
> accordingly larger limits.
>
> To make matters worse, if the pitch of the buffer is 32k or greater,
> intel_copy_texsubimage's call to intelEmitCopyBlit will overflow
> intelEmitCopyBlit's src_pitch and dst_pitch parameters (which are
> 16-bit signed integers).
>
> We can conveniently avoid both problems by avoiding use of the blitter
> when the miptree's pitch is >= 32k.
>
> Fixes gles3conform "framebuffer_blit_functionality_magnifying_blit"
> tests when the buffer width is equal to 8192.
>
> Note: this is very similar to the recent patch "intel: Fix ReadPixels
> on buffers whose width >= 32kbytes" except that it applies to
> glCopyTexSubImage instead of glReadPixels. In a future patch it would
> be nice to refactor the code so that (a) overflow is avoided, and (b)
> intelEmitCopyBlit is responsible for checking whether the blitter can
> handle the width, so that all callers of intelEmitCopyBlit work
> properly, rather than just these two.
> ---
> src/mesa/drivers/dri/intel/intel_tex_copy.c | 21 +++++++++++++++++++++
> 1 file changed, 21 insertions(+)
>
> diff --git a/src/mesa/drivers/dri/intel/intel_tex_copy.c b/src/mesa/drivers/dri/intel/intel_tex_copy.c
> index 1af7b1c..c9cbcf4 100644
> --- a/src/mesa/drivers/dri/intel/intel_tex_copy.c
> +++ b/src/mesa/drivers/dri/intel/intel_tex_copy.c
> @@ -70,6 +70,27 @@ intel_copy_texsubimage(struct intel_context *intel,
> assert(region);
> }
>
> + /* According to the Ivy Bridge PRM, Vol1 Part4, section 1.2.1.2 (Graphics
> + * Data Size Limitations):
> + *
> + * The BLT engine is capable of transferring very large quantities of
> + * graphics data. Any graphics data read from and written to the
> + * destination is permitted to represent a number of pixels that
> + * occupies up to 65,536 scan lines and up to 32,768 bytes per scan line
> + * at the destination. The maximum number of pixels that may be
> + * represented per scan line’s worth of graphics data depends on the
> + * color depth.
> + *
> + * Furthermore, intelEmitCopyBlit (which is called below) uses a signed
> + * 16-bit integer to represent buffer pitch, so it can only handle buffer
> + * pitches < 32k.
> + *
> + * As a result of these two limitations, we can only use the blitter to do
> + * this copy when the region's pitch is less than 32k.
> + */
> + if (region->pitch >= 32768)
> + return false;
> +
> if (intelImage->base.Base.TexObject->Target == GL_TEXTURE_1D_ARRAY ||
> intelImage->base.Base.TexObject->Target == GL_TEXTURE_2D_ARRAY) {
> perf_debug("no support for array textures\n");
>
Reviewed-by: Kenneth Graunke <kenneth at whitecape.org>
More information about the mesa-dev
mailing list