[Mesa-dev] [PATCH 2/4 v4] i965: Implement CopyTexSubImage2D via BLORP (and use it by default).
Paul Berry
stereotype441 at gmail.com
Wed Feb 6 05:42:02 PST 2013
On 5 February 2013 22:40, Kenneth Graunke <kenneth at whitecape.org> wrote:
> The BLT engine has many limitations. Currently, it can only blit
> X-tiled buffers (since we don't have a kernel API to whack the BLT
> tiling mode register), which means all depth/stencil operations get
> punted to meta code, which can be very CPU-intensive.
>
> Even if we used the BLT engine, it can't blit between buffers with
> different tiling modes, such as an X-tiled non-MSAA ARGB8888 texture
> and a Y-tiled CMS ARGB8888 renderbuffer. This is a fundamental
> limitation, and the only way around that is to use BLORP.
>
> Previously, BLORP only handled BlitFramebuffer. This patch adds an
> additional frontend for doing CopyTexSubImage. It also makes it the
> default. This is partly to increase testing and avoid hiding bugs,
> and partly because the BLORP path can already handle more cases. With
> trivial extensions, it should be able to handle everything the BLT can.
>
> This helps PlaneShift massively, which tries to CopyTexSubImage2D
> between depth buffers whenever a player casts a spell. Since these
> are Y-tiled, we hit meta and software ReadPixels paths, eating 99% CPU
> while delivering ~1 FPS. This is particularly bad in an MMO setting
> because people cast spells all the time.
>
> It also helps Xonotic in 4X MSAA mode. At default power management
> settings, I measured a 6.35138% +/- 0.672548% performance boost (n=5).
> (This data is from v1 of the patch.)
>
> No Piglit regressions on Ivybridge (v3) or Sandybridge (v2).
>
> v2: Create a fake intel_renderbuffer to wrap the destination texture
> image and then reuse do_blorp_blit rather than reimplementing most
> of it. Remove unnecessary clipping code and conditional rendering
> check.
>
> v3: Reuse formats_match() to centralize checks; delete temporary
> renderbuffers. Reorganize the code.
>
> v4: Actually copy stencil when dealing with separate stencil buffers but
> packed depth/stencil formats. Tested by a new Piglit test.
>
> NOTE: This is a candidate for the 9.1 branch.
>
> Signed-off-by: Kenneth Graunke <kenneth at whitecape.org>
> Cc: Paul Berry <stereotype441 at gmail.com>
> Reviewed-and-tested-by: Carl Worth <cworth at cworth.org> [v2]
> Reviewed-by: Ian Romanick <ian.d.romanick at intel.com> [v3]
> Tested-by: Martin Steigerwald <martin at lichtvoll.de> [v3]
>
Reviewed-by: Paul Berry <stereotype441 at gmail.com>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/mesa-dev/attachments/20130206/ab34219e/attachment.html>
More information about the mesa-dev
mailing list