[Intel-gfx] [PATCH 1/2] drm/i915: Prefault the entire object on first page fault
Chris Wilson
chris at chris-wilson.co.uk
Thu Jun 12 09:21:25 CEST 2014
On Wed, Jun 11, 2014 at 01:41:45PM -0700, Volkin, Bradley D wrote:
> On Tue, Jun 10, 2014 at 04:14:40AM -0700, Chris Wilson wrote:
> > Inserting additional PTEs has no side-effect for us as the pfn are fixed
> > for the entire time the object is resident in the global GTT. The
> > downside is that we pay the entire cost of faulting the object upon the
> > first hit, for which we in return receive the benefit of removing the
> > per-page faulting overhead.
> >
> > On an Ivybridge i7-3720qm with 1600MHz DDR3, with 32 fences,
using i-g-t/gem_fence_upload
> > Upload rate for 2 linear surfaces: 8127MiB/s -> 8134MiB/s
> > Upload rate for 2 tiled surfaces: 8607MiB/s -> 8625MiB/s
> > Upload rate for 4 linear surfaces: 8127MiB/s -> 8127MiB/s
> > Upload rate for 4 tiled surfaces: 8611MiB/s -> 8602MiB/s
> > Upload rate for 8 linear surfaces: 8114MiB/s -> 8124MiB/s
> > Upload rate for 8 tiled surfaces: 8601MiB/s -> 8603MiB/s
> > Upload rate for 16 linear surfaces: 8110MiB/s -> 8123MiB/s
> > Upload rate for 16 tiled surfaces: 8595MiB/s -> 8606MiB/s
> > Upload rate for 32 linear surfaces: 8104MiB/s -> 8121MiB/s
> > Upload rate for 32 tiled surfaces: 8589MiB/s -> 8605MiB/s
> > Upload rate for 64 linear surfaces: 8107MiB/s -> 8121MiB/s
> > Upload rate for 64 tiled surfaces: 2013MiB/s -> 3017MiB/s
> >
Testcase: i-g-t/gem_fence_upload
> > Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
> > Cc: "Goel, Akash" <akash.goel at intel.com>
>
> For reproducibility it would be nice to have the testcase info, assuming
> it's something from i-g-t. Other than that, I think this change looks good.
It was a proposed test case along with the last set of patches. I should
have referenced it properly in the commit.
-Chris
--
Chris Wilson, Intel Open Source Technology Centre
More information about the Intel-gfx
mailing list