[Intel-gfx] [PATCH] RFC drm/i915: Slaughter the thundering i915_wait_request herd
Gong, Zhipeng
zhipeng.gong at intel.com
Mon Nov 2 06:00:47 PST 2015
Attach the perf data for BDW async1 and async5 with or without patch.
> -----Original Message-----
> From: Chris Wilson [mailto:chris at chris-wilson.co.uk]
> Sent: Monday, November 02, 2015 7:42 PM
> To: Gong, Zhipeng
> Cc: intel-gfx at lists.freedesktop.org; Rogozhkin, Dmitry V
> Subject: Re: [PATCH] RFC drm/i915: Slaughter the thundering
> i915_wait_request herd
>
> On Mon, Nov 02, 2015 at 11:26:29AM +0000, Gong, Zhipeng wrote:
> > > -----Original Message-----
> > > From: Chris Wilson [mailto:chris at chris-wilson.co.uk]
> > > Sent: Monday, November 02, 2015 5:59 PM
> > > To: Gong, Zhipeng
> > > Cc: intel-gfx at lists.freedesktop.org; Rogozhkin, Dmitry V
> > > Subject: Re: [PATCH] RFC drm/i915: Slaughter the thundering
> > > i915_wait_request herd
> > >
> > > On Mon, Nov 02, 2015 at 05:39:54AM +0000, Gong, Zhipeng wrote:
> > > > Chris-
> > > >
> > > > The patch cannot be applied on the latest drm-intel-nightly directly.
> > > > I modified it a little bit to make it applied.
> > > > The patch can help much in HSW, but a little bit in BDW.
> > > > The test is to transcode 26 streams, which creates 244 threads.
> > > >
> > > > CPU util | w/o patch | w/ patch
> > > > ----------------------------------------------------------
> > > > HSW async 1 | 102% | 61%
> > > > HSW async 5 | 114% | 46%
> > > > BDW async 1 | 116% | 116%
> > > > BDW async 5 | 111% | 107%
> > >
> > > Could I get the perf report for the kernel time? One aspect that I
> > > find hard to believe is that it is not the execbuf/mutex-contention
> > > that is the ratelimiting step.
> >
> > Sure, what command would you like to run with "perf"?
>
> Each of them :) I want to be sure that I know what's going on with bdw (to
> check if my semaphores guess is correct), and comparing 1-vs-5 should help
> understand the contention points better.
> As for the actual command, something like
> perf report -G -d '[kernel.vmlinux]' | head -5000 should do, though you may
> have to modify the DSO list to match if you don't use a i915.ko builtin.
> -Chris
>
> --
> Chris Wilson, Intel Open Source Technology Centre
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: perf.async5.patch.txt
URL: <http://lists.freedesktop.org/archives/intel-gfx/attachments/20151102/2dfd9b41/attachment-0004.txt>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: perf.async1.orig.txt
URL: <http://lists.freedesktop.org/archives/intel-gfx/attachments/20151102/2dfd9b41/attachment-0005.txt>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: perf.async1.patch.txt
URL: <http://lists.freedesktop.org/archives/intel-gfx/attachments/20151102/2dfd9b41/attachment-0006.txt>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: perf.async5.orig.txt
URL: <http://lists.freedesktop.org/archives/intel-gfx/attachments/20151102/2dfd9b41/attachment-0007.txt>
More information about the Intel-gfx
mailing list