[Intel-gfx] [PATCH] RFC drm/i915: Slaughter the thundering i915_wait_request herd

Chris Wilson chris at chris-wilson.co.uk
Mon Nov 2 03:41:34 PST 2015


On Mon, Nov 02, 2015 at 11:26:29AM +0000, Gong, Zhipeng wrote:
> > -----Original Message-----
> > From: Chris Wilson [mailto:chris at chris-wilson.co.uk]
> > Sent: Monday, November 02, 2015 5:59 PM
> > To: Gong, Zhipeng
> > Cc: intel-gfx at lists.freedesktop.org; Rogozhkin, Dmitry V
> > Subject: Re: [PATCH] RFC drm/i915: Slaughter the thundering
> > i915_wait_request herd
> > 
> > On Mon, Nov 02, 2015 at 05:39:54AM +0000, Gong, Zhipeng wrote:
> > > Chris-
> > >
> > > The patch cannot be applied on the latest drm-intel-nightly directly.
> > > I modified it a little bit to make it applied.
> > > The patch can help much in HSW, but a little bit in BDW.
> > > The test is to transcode 26 streams, which creates 244 threads.
> > >
> > > CPU util      |	w/o patch  |   w/ patch
> > > ----------------------------------------------------------
> > > HSW async 1   |   102%     |     61%
> > > HSW async 5   |   114%     |     46%
> > > BDW async 1   |   116%     |     116%
> > > BDW async 5   |   111%     |     107%
> > 
> > Could I get the perf report for the kernel time? One aspect that I find hard to
> > believe is that it is not the execbuf/mutex-contention that is the ratelimiting
> > step.
> 
> Sure, what command would you like to run with "perf"?

Each of them :) I want to be sure that I know what's going on with bdw
(to check if my semaphores guess is correct), and comparing 1-vs-5
should help understand the contention points better.
As for the actual command, something like
  perf report -G -d '[kernel.vmlinux]' | head -5000
should do, though you may have to modify the DSO list to match if you
don't use a i915.ko builtin.
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre


More information about the Intel-gfx mailing list