[Intel-gfx] [PATCH 1/2] igt/gem_exec_nop: add burst submission to parallel execution test

Dave Gordon david.s.gordon at intel.com
Thu Aug 18 15:59:44 UTC 2016


On 18/08/16 16:27, Dave Gordon wrote:
> On 18/08/16 13:01, John Harrison wrote:

[snip]

>> Can you post the numbers that you get?
>>
>> I seem to get massive variability on my BDW. The render ring always
>> gives me around 2.9us/batch but the other rings sometimes give me region
>> of 1.2us and sometimes 7-8us.
>
> skylake# ./intel-gpu-tools/tests/gem_exec_nop --run-subtest basic
> IGT-Version: 1.15-gd09ad86 (x86_64) (Linux:
> 4.8.0-rc1-dsg-10839-g5e5a29c-z-tvrtko-fwname x86_64)
> Using GuC submission
> render: 594,944 cycles: 3.366us/batch
> bsd: 737,280 cycles: 2.715us/batch
> blt: 833,536 cycles: 2.400us/batch
> vebox: 710,656 cycles: 2.818us/batch
> Slowest engine was render, 3.366us/batch
> Total for all 4 engines is 11.300us per cycle, average 2.825us/batch
> All 4 engines (parallel/64): 5,324,800 cycles, average 1.878us/batch,
> overlap 90.1%
> Subtest basic: SUCCESS (18.013s)

That was GuC f/w 6.1, here's the results from 8.11:

skylake# sudo ./intel-gpu-tools/tests/gem_exec_nop --run-subtest basic
IGT-Version: 1.15-gd09ad86 (x86_64) (Linux: 
4.8.0-rc2-dsg-11313-g7430e5f-dsg-work-101 x86_64)
Using GuC submission
render: 585,728 cycles: 3.418us/batch
bsd: 930,816 cycles: 2.151us/batch
blt: 930,816 cycles: 2.150us/batch
vebox: 930,816 cycles: 2.150us/batch
Slowest engine was render, 3.418us/batch
Total for all 4 engines is 9.869us per cycle, average 2.467us/batch
All 4 engines (parallel/64): 5,668,864 cycles, average 1.765us/batch, 
overlap 89.9%
Subtest basic: SUCCESS (18.016s)

... showing minor improvements generally, especially the non-render engines.

.Dave.


More information about the Intel-gfx mailing list