[Intel-gfx] [PATCH i-g-t] scripts/trace.pl: Optimize event parsing and processing
John Harrison
John.C.Harrison at Intel.com
Wed Dec 20 23:50:44 UTC 2017
On 12/20/2017 1:54 AM, Tvrtko Ursulin wrote:
> What was the effect of all this on your big traces? I am only testing
> with a smaller one which goes from ~3.3s to ~2.2s. On a larger trace
> it might be non-linear gains due to double sort avoidance, unless
> there will be some other effects to cancel that out.
>
So with a trace of a shortened gem_exec_nop/basic_sequential, the 'perf
script' output is 439MB and the original trace.pl before any of the
changes took ~180s. After the 'auto-detect field order' patch, it went
up to ~201s. With the optimisation patch it is down to ~129s.
However, I am also seeing differences in the HTML output since the
optimisation patch. The differences aren't massive, just slight
variations in the times. The structure is all the same, its just that
the accounting and/or time stamps are out.
For example:
{id: 1, content: 'Ring0<br><small><br>79.48% idle<br><br>34.32%
busy<br>584.97% runnable<br>2103.60% queued<br><br>16.18%
wait<br><br>200931 batches<br>331.28us avg batch<br>331.38us avg engine
batch<br></small>'},
vs
{id: 1, content: 'Ring0<br><small><br>79.48% idle<br><br>34.32%
busy<br>584.97% runnable<br>2103.60% queued<br><br>16.18%
wait<br><br>200931 batches<br>338.56us avg batch<br>338.56us avg engine
batch<br></small>'},
Or:
{id: 58, key: -210383407, type: 'range', group: 4, subgroup: 2,
subgroupOrder: 3, content: '428/3 <small>0</small>
<small><i>???</i></small> <small><i>++</i></small> <br>142us
<small>(0us)</small>', start: '2017-01-05 21:27:45.352968', end:
'2017-01-05 21:27:45.353110', style: 'color: white; background-color:
red;'},
vs
{id: 58, key: -210383407, type: 'range', group: 4, subgroup: 2,
subgroupOrder: 3, content: '428/3 <small>0</small>
<small><i>???</i></small> <small><i>++</i></small> <br>159us
<small>(0us)</small>', start: '2017-01-05 21:27:45.352968', end:
'2017-01-05 21:27:45.353127', style: 'color: white; background-color:
red;'},
I can send you the full output if it is useful and the source logs too.
The HTML output is about 840KB but as noted, the perf logs are hundreds
of MBs.
John.
More information about the Intel-gfx
mailing list