[Intel-gfx] [PATCH i-g-t 7/9] tests/perf_pmu: Tests for i915 PMU API
Tvrtko Ursulin
tvrtko.ursulin at linux.intel.com
Tue Oct 10 13:38:10 UTC 2017
On 10/10/2017 13:37, Chris Wilson wrote:
> Quoting Tvrtko Ursulin (2017-10-10 10:30:06)
>> +static void
>> +event_wait(int gem_fd, const struct intel_execution_engine2 *e)
>> +{
>> + struct drm_i915_gem_exec_object2 obj = { };
>> + struct drm_i915_gem_execbuffer2 eb = { };
>> + data_t data;
>> + igt_display_t *display = &data.display;
>> + const uint32_t DERRMR = 0x44050;
>> + unsigned int valid_tests = 0;
>> + uint32_t batch[8], *b;
>> + igt_output_t *output;
>> + uint32_t bb_handle;
>> + uint32_t reg;
>> + enum pipe p;
>> + int fd;
>> +
>> + igt_require(intel_gen(intel_get_drm_devid(gem_fd)) >= 6);
>> + igt_require(intel_register_access_init(intel_get_pci_device(),
>> + false, gem_fd) == 0);
>> +
>> + /**
>> + * We will use the display to render event forwarind so need to
>> + * program the DERRMR register and restore it at exit.
>
> DERRMR is always masked until we need it. If you really wanted to
> preserve the old value, SRM, LRM around the test. Not that fussed, just
> a general dislike of direct poking of registers.
Thought I can get away with it since the handle is drm master and also
there doesn't seem to be a patch in i915 which touches this register.
>> + *
>> + * We will emit a MI_WAIT_FOR_EVENT listening for vblank events,
>> + * have a background helper to indirectly enable vblank irqs, and
>> + * listen to the recorded time spent in engine wait state as reported
>> + * by the PMU.
>> + */
>> + reg = intel_register_read(DERRMR);
>> +
>> + kmstest_set_vt_graphics_mode();
>> + igt_display_init(&data.display, gem_fd);
>> +
>> + bb_handle = gem_create(gem_fd, 4096);
>> +
>> + b = batch;
>> + *b++ = MI_LOAD_REGISTER_IMM;
>> + *b++ = DERRMR;
>> + *b++ = reg & ~((1 << 3) | (1 << 11) | (1 << 21));
>> + *b++ = MI_WAIT_FOR_EVENT | MI_WAIT_FOR_PIPE_A_VBLANK;
>> + *b++ = MI_LOAD_REGISTER_IMM;
>> + *b++ = DERRMR;
>> + *b++ = reg;
>> + *b++ = MI_BATCH_BUFFER_END;
>
>> +static int chain_nop(int gem_fd, unsigned long sz, int in_fence, bool sync)
>> +{
>> + struct drm_i915_gem_exec_object2 obj = {};
>> + struct drm_i915_gem_execbuffer2 eb =
>> + { .buffer_count = 1, .buffers_ptr = (uintptr_t)&obj};
>> + const uint32_t bbe = 0xa << 23;
>> +
>> + sz = ALIGN(sz, sizeof(uint32_t));
>> +
>> + obj.handle = gem_create(gem_fd, sz);
>> + gem_write(gem_fd, obj.handle, sz - sizeof(bbe), &bbe, sizeof(bbe));
>> +
>> + eb.flags = I915_EXEC_RENDER | I915_EXEC_FENCE_OUT;
>> +
>> + if (in_fence >= 0) {
>> + eb.flags |= I915_EXEC_FENCE_IN;
>> + eb.rsvd2 = in_fence;
>> + }
>> +
>> + gem_execbuf_wr(gem_fd, &eb);
>
> On the same ctx/engine, this shouldn't be generating interrupts between
> the batches. The fence should be resolved to an i915 request and then we
> see that the requests are naturally ordered so the fence is elided.
>> +
>> + if (sync)
>> + gem_sync(gem_fd, obj.handle);
>
> So it looks like this will remain the only interrupt generator.
>
> If we exported the out-fence and then polled that, that is currently
> hooked up to an interrupt only path.
So you think I'm counting ctx switches and other stuff? Maybe we should
have separate PMU counters for all the different interrupts. :)
What do you mean by exporting the out fence? Looping it through via
something external to hide the source? Not sure how to do it. Dup the
fd? Or create a swfence and merge the out fence to it? (Talking from
memory here.)
P.S.
I did miss to subtract the previous count before the assert.
Regards,
Tvrtko
>
>
> I don't have anything else to say! Afaict you have everything covered,
> so the only way to find what's not is by letting it go live!
>
> Give or take more tuning of the interrupt test,
> Reviewed-by: Chris Wilson <chris at chris-wilson.co.uk>
> -Chris
> _______________________________________________
> Intel-gfx mailing list
> Intel-gfx at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
>
More information about the Intel-gfx
mailing list