[igt-dev] [PATCH i-g-t 1/3] tests/gem_ctx_shared: Adopt to use allocator
Zbigniew Kempczyński
zbigniew.kempczynski at intel.com
Fri Aug 20 04:46:14 UTC 2021
On Thu, Aug 19, 2021 at 02:16:13PM -0700, Dixit, Ashutosh wrote:
> On Wed, 18 Aug 2021 21:49:39 -0700, Zbigniew Kempczyński wrote:
> >
> > @@ -518,12 +533,24 @@ static void store_dword(int i915, const intel_ctx_t *ctx, unsigned ring,
> >
> > memset(obj, 0, sizeof(obj));
> > obj[0].handle = cork;
> > - obj[0].offset = cork << 20;
> > obj[1].handle = target;
> > - obj[1].offset = target << 20;
> > obj[2].handle = gem_create(i915, 4096);
> > - obj[2].offset = 256 << 10;
> > - obj[2].offset += (random() % 128) << 12;
> > + if (ahnd) {
> > + obj[0].offset = cork_offset;
> > + obj[0].flags |= EXEC_OBJECT_PINNED;
> > + obj[1].offset = target_offset;
> > + obj[1].flags |= EXEC_OBJECT_PINNED;
> > + if (write_domain)
> > + obj[1].flags |= EXEC_OBJECT_WRITE;
> > + obj[2].offset = get_offset(ahnd, obj[2].handle, 4096, 0x0);
> > + obj[2].flags |= EXEC_OBJECT_PINNED;
> > + execbuf.flags |= I915_EXEC_NO_RELOC;
>
> I915_EXEC_NO_RELOC needed (since I think we weren't doing this earlier)?
That's minor optimization for softpinning. We got relocation_count == 0 for
each gem object, so eb->relocs is empty list. We also don't expect offset
change on execbuf return.
>
> > -static void unplug_show_queue(int i915, struct igt_cork *c,
> > +static void unplug_show_queue(int i915, struct igt_cork *c, uint64_t ahnd,
> > const intel_ctx_cfg_t *cfg, unsigned int engine)
> > {
> > igt_spin_t *spin[MAX_ELSP_QLEN];
> >
> > for (int n = 0; n < ARRAY_SIZE(spin); n++) {
> > const intel_ctx_t *ctx = create_highest_priority(i915, cfg);
> > - spin[n] = __igt_spin_new(i915, .ctx = ctx, .engine = engine);
> > +
> > + /*
> > + * When we're using same vm we should use allocator handle
> > + * passed by the caller. This is the case where cfg->vm
> > + * is not NULL.
> > + *
> > + * For cases where context use its own vm we need separate
> > + * ahnd for it.
> > + */
> > + if (!cfg->vm)
> > + ahnd = get_reloc_ahnd(i915, ctx->id);
>
> Ok, good, even though in this code we always used shared vm this is good in
> case something changes later.
>
> > @@ -599,8 +643,10 @@ static uint32_t store_timestamp(int i915,
> > uint32_t handle = gem_create(i915, 4096);
> > struct drm_i915_gem_exec_object2 obj = {
> > .handle = handle,
> > - .relocation_count = 1,
> > - .offset = (32 << 20) + (handle << 16),
> > + .relocation_count = !ahnd ? 1 : 0,
> > + .offset = !ahnd ? (32 << 20) + (handle << 16) :
> > + get_offset(ahnd, handle, 4096, 0),
> > + .flags = !ahnd ? 0 : EXEC_OBJECT_PINNED,
>
> | EXEC_OBJECT_WRITE ?
No, see there's no write_domain in reloc struct. We're waiting for objects
explicitely - see gem_set_domain() /* no write hazard lies */.
>
> > @@ -612,7 +658,7 @@ static uint32_t store_timestamp(int i915,
> > struct drm_i915_gem_execbuffer2 execbuf = {
> > .buffers_ptr = to_user_pointer(&obj),
> > .buffer_count = 1,
> > - .flags = ring | I915_EXEC_FENCE_IN,
> > + .flags = ring | I915_EXEC_FENCE_IN | (!!ahnd * I915_EXEC_NO_RELOC),
>
> I915_EXEC_NO_RELOC needed? Also !! has gone out of fashion :)
I wanted to put minimal code changes here. If you really don't like !!ahnd
I will remove it, putting dedicated if () to add this flag later.
>
> > @@ -831,6 +903,8 @@ static void smoketest(int i915, const intel_ctx_cfg_t *cfg,
> > unsigned engine;
> > uint32_t scratch;
> > uint32_t *ptr;
> > + uint64_t ahnd = get_reloc_ahnd(i915, 0); /* same vm */
> > + uint64_t scratch_offset;
> >
> > q_cfg = *cfg;
> > q_cfg.vm = gem_vm_create(i915);
> > @@ -848,9 +922,11 @@ static void smoketest(int i915, const intel_ctx_cfg_t *cfg,
> > igt_require(nengine);
> >
> > scratch = gem_create(i915, 4096);
> > + scratch_offset = get_offset(ahnd, scratch, 4096, 0);
> > igt_fork(child, ncpus) {
> > unsigned long count = 0;
> > const intel_ctx_t *ctx;
> > + ahnd = get_reloc_ahnd(i915, 0); /* ahnd to same vm */
>
> Can't we use the parent's ahnd here, since we are using
> intel_allocator_multiprocess_start/stop?
No, you cannot use ahnd created within parent process in the child.
IPC communication rely on child_tid value which is properly assigned on
intel_allocator_open(). Idea of this is to send requests to the allocator
by the children with their own child_tid to distinguish them. Please track
- intel_allocator.c: send_req_recv_resp()
- intel_allocator.c: send_req()
- intel_allocator_msgchannel.c: msgqueue_send_req()
calls. Using child_tid in this communication will become clear then
(especially it's initialization necessity on intel_allocator_open()).
That's why calling intel_allocator_open() in each child is necessary,
it just assigns that magic number for IPC which distinguishes requests.
--
Zbigniew
>
> Please take a look at the nits above and if anything needs to be
> done. Otherwise this is:
>
> Reviewed-by: Ashutosh Dixit <ashutosh.dixit at intel.com>
More information about the igt-dev
mailing list