[Intel-gfx] [PATCH 3/3] drm/i915: Group the irq breadcrumb variables into the same cacheline
Chris Wilson
chris at chris-wilson.co.uk
Wed Jul 6 09:36:34 UTC 2016
On Wed, Jul 06, 2016 at 10:18:32AM +0100, Tvrtko Ursulin wrote:
>
> On 06/07/16 08:45, Chris Wilson wrote:
> >As we inspect both the tasklet (to check for an active bottom-half) and
> >set the irq-posted flag at the same time (both in the interrupt handler
> >and then in the bottom-halt), group those two together into the same
> >cacheline. (Not having total control over placement of the struct means
> >we can't guarantee the cacheline boundary, we need to align the kmalloc
> >and then each struct, but the grouping should help.)
>
> Any actual difference or just tidy?
Just motivated by tidying. I expect this to be in the noise (but since I
have the tools, I should check just in case).
> >@@ -167,16 +167,20 @@ struct intel_engine_cs {
> > * the overhead of waking that client is much preferred.
> > */
> > struct intel_breadcrumbs {
> >+ struct task_struct *irq_tasklet; /* bh for user interrupts */
>
> Tasklet was kind of passable, irq_tasklet is imho worse. I think
> anyone who see that name would thing this handles interrupts of some
> sort. :)
My thinking was to give a similar name to the three variables used in
the irq handler (and bottom-half) and move them aside from the spinlock.
> How about first_wait_task ?
>
> I know it may feel like pointless bike-shedding and maybe it is so I
> am leaving it with you.
Similarity argument still holds imo.
irq_seqno_bh ?
> >+ unsigned long irq_count;
> >+ bool irq_posted;
> >+
> > spinlock_t lock; /* protects the lists of requests */
> > struct rb_root waiters; /* sorted by retirement, priority */
> > struct rb_root signals; /* sorted by retirement */
> > struct intel_wait *first_wait; /* oldest waiter by retirement */
> >- struct task_struct *tasklet; /* bh for user interrupts */
> > struct task_struct *signaler; /* used for fence signalling */
> > struct drm_i915_gem_request *first_signal;
> > struct timer_list fake_irq; /* used after a missed interrupt */
> >- bool irq_enabled;
> >- bool rpm_wakelock;
> >+
> >+ bool irq_enabled : 1;
> >+ bool rpm_wakelock : 1;
>
> Is there much point in having bitfields? To me two plain bools would
> be just fine and smaller code.
In this case a fractionally smaller struct (-4 bytes)
The code size in this case is identical
text data bss dec hex
1067277 4565 416 1072258 105c82 as bool
1067277 4565 416 1072258 105c82 as bool : 1
since we only do very simple test and sets.
-Chris
--
Chris Wilson, Intel Open Source Technology Centre
More information about the Intel-gfx
mailing list