[Intel-gfx] [PATCH v4] drm/i915: avoid processing spurious/shared interrupts in low-power states

Chris Wilson chris at chris-wilson.co.uk
Tue Feb 24 01:29:15 PST 2015


On Tue, Feb 24, 2015 at 11:14:30AM +0200, Imre Deak wrote:
> Atm, it's possible that the interrupt handler is called when the device
> is in D3 or some other low-power state. It can be due to another device
> that is still in D0 state and shares the interrupt line with i915, or on
> some platforms there could be spurious interrupts even without sharing
> the interrupt line. The latter case was reported by Klaus Ethgen using a
> Lenovo x61p machine (gen 4). He noticed this issue via a system
> suspend/resume hang and bisected it to the following commit:
> 
> commit e11aa362308f5de467ce355a2a2471321b15a35c
> Author: Jesse Barnes <jbarnes at virtuousgeek.org>
> Date:   Wed Jun 18 09:52:55 2014 -0700
> 
>     drm/i915: use runtime irq suspend/resume in freeze/thaw
> 
> This is a problem, since in low-power states IIR will always read
> 0xffffffff resulting in an endless IRQ servicing loop.
> 
> Fix this by handling interrupts only when the driver explicitly enables
> them and so it's guaranteed that the interrupt registers return a valid
> value.
> 
> Note that this issue existed even before the above commit, since during
> runtime suspend/resume we never unregistered the handler.
> 
> v2:
> - clarify the purpose of smp_mb() vs. synchronize_irq() in the
>   code comment (Chris)
> 
> v3:
> - no need for an explicit smp_mb(), we can assume that synchronize_irq()
>   and the mmio read/writes in the install hooks provide for this (Daniel)
> - remove code comment as the remaining synchronize_irq() is self
>   explanatory (Daniel)

Why make the assumption though? I thought the comments documenting the
interaction between the irq enablements, the irq handler and shared
interrupts was beneficial. At the very least the assumption should be
made explicit through comments in the code - because I am not convinced
that a cached write will be flushed by an uncached write to another area
of memory. In particular, note that on the gen most troubled by rogue
irqs (gen4), we do not have any memory barriers in the mmio paths.
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre


More information about the Intel-gfx mailing list