[Intel-gfx] [core-for-CI][PATCH v2 0/2] iommu: Remove iova cpu hotplugging flushing

Tvrtko Ursulin tvrtko.ursulin at linux.intel.com
Wed Sep 28 09:22:21 UTC 2022


On 28/09/2022 09:58, Janusz Krzysztofik wrote:
> Manual revert of commit f598a497bc7d ("iova: Add CPU hotplug handler to
> flush rcaches").  It is trying to instantiate a cpuhp notifier from inside
> a cpuhp callback.  That code replaced intel_iommu implementation of
> flushing per-IOVA domain CPU rcaches which used a single instance of cpuhp
> held for the module lifetime.
> 
> v2: Restore formerly reverted commit ac9a5d522bb8 "iommu/dma: Fix race
>      condition during iova_domain initialization" after we have this more
>      effective fix applied.  The former occurred not sufficient and is no
>      longer needed once we have the latter in place.
> 
> Note: I'm no longer CCing IOMMU nor mainstream, they have been already
>        warned.

Will that also fix the one below?

<4> [18.297089] ================================
<4> [18.297090] WARNING: inconsistent lock state
<4> [18.297092] 6.0.0-rc2-CI_DINF_262-g320305923c88+ #1 Not tainted
<4> [18.297095] --------------------------------
<4> [18.297096] inconsistent {SOFTIRQ-ON-W} -> {IN-SOFTIRQ-W} usage.
<4> [18.297098] swapper/0/0 [HC0[0]:SC1[1]:HE1:SE0] takes:
<4> [18.297101] ffff8881030ed678 (&domain->lock){+.?.}-{2:2}, at: iommu_flush_dev_iotlb.part.61+0x23/0x80
<4> [18.297108] {SOFTIRQ-ON-W} state was registered at:
<4> [18.297110]   lock_acquire+0xd3/0x310
<4> [18.297113]   _raw_spin_lock+0x2a/0x40
<4> [18.297116]   domain_update_iommu_cap+0x20b/0x2c0
<4> [18.297118]   intel_iommu_attach_device+0x5bd/0x860
<4> [18.297121]   __iommu_attach_device+0x18/0xe0
<4> [18.297124]   bus_iommu_probe+0x1f3/0x2d0
<4> [18.297126]   bus_set_iommu+0x82/0xd0
<4> [18.297129]   intel_iommu_init+0xe45/0x102a
<4> [18.297132]   pci_iommu_init+0x9/0x31
<4> [18.297135]   do_one_initcall+0x53/0x2f0
<4> [18.297138]   kernel_init_freeable+0x18f/0x1e1
<4> [18.297140]   kernel_init+0x11/0x120
<4> [18.297143]   ret_from_fork+0x1f/0x30
<4> [18.297145] irq event stamp: 512712
<4> [18.297147] hardirqs last  enabled at (512712): [<ffffffff81b59254>] _raw_spin_unlock_irqrestore+0x54/0x70
<4> [18.297151] hardirqs last disabled at (512711): [<ffffffff81b58ffb>] _raw_spin_lock_irqsave+0x4b/0x50
<4> [18.297155] softirqs last  enabled at (512700): [<ffffffff81e00323>] __do_softirq+0x323/0x48e
<4> [18.297158] softirqs last disabled at (512705): [<ffffffff810c16a8>] irq_exit_rcu+0xb8/0xe0
<4> [18.297162]
other info that might help us debug this:
<4> [18.297164]  Possible unsafe locking scenario:
<4> [18.297166]        CPU0
<4> [18.297168]        ----
<4> [18.297169]   lock(&domain->lock);
<4> [18.297171]   <Interrupt>
<4> [18.297172]     lock(&domain->lock);
<4> [18.297174]
  *** DEADLOCK ***
<4> [18.297176] 1 lock held by swapper/0/0:
<4> [18.297177]  #0: ffffc90000003ea8 ((&cookie->fq_timer)){+.-.}-{0:0}, at: call_timer_fn+0x0/0x2c0
<4> [18.297183]
stack backtrace:
<4> [18.297185] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 6.0.0-rc2-CI_DINF_262-g320305923c88+ #1
<4> [18.297189] Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 0415 09/29/2021
<4> [18.297192] Call Trace:
<4> [18.297193]  <IRQ>
<4> [18.297195]  dump_stack_lvl+0x56/0x7f
<4> [18.297198]  mark_lock.part.47+0x8d1/0xdb0
<4> [18.297200]  ? qi_submit_sync+0x3c8/0x650
<4> [18.297203]  __lock_acquire+0x861/0xb90
<4> [18.297205]  lock_acquire+0xd3/0x310
<4> [18.297207]  ? iommu_flush_dev_iotlb.part.61+0x23/0x80
<4> [18.297211]  ? iommu_dma_map_sg+0x400/0x400
<4> [18.297213]  _raw_spin_lock+0x2a/0x40
<4> [18.297215]  ? iommu_flush_dev_iotlb.part.61+0x23/0x80
<4> [18.297218]  iommu_flush_dev_iotlb.part.61+0x23/0x80
<4> [18.297220]  intel_flush_iotlb_all+0x95/0x110
<4> [18.297223]  fq_flush_iotlb+0x1d/0x30
<4> [18.297225]  fq_flush_timeout+0x28/0xc0
<4> [18.297227]  ? iommu_dma_map_sg+0x400/0x400
<4> [18.297229]  ? iommu_dma_map_sg+0x400/0x400
<4> [18.297231]  call_timer_fn+0x9c/0x2c0
<4> [18.297234]  run_timer_softirq+0x548/0x570
<4> [18.297237]  __do_softirq+0xda/0x48e
<4> [18.297240]  irq_exit_rcu+0xb8/0xe0
<4> [18.297242]  sysvec_apic_timer_interrupt+0x9e/0xc0
<4> [18.297245]  </IRQ>

Regards,

Tvrtko

> 
> 
> Chris Wilson (1):
>    iommu: Remove iova cpu hotplugging flushing
> 
> Janusz Krzysztofik (1):
>    Restore "iommu/dma: Fix race condition during iova_domain
>      initialization"
> 
>   drivers/iommu/dma-iommu.c  | 17 +++++++++++++----
>   drivers/iommu/iova.c       | 28 ----------------------------
>   include/linux/cpuhotplug.h |  1 -
>   include/linux/iova.h       |  1 -
>   4 files changed, 13 insertions(+), 34 deletions(-)
> 


More information about the Intel-gfx mailing list