[Intel-gfx] [PATCH] drm/i915: Don't check #active_requests from i915_gem_wait_for_idle()

Chris Wilson chris at chris-wilson.co.uk
Wed Dec 13 12:06:09 UTC 2017


Quoting Joonas Lahtinen (2017-12-13 11:07:01)
> On Tue, 2017-12-12 at 13:21 +0000, Chris Wilson wrote:
> > i915_gem_wait_for_idle() is called from inside the shrinker, to ensure
> > that we drain the last resources from the GPU in dire circumstances (OOM).
> > As we may allocate whilst building a request, it is then possible to hit
> > the shrinker with a request under construction, and so we must account
> > for the incomplete request whilst waiting. In particular, we
> > preincrement (in reserve_engine) the i915->gt.active_requests counter
> > and mark the GPU as busy, therefore we can use that counter for quickly
> > shortcircuiting the wait-for-idle.
> > 
> > [  950.859024] GEM_BUG_ON(i915->gt.active_requests)
> > [  950.859041] WARNING: CPU: 2 PID: 2178 at drivers/gpu/drm/i915/i915_gem.c:3615 i915_gem_wait_for_idle.part.56+0x166/0x4e0
> > [  950.859041] Modules linked in: ccm tun fuse nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack libcrc32c ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_mangle iptable_security iptable_raw arc4 iwldvm mac80211 snd_hda_codec_hdmi snd_hda_codec_idt snd_hda_codec_generic snd_hda_intel snd_hda_codec btusb snd_hda_core btrtl btbcm iwlwifi snd_hwdep btintel bluetooth snd_seq snd_seq_device snd_pcm ecdh_generic x86_pkg_temp_thermal tpm_infineon coretemp tpm_tis crc32_pclmul wmi_bmof crc32c_intel iTCO_wdt hp_wmi snd_timer iTCO_vendor_support sparse_keymap tpm_tis_core mei_me cfg80211
> > [  950.859082]  snd joydev tpm mei rfkill pcspkr wmi soundcore lpc_ich hp_accel lis3lv02d input_polldev binfmt_misc e1000e ptp serio_raw pps_core
> > [  950.859094] CPU: 2 PID: 2178 Comm: gem_exec_nop Tainted: G     U           4.15.0-rc2+ #900
> > [  950.859102] Hardware name: Hewlett-Packard HP ProBook 6360b/1620, BIOS 68SCF Ver. B.42 12/29/2010
> > [  950.859107] task: c5119cb4 task.stack: f3ccb8d8
> > [  950.859112] EIP: i915_gem_wait_for_idle.part.56+0x166/0x4e0
> > [  950.859113] EFLAGS: 00010296 CPU: 2
> > [  950.859114] EAX: 00000024 EBX: f36c1888 ECX: f777a044 EDX: 00000007
> > [  950.859115] ESI: f36c1888 EDI: edd53958 EBP: edd53970 ESP: edd53938
> > [  950.859116]  DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
> > [  950.859117] CR0: 80050033 CR2: b7f39000 CR3: 2f2b3000 CR4: 000406d0
> > [  950.859118] Call Trace:
> > [  950.859125]  ? drm_printk+0x70/0x70
> > [  950.859129]  i915_gem_wait_for_idle+0x18/0x30
> > [  950.859133]  i915_gem_shrink+0x360/0x410
> > [  950.859138]  ? vmpressure+0xa8/0xf0
> > [  950.859142]  ? ktime_get+0x4a/0x100
> > [  950.859147]  i915_gem_shrink_all+0x21/0x40
> > [  950.859151]  i915_gem_shrinker_oom+0x23/0x130
> > [  950.859156]  notifier_call_chain+0x4e/0x70
> > [  950.859160]  __blocking_notifier_call_chain+0x2f/0x60
> > [  950.859164]  blocking_notifier_call_chain+0x11/0x20
> > [  950.859169]  out_of_memory+0x207/0x280
> > [  950.859174]  __alloc_pages_nodemask+0xd47/0xe60
> > [  950.859179]  new_slab+0x32d/0x450
> > [  950.859183]  ___slab_alloc.constprop.81+0x358/0x4e0
> > [  950.859189]  ? i915_sw_fence_await_dma_fence+0x53/0x160
> > [  950.859193]  ? __slab_free+0x1fe/0x310
> > [  950.859197]  ? native_sched_clock+0x1e/0xc0
> > [  950.859201]  ? i915_gem_request_alloc+0xcf/0x510
> > [  950.859205]  ? sched_clock+0x9/0x10
> > [  950.859209]  __slab_alloc.constprop.80+0x29/0x40
> > [  950.859212]  ? __slab_alloc.constprop.80+0x29/0x40
> > [  950.859216]  kmem_cache_alloc_trace+0x160/0x1a0
> > [  950.859220]  ? i915_sw_fence_await_dma_fence+0x53/0x160
> > [  950.859224]  i915_sw_fence_await_dma_fence+0x53/0x160
> > [  950.859229]  i915_gem_request_await_dma_fence+0x1eb/0x390
> > [  950.859233]  i915_gem_request_await_object+0xee/0x230
> > [  950.859239]  i915_gem_do_execbuffer+0xc16/0x1200
> > [  950.859246]  ? irqtime_account_irq+0x3e/0xc0
> > [  950.859251]  ? irq_exit+0x4f/0xb0
> > [  950.859257]  ? smp_apic_timer_interrupt+0x5f/0x110
> > [  950.859261]  ? apic_timer_interrupt+0x35/0x3c
> > [  950.859266]  i915_gem_execbuffer2_ioctl+0x212/0x440
> > [  950.859270]  ? apic_timer_interrupt+0x35/0x3c
> > [  950.859274]  ? i915_gem_do_execbuffer+0x1200/0x1200
> > [  950.859279]  ? insn_get_seg_base+0x1b/0x50
> > [  950.859283]  ? i915_gem_do_execbuffer+0x1200/0x1200
> > [  950.859287]  drm_ioctl_kernel+0x51/0xa0
> > [  950.859291]  drm_ioctl+0x2a3/0x350
> > [  950.859294]  ? i915_gem_do_execbuffer+0x1200/0x1200
> > [  950.859300]  ? sched_clock+0x9/0x10
> > [  950.859303]  ? drm_getunique+0x70/0x70
> > [  950.859308]  do_vfs_ioctl+0x7d/0x640
> > [  950.859311]  ? native_sched_clock+0x1e/0xc0
> > [  950.859315]  ? sched_clock+0x9/0x10
> > [  950.859319]  ? sched_clock_cpu+0x13/0x120
> > [  950.859323]  SyS_ioctl+0x4e/0x80
> > [  950.859326]  do_fast_syscall_32+0x75/0x250
> > [  950.859331]  ? irq_exit+0x4f/0xb0
> > [  950.859334]  entry_SYSENTER_32+0x47/0x71
> > [  950.859338] EIP: 0xb7f81d11
> > [  950.859339] EFLAGS: 00000296 CPU: 2
> > [  950.859340] EAX: ffffffda EBX: 00000003 ECX: 40406469 EDX: bfde4c20
> > [  950.859340] ESI: 00000003 EDI: 40406469 EBP: 00000003 ESP: bfde4b38
> > [  950.859341]  DS: 007b ES: 007b FS: 0000 GS: 0033 SS: 007b
> > [  950.859343] Code: e8 30 60 01 00 83 c4 10 83 c3 04 39 f3 75 e0 8b 45 d8 8b 80 14 37 00 00 85 c0 74 13 68 dd 33 e4 c0 68 49 6f e3 c0 e8 4a 55 be ff <0f> ff 5e 5f b8 fe ff ff 3f bb 0a 00 00 00 e8 b7 14 c4 ff 8b 15
> > 
> > Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
> > Cc: Joonas Lahtinen <joonas.lahtinen at linux.intel.com>
> 
> Reviewed-by: Joonas Lahtinen <joonas.lahtinen at linux.intel.com>

Applied, thanks for the review. Another hopeful explanation for a few
mystery incompletes, although we typically don't stress memory enough in
CI to worry..
-Chris


More information about the Intel-gfx mailing list