[Intel-gfx] [PATCH] drm/i915: Hold a RPM reference during i915_driver_unload

Joonas Lahtinen joonas.lahtinen at linux.intel.com
Wed Dec 30 05:03:25 PST 2015


Hi,

On ti, 2015-12-29 at 12:55 +0200, Gabriel Feceoru wrote:
> This fixes an issue added with: "1f814da drm/i915: add support for
> checking
> if we hold an RPM reference", noticed while running
> drv_module_reload_basic.
> 
> WARNING: CPU: 1 PID: 2032 at drivers/gpu/drm/i915/intel_drv.h:1446
> gen6_read32+0x1ca/0x1e0 [i915]()
> [  138.682686] RPM wakelock ref not held during HW access
> [  138.682687] Modules linked in:
> [  138.682688]  i915(-) drm_kms_helper drm snd_hda_codec_hdmi
> snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec snd_hwdep
> x86_pkg_temp_thermal snd_hda_core i2c_algo_bit syscopyarea
> sysfillrect sysimgblt fb_sys_fops xhci_pci ehci_pci r8169 xhci_hcd
> mii ehci_hcd video [last unloaded: snd_hda_intel]
> [  138.682699] CPU: 1 PID: 2032 Comm: rmmod Tainted: G        W      
>  4.4.0-rc4+ #44
> [  138.682701] Hardware name: Dell Inc. Inspiron 3847/088DT1       ,
> BIOS A06 01/15/2015
> [  138.682702]  ffffffffc03b6358 ffff880210d8ba58 ffffffff813e0c0f
> ffff880210d8baa0
> [  138.682703]  ffff880210d8ba90 ffffffff8105f6a2 ffff8800daa40000
> 0000000000064400
> [  138.682705]  0000000000000004 ffff880210d8bb9c ffff8800daa40000
> ffff880210d8baf0
> [  138.682706] Call Trace:
> [  138.682710]  [<ffffffff813e0c0f>] dump_stack+0x44/0x55
> [  138.682713]  [<ffffffff8105f6a2>] warn_slowpath_common+0x82/0xc0
> [  138.682715]  [<ffffffff8105f72c>] warn_slowpath_fmt+0x4c/0x50
> [  138.682725]  [<ffffffffc031aefc>] ?
> i915_gem_object_unpin_from_display_plane+0x1c/0x50 [i915]
> [  138.682734]  [<ffffffffc0333b9a>] gen6_read32+0x1ca/0x1e0 [i915]
> [  138.682737]  [<ffffffff8172c562>] ? mutex_lock+0x12/0x30
> [  138.682747]  [<ffffffffc03715ca>]
> intel_ddi_get_hw_state+0x7a/0x180 [i915]
> [  138.682758]  [<ffffffffc0355c88>]
> intel_connector_get_hw_state+0x28/0x30 [i915]
> [  138.682767]  [<ffffffffc03543fc>] intel_atomic_commit+0xa9c/0x17e0
> [i915]
> [  138.682779]  [<ffffffffc00a7e8e>] ?
> drm_atomic_check_only+0x18e/0x590 [drm]
> [  138.682786]  [<ffffffffc00a78cc>] ?
> drm_atomic_add_affected_connectors+0x8c/0xf0 [drm]
> [  138.682792]  [<ffffffffc00a82c7>] drm_atomic_commit+0x37/0x60
> [drm]
> [  138.682797]  [<ffffffffc0163356>]
> drm_atomic_helper_set_config+0x76/0xb0 [drm_kms_helper]
> [  138.682804]  [<ffffffffc00a696a>] ?
> drm_modeset_lock_all_ctx+0x9a/0xb0 [drm]
> [  138.682809]  [<ffffffffc00979c2>]
> drm_mode_set_config_internal+0x62/0x100 [drm]
> [  138.682814]  [<ffffffffc0097b48>]
> drm_framebuffer_remove+0xe8/0x120 [drm]
> [  138.682826]  [<ffffffffc036bb4d>] intel_fbdev_fini+0x6d/0x90
> [i915]
> [  138.682838]  [<ffffffffc0396b9a>] i915_driver_unload+0x1a/0x290
> [i915]
> [  138.682844]  [<ffffffffc0090ff9>] drm_dev_unregister+0x29/0xb0
> [drm]
> [  138.682848]  [<ffffffffc0091673>] drm_put_dev+0x23/0x60 [drm]
> [  138.682854]  [<ffffffffc02dc315>] i915_pci_remove+0x15/0x20 [i915]
> [  138.682856]  [<ffffffff8141f409>] pci_device_remove+0x39/0xc0
> [  138.682859]  [<ffffffff814e3d61>]
> __device_release_driver+0xa1/0x150
> [  138.682860]  [<ffffffff814e4833>] driver_detach+0xa3/0xb0
> [  138.682862]  [<ffffffff814e3825>] bus_remove_driver+0x55/0xd0
> [  138.682864]  [<ffffffff814e4e2c>] driver_unregister+0x2c/0x50
> [  138.682866]  [<ffffffff8141db31>] pci_unregister_driver+0x21/0x90
> [  138.682871]  [<ffffffffc0092ec4>] drm_pci_exit+0x94/0xb0 [drm]
> [  138.682883]  [<ffffffffc0397404>] i915_exit+0x20/0xc1c [i915]
> 
> Reported-by: Marius Vlad <marius.c.vlad at intel.com>
> Signed-off-by: Gabriel Feceoru <gabriel.feceoru at intel.com>
> ---
>  drivers/gpu/drm/i915/i915_dma.c | 6 ++++++
>  1 file changed, 6 insertions(+)
> 
> diff --git a/drivers/gpu/drm/i915/i915_dma.c
> b/drivers/gpu/drm/i915/i915_dma.c
> index 988a380..08ad01f0 100644
> --- a/drivers/gpu/drm/i915/i915_dma.c
> +++ b/drivers/gpu/drm/i915/i915_dma.c
> @@ -1136,6 +1136,8 @@ int i915_driver_unload(struct drm_device *dev)
>  	struct drm_i915_private *dev_priv = dev->dev_private;
>  	int ret;
>  
> +	intel_runtime_pm_get(dev_priv);
> +
>  	intel_fbdev_fini(dev);
>  
>  	i915_audio_component_cleanup(dev_priv);
> @@ -1143,6 +1145,7 @@ int i915_driver_unload(struct drm_device *dev)
>  	ret = i915_gem_suspend(dev);
>  	if (ret) {
>  		DRM_ERROR("failed to idle hardware: %d\n", ret);
> +		intel_runtime_pm_put(dev_priv);

This should be made into goto construct.

>  		return ret;
>  	}
>  
> @@ -1221,6 +1224,9 @@ int i915_driver_unload(struct drm_device *dev)
>  	kmem_cache_destroy(dev_priv->vmas);
>  	kmem_cache_destroy(dev_priv->objects);
>  	pci_dev_put(dev_priv->bridge_dev);
> +
> +	intel_runtime_pm_put(dev_priv);
> +

Not sure if we should/can keep the runtime reference until this point.
At worst this could lead into the runtime_pm_put function poking at the
hardware registers after the pci_dev has been released.

Also if we change the hangcheck task to execute depending on the
runtime_pm count, this will surely cause trouble. Added Imre as CC to
comment on this.

>  	kfree(dev_priv);
> 
> 	return 0;

Insert goto label around here and make it "return ret;".

Regards, Joonas

> 
-- 
Joonas Lahtinen
Open Source Technology Center
Intel Corporation



More information about the Intel-gfx mailing list