[Bug 65761] HD 7970M Hybrid - hangs and errors and rmmod causes crash

bugzilla-daemon at bugzilla.kernel.org bugzilla-daemon at bugzilla.kernel.org
Sat Nov 30 04:12:47 PST 2013


https://bugzilla.kernel.org/show_bug.cgi?id=65761

--- Comment #3 from Christoph Haag <haagch.christoph at googlemail.com> ---
Created attachment 116871
  --> https://bugzilla.kernel.org/attachment.cgi?id=116871&action=edit
dmesg from 3.13-rc2 with rmmod radeon in line 2095

Having enough time to test everything would be nice.

But in the meantime I have only tested 3.13-rc2. With defaults the issues are
still there (I have not yet checked whether mentioned commits made it into
rc2).

Now I have some more messsages in dmesg after rmmod'ing radeon. This time it
didn't hang after all that, but rebooting didn't work for some reason. Maybe
it's just me but I'd like to have proper unloading to have the option to switch
between radeon and fglrx without rebooting. :)

Anyway with runpm=0 I don't get any errors so dpm does really work fine. On an
unrelated note: WOW, has radeonsi improved in performance! Much better than
fglrx with "official" hybrid support in some applications at least.

Unfortunately using runpm=0 and using vgaswitcheroo manually is not a viable
workaround because of https://bugzilla.kernel.org/show_bug.cgi?id=51381

But with runpm=0 there still is a similar (?) problem when rmmod'ing radeon:

[ 1897.087151] [drm] radeon: finishing device.
[ 1897.518615] ------------[ cut here ]------------
[ 1897.518634] WARNING: CPU: 4 PID: 2539 at drivers/gpu/drm/drm_mm.c:578
drm_mm_takedown+0x2e/0x30 [drm]()
[ 1897.518635] Memory manager not clean during takedown.
[ 1897.518636] Modules linked in: bnep bluetooth iTCO_wdt iTCO_vendor_support
arc4 iwldvm snd_hda_codec_hdmi mac80211 snd_hda_codec_realtek joydev
x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul
crct10dif_common crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw
gf128mul glue_helper ablk_helper cryptd snd_hda_intel microcode snd_hda_codec
iwlwifi snd_hwdep psmouse cfg80211 snd_pcm serio_raw pcspkr snd_page_alloc
radeon(-) rtsx_pci_ms r8169 snd_timer memstick rfkill lpc_ich mii snd i2c_i801
soundcore ttm wmi thermal mei_me mei shpchp processor battery ac evdev nfs
lockd sunrpc fscache fuse ext4 crc16 mbcache jbd2 sr_mod cdrom sd_mod
hid_generic usbhid hid rtsx_pci_sdmmc mmc_core ahci i915 libahci crc32c_intel
i2c_algo_bit libata intel_agp firewire_ohci intel_gtt
[ 1897.518659]  firewire_core crc_itu_t drm_kms_helper ehci_pci ehci_hcd
xhci_hcd scsi_mod rtsx_pci drm usbcore i2c_core usb_common video button
[ 1897.518665] CPU: 4 PID: 2539 Comm: rmmod Not tainted 3.13.0-1-mainline #1
[ 1897.518666] Hardware name: CLEVO                             P170EM/P170EM,
BIOS 4.6.5 08/22/2012
[ 1897.518667]  0000000000000009 ffff8807d4b47c20 ffffffff814f5570
ffff8807d4b47c68
[ 1897.518669]  ffff8807d4b47c58 ffffffff81061bad ffff8807fef616c0
ffff8807fef61768
[ 1897.518671]  ffff8808029a47b0 0000000000000000 0000000001ce5090
ffff8807d4b47cb8
[ 1897.518673] Call Trace:
[ 1897.518677]  [<ffffffff814f5570>] dump_stack+0x4d/0x6f
[ 1897.518681]  [<ffffffff81061bad>] warn_slowpath_common+0x7d/0xa0
[ 1897.518683]  [<ffffffff81061c1c>] warn_slowpath_fmt+0x4c/0x50
[ 1897.518688]  [<ffffffffa05644b8>] ? ttm_bo_man_takedown+0x48/0x70 [ttm]
[ 1897.518693]  [<ffffffffa00c2f6e>] drm_mm_takedown+0x2e/0x30 [drm]
[ 1897.518696]  [<ffffffffa05644a8>] ttm_bo_man_takedown+0x38/0x70 [ttm]
[ 1897.518699]  [<ffffffffa055ebf9>] ttm_bo_clean_mm+0x49/0x80 [ttm]
[ 1897.518709]  [<ffffffffa05eebcd>] radeon_ttm_fini+0xbd/0x190 [radeon]
[ 1897.518716]  [<ffffffffa05ef662>] radeon_bo_fini+0x12/0x20 [radeon]
[ 1897.518727]  [<ffffffffa06570c1>] si_fini+0xc1/0x100 [radeon]
[ 1897.518733]  [<ffffffffa05d404e>] radeon_device_fini+0x3e/0x120 [radeon]
[ 1897.518739]  [<ffffffffa05d5f7e>] radeon_driver_unload_kms+0x4e/0x70
[radeon]
[ 1897.518744]  [<ffffffffa00bed4c>] drm_dev_unregister+0x2c/0xe0 [drm]
[ 1897.518749]  [<ffffffffa00bee8b>] drm_put_dev+0x3b/0x70 [drm]
[ 1897.518754]  [<ffffffffa05d229d>] radeon_pci_remove+0x1d/0x20 [radeon]
[ 1897.518756]  [<ffffffff812c66cb>] pci_device_remove+0x3b/0xb0
[ 1897.518759]  [<ffffffff8138caaf>] __device_release_driver+0x7f/0xf0
[ 1897.518761]  [<ffffffff8138d438>] driver_detach+0xb8/0xc0
[ 1897.518763]  [<ffffffff8138c6e5>] bus_remove_driver+0x55/0xd0
[ 1897.518765]  [<ffffffff8138daac>] driver_unregister+0x2c/0x50
[ 1897.518767]  [<ffffffff812c60a9>] pci_unregister_driver+0x29/0x90
[ 1897.518772]  [<ffffffffa00c10e8>] drm_pci_exit+0x98/0xa0 [drm]
[ 1897.518778]  [<ffffffffa06a35af>] radeon_exit+0x17/0x1e [radeon]
[ 1897.518780]  [<ffffffff810d6682>] SyS_delete_module+0x172/0x240
[ 1897.518783]  [<ffffffff81014e5c>] ? do_notify_resume+0x8c/0xa0
[ 1897.518785]  [<ffffffff815039ed>] system_call_fastpath+0x1a/0x1f
[ 1897.518786] ---[ end trace ba1fe37dd4719714 ]---
[ 1897.518790] [TTM] Finalizing pool allocator
[ 1897.518793] [TTM] Finalizing DMA pool allocator
[ 1897.518797] ------------[ cut here ]------------
[ 1897.518799] WARNING: CPU: 4 PID: 2539 at
drivers/gpu/drm/ttm/ttm_page_alloc_dma.c:534 ttm_dma_free_pool+0x12b/0x130
[ttm]()
[ 1897.518800] Modules linked in: bnep bluetooth iTCO_wdt iTCO_vendor_support
arc4 iwldvm snd_hda_codec_hdmi mac80211 snd_hda_codec_realtek joydev
x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul
crct10dif_common crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw
gf128mul glue_helper ablk_helper cryptd snd_hda_intel microcode snd_hda_codec
iwlwifi snd_hwdep psmouse cfg80211 snd_pcm serio_raw pcspkr snd_page_alloc
radeon(-) rtsx_pci_ms r8169 snd_timer memstick rfkill lpc_ich mii snd i2c_i801
soundcore ttm wmi thermal mei_me mei shpchp processor battery ac evdev nfs
lockd sunrpc fscache fuse ext4 crc16 mbcache jbd2 sr_mod cdrom sd_mod
hid_generic usbhid hid rtsx_pci_sdmmc mmc_core ahci i915 libahci crc32c_intel
i2c_algo_bit libata intel_agp firewire_ohci intel_gtt
[ 1897.518829]  firewire_core crc_itu_t drm_kms_helper ehci_pci ehci_hcd
xhci_hcd scsi_mod rtsx_pci drm usbcore i2c_core usb_common video button
[ 1897.518843] CPU: 4 PID: 2539 Comm: rmmod Tainted: G        W   
3.13.0-1-mainline #1
[ 1897.518844] Hardware name: CLEVO                             P170EM/P170EM,
BIOS 4.6.5 08/22/2012
[ 1897.518844]  0000000000000009 ffff8807d4b47c18 ffffffff814f5570
0000000000000000
[ 1897.518846]  ffff8807d4b47c50 ffffffff81061bad ffff880801a31e40
ffff8808032a2290
[ 1897.518848]  0000000000000008 0000000000000000 0000000001ce5090
ffff8807d4b47c60
[ 1897.518850] Call Trace:
[ 1897.518852]  [<ffffffff814f5570>] dump_stack+0x4d/0x6f
[ 1897.518854]  [<ffffffff81061bad>] warn_slowpath_common+0x7d/0xa0
[ 1897.518856]  [<ffffffff81061c8a>] warn_slowpath_null+0x1a/0x20
[ 1897.518861]  [<ffffffffa0564c9b>] ttm_dma_free_pool+0x12b/0x130 [ttm]
[ 1897.518863]  [<ffffffffa0565dee>] ttm_dma_page_alloc_fini+0x8e/0x104 [ttm]
[ 1897.518867]  [<ffffffffa055b549>] ttm_mem_global_release+0x19/0x90 [ttm]
[ 1897.518875]  [<ffffffffa05ed892>] radeon_ttm_mem_global_release+0x12/0x20
[radeon]
[ 1897.518879]  [<ffffffffa00d06c3>] drm_global_item_unref+0x53/0x80 [drm]
[ 1897.518887]  [<ffffffffa05eebfe>] radeon_ttm_fini+0xee/0x190 [radeon]
[ 1897.518894]  [<ffffffffa05ef662>] radeon_bo_fini+0x12/0x20 [radeon]
[ 1897.518904]  [<ffffffffa06570c1>] si_fini+0xc1/0x100 [radeon]
[ 1897.518910]  [<ffffffffa05d404e>] radeon_device_fini+0x3e/0x120 [radeon]
[ 1897.518917]  [<ffffffffa05d5f7e>] radeon_driver_unload_kms+0x4e/0x70
[radeon]
[ 1897.518922]  [<ffffffffa00bed4c>] drm_dev_unregister+0x2c/0xe0 [drm]
[ 1897.518926]  [<ffffffffa00bee8b>] drm_put_dev+0x3b/0x70 [drm]
[ 1897.518932]  [<ffffffffa05d229d>] radeon_pci_remove+0x1d/0x20 [radeon]
[ 1897.518934]  [<ffffffff812c66cb>] pci_device_remove+0x3b/0xb0
[ 1897.518936]  [<ffffffff8138caaf>] __device_release_driver+0x7f/0xf0
[ 1897.518938]  [<ffffffff8138d438>] driver_detach+0xb8/0xc0
[ 1897.518940]  [<ffffffff8138c6e5>] bus_remove_driver+0x55/0xd0
[ 1897.518942]  [<ffffffff8138daac>] driver_unregister+0x2c/0x50
[ 1897.518945]  [<ffffffff812c60a9>] pci_unregister_driver+0x29/0x90
[ 1897.518951]  [<ffffffffa00c10e8>] drm_pci_exit+0x98/0xa0 [drm]
[ 1897.518956]  [<ffffffffa06a35af>] radeon_exit+0x17/0x1e [radeon]
[ 1897.518958]  [<ffffffff810d6682>] SyS_delete_module+0x172/0x240
[ 1897.518960]  [<ffffffff81014e5c>] ? do_notify_resume+0x8c/0xa0
[ 1897.518962]  [<ffffffff815039ed>] system_call_fastpath+0x1a/0x1f
[ 1897.518963] ---[ end trace ba1fe37dd4719715 ]---
[ 1897.519005] [TTM] Zone  kernel: Used memory at exit: 13 kiB
[ 1897.519007] [TTM] Zone   dma32: Used memory at exit: 9 kiB
[ 1897.519009] [drm] radeon: ttm finalized
[ 1897.519012] vga_switcheroo: disabled
[ 1897.519212] [drm] Module unloaded

Fortunately all the other problems are not happening.



Another problem that also persists is starting X early (kdm with systemd). The
display just hangs with the _ cursor in the upper left, no error messages
anywhere to be found. Killing X leaves the display hanging there and only
starting another X (presumably reinitializing the display output) releases the
hanging and goes back to normal. This only happens with radeon enabled with and
without runpm but it might rather be a problem in X or maybe even the intel
driver. Anyway, not starting X directly at boot but waiting a few seconds
before starting it works.

-- 
You are receiving this mail because:
You are watching the assignee of the bug.


More information about the dri-devel mailing list