[Bug 83201] New: CPU soft lockups in nouveau under load

bugzilla-daemon at bugzilla.kernel.org bugzilla-daemon at bugzilla.kernel.org
Mon Aug 25 09:44:12 PDT 2014


https://bugzilla.kernel.org/show_bug.cgi?id=83201

            Bug ID: 83201
           Summary: CPU soft lockups in nouveau under load
           Product: Drivers
           Version: 2.5
    Kernel Version: 3.17.0-rc1-00231-g7be141d
          Hardware: x86-64
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri at kernel-bugs.osdl.org
          Reporter: ted at midg3t.net
        Regression: No

Created attachment 148071
  --> https://bugzilla.kernel.org/attachment.cgi?id=148071&action=edit
Full boot/run log

I'm seeing a ton of CPU soft lockups in 3.17.0-rc1-00231-g7be141d when building
a kernel (make -j8). It seems to lock up hard enough that I have to hard power
off.

Sorry if this is the wrong component. I guessed that this is a bug in nouveau.

Full log attached.

Aug 25 10:30:27 slctperciva6520 kernel: [  367.753110] NMI watchdog: BUG: soft
lockup - CPU#2 stuck for 22s! [Xorg:4775]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753114] Modules linked in: bnep
rfcomm binfmt_misc uinput nfsd auth_rpcgss oid_registry nfs_acl nfs lockd
fscache sunrpc loop hid_generic usbhid hid x86_pkg_temp_thermal
snd_hda_codec_hdmi ecb coretemp kvm_intel kvm btusb bluetooth
ghash_clmulni_intel snd_hda_codec_idt snd_hda_codec_generic joydev aesni_intel
snd_hda_intel snd_hda_controller arc4 snd_hda_codec aes_x86_64 brcmsmac cordic
brcmutil b43 ablk_helper snd_hwdep cryptd snd_pcm lrw gf128mul snd_seq
snd_timer snd_seq_device snd soundcore ehci_pci glue_helper mac80211 cfg80211
nouveau ssb rng_core pcmcia pcmcia_core mxm_wmi ttm dell_wmi sparse_keymap
dell_laptop rfkill ehci_hcd bcma drm_kms_helper drm i2c_algo_bit wmi psmouse
usbcore iTCO_wdt iTCO_vendor_support tpm_tis i2c_i801 lpc_ich mfd_core i2ccore
evdev serio_raw usb_common dcdbas acpi_cpufreq tpm battery processor video ac
button ext4 crc16 jbd2 mbcache sg sd_mod sr_mod crc_t10dif cdrom
crct10dif_common crc32c_intel microcode ahci libahci libata scsi_mod firewir
Aug 25 10:30:27 slctperciva6520 kernel: e_ohci sdhci_pci sdhci mmc_core
firewire_core crc_itu_t thermal thermal_sys e1000e ptp pps_core
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753181] CPU: 2 PID: 4775 Comm:
Xorg Tainted: G        W    L 3.17.0-rc1-00231-g7be141d #4
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753183] Hardware name: Dell Inc.
Latitude E6520/0692FT, BIOS A13 05/17/2012
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753185] task: ffff8800ce2d7750
ti: ffff880222c28000 task.ti: ffff880222c28000
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753187] RIP:
0010:[<ffffffff8109e30a>]  [<ffffffff8109e30a>] csd_lock_wait.isra.1+0x7/0xa
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753193] RSP:
0018:ffff880222c2ba90  EFLAGS: 00000202
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753194] RAX: 0000000000000003
RBX: ffff88022dc54d48 RCX: 0000000000000002
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753196] RDX: ffff88022dc77b98
RSI: fffffffffffffffc RDI: ffff88022dc77bb0
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753197] RBP: 0000000000000003
R08: ffff88022dc54d48 R09: 0000000000000000
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753199] R10: 0000000000000008
R11: ffff8800ced85d80 R12: 0000000000000002
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753200] R13: 00000002000c0000
R14: ffff88022dc0de00 R15: 0000000000000296
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753202] FS: 
00007f8b97f35880(0000) GS:ffff88022dc40000(0000) knlGS:0000000000000000
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753204] CS:  0010 DS: 0000 ES:
0000 CR0: 0000000080050033
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753205] CR2: 00007f8b90560000
CR3: 0000000223ae7000 CR4: 00000000000407e0
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753206] Stack:
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753208]  ffffffff8109e8c2
ffff88022364c800 ffffffff00000007 0000000000000007
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753210]  ffffffff810452d7
ffff88022364ca08 ffffffff810452d7 0000000000000000
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753212]  0000000000000001
ffff880222c2bbe8 ffff880222c2bbf0 0000000000000000
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753215] Call Trace:
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753219]  [<ffffffff8109e8c2>] ?
smp_call_function_many+0x1e3/0x21a
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753223]  [<ffffffff810452d7>] ?
leave_mm+0x9a/0x9a
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753225]  [<ffffffff810452d7>] ?
leave_mm+0x9a/0x9a
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753228]  [<ffffffff8109e914>] ?
smp_call_function+0x1b/0x1f
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753231]  [<ffffffff8109e940>] ?
on_each_cpu+0x12/0x3a
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753233]  [<ffffffff810455df>] ?
flush_tlb_kernel_range+0x50/0x55
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753238]  [<ffffffff813d0f4d>] ?
_raw_spin_trylock+0x5/0x13
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753241]  [<ffffffff8110ea6f>] ?
__purge_vmap_area_lazy+0x2ea/0x351
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753244]  [<ffffffff811fc314>] ?
__bitmap_weight+0x27/0x58
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753248]  [<ffffffff8110ef34>] ?
free_vmap_area_noflush+0x4f/0x55
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753252]  [<ffffffff8110fc87>] ?
remove_vm_area+0x53/0x67
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753254]  [<ffffffff8110fdd2>] ?
__vunmap+0xb1/0xc4
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753260]  [<ffffffffa02e8c18>] ?
ttm_dma_tt_fini+0x30/0x4a [ttm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753281]  [<ffffffffa0387f6b>] ?
nouveau_sgdma_destroy+0xe/0x19 [nouveau]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753286]  [<ffffffffa02e9091>] ?
ttm_bo_cleanup_memtype_use+0x36/0x5a [ttm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753291]  [<ffffffffa02e9e6e>] ?
ttm_bo_release+0xe4/0x1c2 [ttm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753296]  [<ffffffffa02e9d8a>] ?
ttm_bo_delayed_workqueue+0x21/0x21 [ttm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753300]  [<ffffffffa02e9030>] ?
kref_sub+0x32/0x3c [ttm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753318]  [<ffffffffa038a8b2>] ?
nouveau_gem_object_del+0x50/0x56 [nouveau]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753324]  [<ffffffffa0261b79>] ?
drm_gem_object_unreference_unlocked+0x38/0x55 [drm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753331]  [<ffffffffa0261d26>] ?
drm_gem_handle_delete+0xa4/0xb3 [drm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753338]  [<ffffffffa02627cb>] ?
drm_ioctl+0x288/0x3e3 [drm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753342]  [<ffffffff81111e65>] ?
free_pages_and_swap_cache+0x45/0x5b
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753351]  [<ffffffffa02621c6>] ?
drm_gem_handle_create+0x37/0x37 [drm]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753356]  [<ffffffff811026b5>] ?
tlb_finish_mmu+0xb/0x2f
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753360]  [<ffffffff8111fb34>] ?
__cache_free.isra.45+0x1e8/0x1f7
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753364]  [<ffffffff813d1016>] ?
_raw_spin_unlock_irqrestore+0xc/0xd
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753382]  [<ffffffffa038505a>] ?
nouveau_drm_ioctl+0x74/0xa7 [nouveau]
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753385]  [<ffffffff811399d4>] ?
do_vfs_ioctl+0x3ed/0x436
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753389]  [<ffffffff8106cc36>] ?
vtime_account_user+0x35/0x40
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753392]  [<ffffffff810e23b0>] ?
context_tracking_user_exit+0x48/0xa3
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753395]  [<ffffffff81139a66>] ?
SyS_ioctl+0x49/0x77
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753397]  [<ffffffff813d16d8>] ?
tracesys+0x7e/0xe2
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753399]  [<ffffffff813d1737>] ?
tracesys+0xdd/0xe2
Aug 25 10:30:27 slctperciva6520 kernel: [  367.753400] Code: 90 66 66 90 c3 c3
f3 48 0f b8 c7 c3 ff c7 50 48 89 f0 48 63 d7 be 00 02 00 00 48 89 c7 e8 27 fb
15 00 5a c3 eb 02 f3 90 f6 07 01 <75> f9 c3 41 57 31 c0 49 89 d7 41 56 49 89 ce
b9 08 00 00 00 41

-- 
You are receiving this mail because:
You are watching the assignee of the bug.


More information about the dri-devel mailing list