[Nouveau] Kernel crash in 3.17-rc1 when loading nouveau on (non-POSTed) NV1A

Bruno Prémont bonbons at linux-vserver.org
Sun Aug 24 13:11:01 PDT 2014


On Sun, 24 August 2014 Bruno Prémont <bonbons at linux-vserver.org> wrote:
> System was booted with PCI graphics card first in VGA text mode.
> 
> Loading nouveau from there on causes the following BUG, after what
> kernel produces trace over trace until overflowing its stack.
> (trace captured via netconsole)
> 
> [  154.323717] wmi: Mapper loaded
> [  154.735793] nouveau 0000:02:00.0: enabling device (0004 -> 0006)
> [  154.743189] ACPI: PCI Interrupt Link [LNK5] enabled at IRQ 16
> [  154.754844] nouveau  [  DEVICE][0000:02:00.0] BOOT0  : 0x01a000b1
> [  154.761111] nouveau  [  DEVICE][0000:02:00.0] Chipset: nForce (NV1A)
> [  154.767534] nouveau  [  DEVICE][0000:02:00.0] Family : NV10
> [  154.773918] nouveau  [   VBIOS][0000:02:00.0] checking PRAMIN for image...
> [  154.832093] nouveau  [   VBIOS][0000:02:00.0] ... appears to be valid
> [  154.838624] nouveau  [   VBIOS][0000:02:00.0] using image from PRAMIN
> [  154.845146] nouveau  [   VBIOS][0000:02:00.0] BMP version 5.14
> [  154.851300] nouveau  [   VBIOS][0000:02:00.0] version 03.1a.01.03.00
> [  154.857785] nouveau W[   VBIOS][0000:02:00.0] DCB contains no useful data
> [  154.866214] nouveau W[   VBIOS][0000:02:00.0] DCB contains no useful data
> [  154.874456] nouveau W[   VBIOS][0000:02:00.0] DCB contains no useful data
> [  154.881386] nouveau W[   VBIOS][0000:02:00.0] DCB contains no useful data
> [  154.888274] nouveau  [ DEVINIT][0000:02:00.0] adaptor not initialised
> [  154.894789] nouveau  [   VBIOS][0000:02:00.0] running init tables
> [  155.060171] nouveau W[  PTIMER][0000:02:00.0] unknown input clock freq
> [  155.066831] nouveau  [     PFB][0000:02:00.0] RAM type: stolen system memory
> [  155.073960] nouveau  [     PFB][0000:02:00.0] RAM size: 32 MiB
> [  155.079857] nouveau  [     PFB][0000:02:00.0]    ZCOMP: 0 tags
> [  155.090902] nouveau  [     CLK][0000:02:00.0] --:   
> [  155.096002] ------------[ cut here ]------------
> [  155.100004] kernel BUG at /usr/src/linux-git/drivers/gpu/drm/nouveau/core/core/event.c:42!
> 
> This is a BUG_ON(!spin_is_locked(&event->refs_lock))
> 
> Is that a valid check for CONFIG_SMP=n?
> As far as I know spin_locks are NOOP on UP configs... and in the recent
> past that kind of test has been complained about on lkml.

It was maybe even on dri-devel I saw it:
  https://lkml.org/lkml/2014/8/11/4

> [  155.100004] invalid opcode: 0000 [#1] 
> [  155.100004] Modules linked in: nouveau(+) wmi ttm drm_kms_helper nfsv3 nfs_acl nfs lockd sunrpc
> [  155.100004] CPU: 0 PID: 15 Comm: kworker/0:1 Not tainted 3.17.0-rc1-jupiter-00002-gec30df4 #6
> [  155.100004] Hardware name: NVIDIA Corporation. nFORCE-MCP/MS-6373, BIOS 6.00 PG 04/12/2002
> [  155.100004] Workqueue: events nouveau_pstate_work [nouveau]
> [  155.100004] task: dd451c70 ti: dd5c8000 task.ti: dd5c8000
> [  155.100004] EIP: 0060:[<dea19e13>] EFLAGS: 00010046 CPU: 0
> [  155.100004] EIP is at nvkm_event_get+0x3/0x10 [nouveau]
> [  155.100004] EAX: dcd4c484 EBX: 00000286 ECX: 00000000 EDX: 00000001
> [  155.100004] ESI: 00000000 EDI: ffffffff EBP: dd5c9ea8 ESP: dd5c9ea8
> [  155.100004]  DS: 007b ES: 007b FS: 0000 GS: 00e0 SS: 0068
> [  155.100004] CR0: 8005003b CR2: b77d9000 CR3: 1ce83000 CR4: 000007d0
> [  155.100004] Stack:
> [  155.100004]  dd5c9eb4 dea1cc92 dd694d34 dd5c9f04 dea2e3b6 00000000 00000005 deae3e69
> [  155.100004]  ffffffff 00000001 ffffffff ffffffff ffffffff 00000000 00000000 dd451c70
> [  155.100004]  c05bc820 ffffffff 00000000 dd694c60 dd694d34 00000000 dd5b8070 dd5c9f44
> [  155.100004] Call Trace:
> [  155.100004]  [<dea1cc92>] nvkm_notify_get+0x32/0x40 [nouveau]
> [  155.100004]  [<dea2e3b6>] nouveau_pstate_work+0x396/0x3a0 [nouveau]
> [  155.100004]  [<c1047fa7>] process_one_work+0x1d7/0x340
> [  155.100004]  [<c10485bf>] worker_thread+0x2af/0x380
> [  155.100004]  [<c1048310>] ? rescuer_thread+0x1d0/0x1d0
> [  155.100004]  [<c1048310>] ? rescuer_thread+0x1d0/0x1d0
> [  155.100004]  [<c104bfa4>] kthread+0xa4/0xb0
> [  155.100004]  [<c14b6c00>] ret_from_kernel_thread+0x20/0x30
> [  155.100004]  [<c104bf00>] ? flush_kthread_worker+0x70/0x70
> [  155.100004] Code: 71 a9 e2 90 8d 74 26 00 83 c4 0c 5b 5e 5f 5d c3 66 90 66 90 66 90 66 90 55 89 e5 0f 0b 8d 74 26 00 8d bc 27 00 00 00 00 55 89 e5 <0f> 0b 8d 74 
> [  155.100004] EIP: [<dea19e13>] nvkm_event_get+0x3/0x10 [nouveau] SS:ESP 0068:dd5c9ea8
> [  155.100004] ---[ end trace 6142147b1d3fed4d ]---
> 
> things go downhill from here on.


More information about the Nouveau mailing list