[Bug 196117] amdgpu - RX 480 (polaris) - freeze during boot

bugzilla-daemon at bugzilla.kernel.org bugzilla-daemon at bugzilla.kernel.org
Mon Jun 19 08:17:38 UTC 2017


https://bugzilla.kernel.org/show_bug.cgi?id=196117

--- Comment #3 from Paul K. Gerke (paulkgerke at craftware.nl) ---
Ah, thank you! 

Really quick @Michel Dänzer:"Sounds like maybe some files under
/lib/firmware/amdgpu/ are missing in the initrd":

I thought the same, so I baked the important firmware blobs right into the
kernel, just in case there would be some issues. I will dig up the settings
that I used tonight when I have some more time.



Anyway, the other tip allowed me to extract the (Crash) logs using my serial
console. The machine freezes up just as usual after modprobing. I did the
following:

- Boot with: linux /vmlinuz... root=UUID=xxxx ro debug ignore_loglevel
modprobe.blacklist=amdgpu console=ttyUSB0,115200
- (I disabled all X-server functionality before so that it would not screw with
any graphics settings)
- I logged in and issued "sudo modprobe amdgpu", as suggested.
- I recorded the following logs on my second system.

[  279.264565] [drm] amdgpu kernel modesetting enabled.
[  279.271228] AMD IOMMUv2 driver by Joerg Roedel <jroedel at suse.de>
[  279.271248] AMD IOMMUv2 functionality not available on this system
[  279.279814] CRAT table not found
[  279.279837] Finished initializing topology ret=0
[  279.279877] kfd kfd: Initialized module
[  279.280153] checking generic (c0000000 760000) vs hw (c0000000 10000000)
[  279.280163] fb: switching to amdgpudrmfb from VESA VGA
[  279.280383] Console: switching to colour dummy device 80x25
[  279.280941] [drm] initializing kernel modesetting (POLARIS10 0x1002:0x67DF
0x1043:0x0505 0xC7).
[  279.280960] [drm] register mmio base: 0xDFFC0000
[  279.280962] [drm] register mmio size: 262144
[  279.280968] [drm] doorbell mmio base: 0xDEA00000
[  279.280971] [drm] doorbell mmio size: 2097152
[  279.280983] [drm] probing gen 2 caps for device 10de:778 = 313d02/0
[  279.280989] [drm] probing mlw for device 10de:778 = 313d02
[  279.280997] [drm] UVD is enabled in VM mode
[  279.280999] [drm] VCE enabled in VM mode
[  279.303168] [drm] BIOS signature incorrect 1 1
[  279.303174] amdgpu 0000:02:00.0: Invalid PCI ROM header signature: expecting
0xaa55, got 0x0000
[  279.303781] ATOM BIOS: 67DFHB.15.50.0.0.AS18
[  279.303792] [drm] GPU post is not needed
[  279.304306] amdgpu 0000:02:00.0: VRAM: 8192M 0x0000000000000000 -
0x00000001FFFFFFFF (8192M used)
[  279.304310] amdgpu 0000:02:00.0: GTT: 8192M 0x0000000200000000 -
0x00000003FFFFFFFF
[  279.304314] [drm] Detected VRAM RAM=8192M, BAR=256M
[  279.304316] [drm] RAM width 256bits GDDR5
[  279.304414] [TTM] Zone  kernel: Available graphics memory: 8214392 kiB
[  279.304416] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
[  279.304418] [TTM] Initializing pool allocator
[  279.304423] [TTM] Initializing DMA pool allocator
[  279.304457] [drm] amdgpu: 8192M of VRAM memory ready
[  279.304459] [drm] amdgpu: 8192M of GTT memory ready.
[  279.304468] [drm] GART: num cpu pages 2097152, num gpu pages 2097152
[  279.305642] [drm] PCIE GART of 8192M enabled (table at 0x0000000000040000).
[  279.305653] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
[  279.305654] [drm] Driver supports precise vblank timestamp query.
[  279.305687] amdgpu 0000:02:00.0: amdgpu: using MSI.
[  279.305705] [drm] amdgpu: irq initialized.
[  279.305728] amdgpu: [powerplay] amdgpu: powerplay sw initialized
[  279.306759] [drm] AMDGPU Display Connectors
[  279.306762] [drm] Connector 0:
[  279.306765] [drm]   DP-1
[  279.306766] [drm]   HPD1
[  279.306770] [drm]   DDC: 0x486c 0x486c 0x486d 0x486d 0x486e 0x486e 0x486f
0x486f
[  279.306772] [drm]   Encoders:
[  279.306774] [drm]     DFP1: INTERNAL_UNIPHY1
[  279.306776] [drm] Connector 1:
[  279.306777] [drm]   DP-2
[  279.306780] [drm]   HPD5
[  279.306782] [drm]   DDC: 0x4874 0x4874 0x4875 0x4875 0x4876 0x4876 0x4877
0x4877
[  279.306783] [drm]   Encoders:
[  279.306785] [drm]     DFP2: INTERNAL_UNIPHY1
[  279.306786] [drm] Connector 2:
[  279.306788] [drm]   HDMI-A-1
[  279.306791] [drm]   HPD6
[  279.306793] [drm]   DDC: 0x4868 0x4868 0x4869 0x4869 0x486a 0x486a 0x486b
0x486b
[  279.306795] [drm]   Encoders:
[  279.306797] [drm]     DFP3: INTERNAL_UNIPHY2
[  279.306798] [drm] Connector 3:
[  279.306800] [drm]   HDMI-A-2
[  279.306802] [drm]   HPD4
[  279.306803] [drm]   DDC: 0x4870 0x4870 0x4871 0x4871 0x4872 0x4872 0x4873
0x4873
[  279.306804] [drm]   Encoders:
[  279.306806] [drm]     DFP4: INTERNAL_UNIPHY2
[  279.306809] [drm] Connector 4:
[  279.306812] [drm]   DVI-D-1
[  279.306813] [drm]   HPD3
[  279.306816] [drm]   DDC: 0x487c 0x487c 0x487d 0x487d 0x487e 0x487e 0x487f
0x487f
[  279.306818] [drm]   Encoders:
[  279.306820] [drm]     DFP5: INTERNAL_UNIPHY
[  279.306870] amdgpu 0000:02:00.0: fence driver on ring 0 use gpu addr
0x0000000200000008, cpu addr 0xffff8a3867e70008
[  279.307375] amdgpu 0000:02:00.0: fence driver on ring 1 use gpu addr
0x0000000200000018, cpu addr 0xffff8a3867e70018
[  279.307435] amdgpu 0000:02:00.0: fence driver on ring 2 use gpu addr
0x0000000200000028, cpu addr 0xffff8a3867e70028
[  279.307478] amdgpu 0000:02:00.0: fence driver on ring 3 use gpu addr
0x0000000200000038, cpu addr 0xffff8a3867e70038
[  279.307513] amdgpu 0000:02:00.0: fence driver on ring 4 use gpu addr
0x0000000200000048, cpu addr 0xffff8a3867e70048
[  279.307541] amdgpu 0000:02:00.0: fence driver on ring 5 use gpu addr
0x0000000200000058, cpu addr 0xffff8a3867e70058
[  279.307727] amdgpu 0000:02:00.0: fence driver on ring 6 use gpu addr
0x0000000200000068, cpu addr 0xffff8a3867e70068
[  279.308100] amdgpu 0000:02:00.0: fence driver on ring 7 use gpu addr
0x0000000200000078, cpu addr 0xffff8a3867e70078
[  279.308152] amdgpu 0000:02:00.0: fence driver on ring 8 use gpu addr
0x0000000200000088, cpu addr 0xffff8a3867e70088
[  279.308961] amdgpu 0000:02:00.0: fence driver on ring 9 use gpu addr
0x0000000200000098, cpu addr 0xffff8a3867e70098
[ [  279.348966] amdgpu: [powerplay] [AVFS] Something is broken. See log!
[  279.711657] amdgpu: [powerplay]
[  279.711657]  failed to send message 254 ret is 0
[  279.711681] amdgpu: [powerplay] Can't find requested voltage id in
vdd_dep_on_sclk table!
[  279.854919] amdgpu: [powerplay] DPM is already running
[  280.134978] clocksource: timekeeping watchdog on CPU2: Marking clocksource
'tsc' as unstable because the skew is too large:
[  280.135000] clocksource:                       'hpet' wd_now: a19f4644
wd_last: a0b4da18 mask: ffffffff
[  280.135005] clocksource:                       'tsc' cs_now: 1133a962472
cs_last: 112d67312e4 mask: ffffffffffffffff
[  280.135011] sched_clock: Marking unstable (279910536262,
224436834)<-(280245936976, -110963880)
[  280.135028] tsc: Marking TSC unstable due to clocksource watchdog
[  283.495299] clocksource: Switched to clocksource hpet
[  283.495410] amdgpu: [powerplay] SMC address must be 4 byte aligned.
[  283.495417] amdgpu: [powerplay] Failed to initialize Graphics Level!
[  283.495423] amdgpu: [powerplay] Failed to initialize SMC table!
[  283.635476] amdgpu: [powerplay] Failed to enable VR hot GPIO interrupt!
[  284.195615] amdgpu: [powerplay] Failed to enable ULV!
[  284.335700] amdgpu: [powerplay] Attempt to enable Master Deep Sleep switch
failed!
[  284.335719] amdgpu: [powerplay] Failed to enable deep sleep master switch!


- I spot a "Something is broken", so something seems fishy.

- Note: I do not have time right now to process the log since I have to go to
work.

-- 
You are receiving this mail because:
You are watching the assignee of the bug.


More information about the dri-devel mailing list