[short question for proper bug report] RX580 Nitro+ / 4.11 boot message 'Invalid PCI ROM header signature'

Alex Deucher alexdeucher at gmail.com
Mon May 8 16:22:41 UTC 2017


On Mon, May 8, 2017 at 11:25 AM, Dieter Nützel <Dieter at nuetzel-hh.de> wrote:
> Hello Alex,
>
> got my Sapphire Nitro+ RX 580, 8 GB (of course) up and _flying_. ;-)
> Sadly 'only' with PCIe 2.0 x8 (cut server mobo slot).
> With kernel 4.11 I have the following in dmesg:
>
> [    3.714150] [drm] amdgpu kernel modesetting enabled.
> [    3.721233] fb: switching to amdgpudrmfb from VESA VGA
> [    3.835926] amdgpu 0000:01:00.0: Invalid PCI ROM header signature:
> expecting 0xaa55, got 0xffff

Harmless.
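
For readers wondering what the message is comparing: a PCI expansion ROM
image starts with the bytes 0x55 0xAA, which read as a little-endian
16-bit value give the 0xaa55 the kernel expects, and a read that returns
all-ones bytes decodes to 0xffff.  A minimal sketch of that comparison
follows.  The helper names are hypothetical and this is not the kernel
code, just an illustration of the check.

```python
import struct

# Value the kernel message says it expects at the start of the ROM.
PCI_ROM_SIGNATURE = 0xAA55


def rom_signature(image: bytes) -> int:
    """Decode the first two bytes of a ROM image as a little-endian u16."""
    if len(image) < 2:
        raise ValueError("ROM image shorter than 2 bytes")
    (signature,) = struct.unpack_from("<H", image, 0)
    return signature


def has_valid_signature(image: bytes) -> bool:
    """True if the image starts with the expected 0x55 0xAA byte pair."""
    return rom_signature(image) == PCI_ROM_SIGNATURE


# Made-up sample data: one image with the signature, one that is all ones.
valid_image = bytes([0x55, 0xAA]) + bytes(30)
all_ones_image = b"\xff" * 32

print(hex(rom_signature(valid_image)), has_valid_signature(valid_image))
print(hex(rom_signature(all_ones_image)), has_valid_signature(all_ones_image))
```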

> [    3.836424] amdgpu 0000:01:00.0: VRAM: 8192M 0x0000000000000000 -
> 0x00000001FFFFFFFF (8192M used)
> [    3.836426] amdgpu 0000:01:00.0: GTT: 8192M 0x0000000200000000 -
> 0x00000003FFFFFFFF
> [    3.836545] [drm] amdgpu: 8192M of VRAM memory ready
> [    3.836546] [drm] amdgpu: 8192M of GTT memory ready.
> [    3.837817] amdgpu 0000:01:00.0: amdgpu: using MSI.
> [    3.837835] [drm] amdgpu: irq initialized.
> [    3.837860] amdgpu: [powerplay] amdgpu: powerplay sw initialized
> [    3.838567] amdgpu 0000:01:00.0: fence driver on ring 0 use gpu addr
> 0x0000000200000008, cpu addr 0xffff8e4c22198008
> [    3.838752] amdgpu 0000:01:00.0: fence driver on ring 1 use gpu addr
> 0x0000000200000018, cpu addr 0xffff8e4c22198018
> [    3.838805] amdgpu 0000:01:00.0: fence driver on ring 2 use gpu addr
> 0x0000000200000028, cpu addr 0xffff8e4c22198028
> [    3.839778] amdgpu 0000:01:00.0: fence driver on ring 3 use gpu addr
> 0x0000000200000038, cpu addr 0xffff8e4c22198038
> [    3.839834] amdgpu 0000:01:00.0: fence driver on ring 4 use gpu addr
> 0x0000000200000048, cpu addr 0xffff8e4c22198048
> [    3.839881] amdgpu 0000:01:00.0: fence driver on ring 5 use gpu addr
> 0x0000000200000058, cpu addr 0xffff8e4c22198058
> [    3.839928] amdgpu 0000:01:00.0: fence driver on ring 6 use gpu addr
> 0x0000000200000068, cpu addr 0xffff8e4c22198068
> [    3.839972] amdgpu 0000:01:00.0: fence driver on ring 7 use gpu addr
> 0x0000000200000078, cpu addr 0xffff8e4c22198078
> [    3.840024] amdgpu 0000:01:00.0: fence driver on ring 8 use gpu addr
> 0x0000000200000088, cpu addr 0xffff8e4c22198088
> [    3.840069] amdgpu 0000:01:00.0: fence driver on ring 9 use gpu addr
> 0x0000000200000098, cpu addr 0xffff8e4c22198098
> [    3.840188] amdgpu 0000:01:00.0: fence driver on ring 10 use gpu addr
> 0x00000002000000a8, cpu addr 0xffff8e4c221980a8
> [    3.840233] amdgpu 0000:01:00.0: fence driver on ring 11 use gpu addr
> 0x00000002000000b8, cpu addr 0xffff8e4c221980b8
> [    3.841051] amdgpu 0000:01:00.0: fence driver on ring 12 use gpu addr
> 0x0000000001165420, cpu addr 0xffffa244c545a420
> [    3.841276] amdgpu 0000:01:00.0: fence driver on ring 13 use gpu addr
> 0x00000002000000d8, cpu addr 0xffff8e4c221980d8
> [    3.841316] amdgpu 0000:01:00.0: fence driver on ring 14 use gpu addr
> 0x00000002000000e8, cpu addr 0xffff8e4c221980e8
> [    3.883443] amdgpu: [powerplay] [AVFS] Something is broken. See log!
> [    3.885296] amdgpu: [powerplay] Can't find requested voltage id in
> vdd_dep_on_sclk table!

Also harmless.

> [    4.168877] fbcon: amdgpudrmfb (fb0) is primary device
> [    4.622128] amdgpu 0000:01:00.0: fb0: amdgpudrmfb frame buffer device
> [    4.638924] [drm] Initialized amdgpu 3.10.0 20150101 for 0000:01:00.0 on
> minor 0
>
>
> NO HDMI sound as expected, need your wip branch (which version?).

https://cgit.freedesktop.org/~agd5f/linux/log/?h=amd-staging-4.9
or if you want something based on newer upstream code:
https://cgit.freedesktop.org/~hwentland/linux/log/?h=amd-staging-dc-drm-next

> The (VM) speed regression (need your wip branch, too or 4.12-rcx).

What is the VM speed regression?

> NO 'Zero core' but GREAT temperature.
>
> IDLE:
> amdgpu-pci-0100
> Adapter: PCI adapter
> fan1:         853 RPM
> temp1:        +27.0°C  (crit =  +0.0°C, hyst =  +0.0°C)
>
> Where should we put the RX5xx strings?

Unfortunately, there's no way to put them in the PCI IDs database,
because that database only tracks PCI device IDs.  It does not
track PCI revision IDs.  The marketing names are based on a
combination of device ID and revision ID.  E.g., an RX480 and an RX580
have the same device ID, but different revision IDs.
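
To illustrate the point, here is a rough sketch of a name lookup keyed on
the (device ID, revision ID) pair.  Revision 0xe7 is taken from the lspci
output quoted below.  The device ID 0x67df and the RX480 revision 0xc7 are
assumptions added for the example, and the table and function names are
hypothetical, not an existing API.

```python
from typing import Dict, List, Optional, Tuple

# Keys are (PCI device ID, PCI revision ID).
# 0xE7 comes from the lspci output in this thread; 0x67DF and 0xC7 are
# assumed values used only to make the example concrete.
MARKETING_NAMES: Dict[Tuple[int, int], str] = {
    (0x67DF, 0xC7): "Radeon RX 480",
    (0x67DF, 0xE7): "Radeon RX 580",
}


def marketing_name(device_id: int, revision_id: int) -> Optional[str]:
    """Look up a name using both IDs; None if the pair is unknown."""
    return MARKETING_NAMES.get((device_id, revision_id))


def names_by_device_only(device_id: int) -> List[str]:
    """All names sharing one device ID, i.e. what a device-ID-only
    database could narrow the card down to."""
    return sorted(
        {name for (dev, _rev), name in MARKETING_NAMES.items() if dev == device_id}
    )


print(marketing_name(0x67DF, 0xE7))
print(names_by_device_only(0x67DF))
```

With only the device ID, the lookup is ambiguous between the two names;
adding the revision ID resolves it.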

Alex


>
> GREAT stuff!
>
> Dieter
>
> 01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI]
> Ellesmere [Radeon RX 470/480] (rev e7) (prog-if 00 [VGA controller])
>         Subsystem: Device 1da2:e366
>         Physical Slot: 6
>         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+
> Stepping- SERR+ FastB2B- DisINTx+
>         Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
> <TAbort- <MAbort+ >SERR- <PERR- INTx-
>         Latency: 0, Cache Line Size: 32 bytes
>         Interrupt: pin A routed to IRQ 30
>         Region 0: Memory at c0000000 (64-bit, prefetchable) [size=256M]
>         Region 2: Memory at b0400000 (64-bit, prefetchable) [size=2M]
>         Region 4: I/O ports at 2000 [size=256]
>         Region 5: Memory at b0140000 (32-bit, non-prefetchable) [size=256K]
>         [virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>         Capabilities: [50] Power Management version 3
>                 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA
> PME(D0-,D1+,D2+,D3hot+,D3cold+)
>                 Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
>         Capabilities: [58] Express (v2) Legacy Endpoint, MSI 00
>                 DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <4us,
> L1 unlimited
>                         ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
>                 DevCtl: Report errors: Correctable- Non-Fatal+ Fatal+
> Unsupported-
>                         RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
>                         MaxPayload 128 bytes, MaxReadReq 512 bytes
>                 DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr-
> TransPend-
>                 LnkCap: Port #0, Speed 8GT/s, Width x16, ASPM L1, Exit
> Latency L0s <64ns, L1 <1us
>                         ClockPM+ Surprise- LLActRep- BwNot- ASPMOptComp+
>                 LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk+
>                         ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>                 LnkSta: Speed 2.5GT/s, Width x8, TrErr- Train- SlotClk+
> DLActive- BWMgmt- ABWMgmt-
>                 DevCap2: Completion Timeout: Not Supported, TimeoutDis-,
> LTR+, OBFF Not Supported
>                 AtomicOpsCap: 32bit+ 64bit+ 128bitCAS-
>                 DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-,
> LTR-, OBFF Disabled
>                 AtomicOpsCtl: ReqEn-
>                 LnkCtl2: Target Link Speed: 8GT/s, EnterCompliance-
> SpeedDis-
>                          Transmit Margin: Normal Operating Range,
> EnterModifiedCompliance- ComplianceSOS-
>                          Compliance De-emphasis: -6dB
>                 LnkSta2: Current De-emphasis Level: -3.5dB,
> EqualizationComplete-, EqualizationPhase1-
>                          EqualizationPhase2-, EqualizationPhase3-,
> LinkEqualizationRequest-
>         Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
>                 Address: 00000000fee1000c  Data: 4152
>         Capabilities: [100 v1] Vendor Specific Information: ID=0001 Rev=1
> Len=010 <?>
>         Capabilities: [150 v2] Advanced Error Reporting
>                 UESta:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt-
> RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
>                 UEMsk:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt-
> RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
>                 UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt-
> RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
>                 CESta:  RxErr- BadTLP- BadDLLP- Rollover- Timeout-
> NonFatalErr-
>                 CEMsk:  RxErr- BadTLP- BadDLLP- Rollover- Timeout-
> NonFatalErr+
>                 AERCap: First Error Pointer: 00, GenCap+ CGenEn- ChkCap+
> ChkEn-
>         Capabilities: [200 v1] #15
>         Capabilities: [270 v1] #19
>         Capabilities: [2b0 v1] Address Translation Service (ATS)
>                 ATSCap: Invalidate Queue Depth: 00
>                 ATSCtl: Enable-, Smallest Translation Unit: 00
>         Capabilities: [2c0 v1] Page Request Interface (PRI)
>                 PRICtl: Enable- Reset-
>                 PRISta: RF- UPRGI- Stopped+
>                 Page Request Capacity: 00000020, Page Request Allocation:
> 00000000
>         Capabilities: [2d0 v1] Process Address Space ID (PASID)
>                 PASIDCap: Exec+ Priv+, Max PASID Width: 10
>                 PASIDCtl: Enable- Exec- Priv-
>         Capabilities: [320 v1] Latency Tolerance Reporting
>                 Max snoop latency: 0ns
>                 Max no snoop latency: 0ns
>         Capabilities: [328 v1] Alternative Routing-ID Interpretation (ARI)
>                 ARICap: MFVC- ACS-, Next Function: 1
>                 ARICtl: MFVC- ACS-, Function Group: 0
>         Capabilities: [370 v1] L1 PM Substates
>                 L1SubCap: PCI-PM_L1.2+ PCI-PM_L1.1+ ASPM_L1.2+ ASPM_L1.1+
> L1_PM_Substates+
>                           PortCommonModeRestoreTime=0us
> PortTPowerOnTime=170us
>                 L1SubCtl1: PCI-PM_L1.2- PCI-PM_L1.1- ASPM_L1.2- ASPM_L1.1-
>                            T_CommonMode=0us LTR1.2_Threshold=0ns
>                 L1SubCtl2: T_PwrOn=10us
>         Kernel driver in use: amdgpu
>         Kernel modules: amdgpu
>
> 01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device aaf0
>         Subsystem: Device 1da2:aaf0
>         Physical Slot: 6
>         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+
> Stepping- SERR+ FastB2B- DisINTx+
>         Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
> <TAbort- <MAbort- >SERR- <PERR- INTx-
>         Latency: 0, Cache Line Size: 32 bytes
>         Interrupt: pin B routed to IRQ 34
>         Region 0: Memory at b0100000 (64-bit, non-prefetchable) [size=16K]
>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>         Capabilities: [50] Power Management version 3
>                 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA
> PME(D0-,D1-,D2-,D3hot-,D3cold-)
>                 Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
>         Capabilities: [58] Express (v2) Legacy Endpoint, MSI 00
>                 DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <4us,
> L1 unlimited
>                         ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
>                 DevCtl: Report errors: Correctable- Non-Fatal+ Fatal+
> Unsupported-
>                         RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
>                         MaxPayload 128 bytes, MaxReadReq 512 bytes
>                 DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr-
> TransPend-
>                 LnkCap: Port #0, Speed 8GT/s, Width x16, ASPM L1, Exit
> Latency L0s <64ns, L1 <1us
>                         ClockPM+ Surprise- LLActRep- BwNot- ASPMOptComp+
>                 LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk+
>                         ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>                 LnkSta: Speed 2.5GT/s, Width x8, TrErr- Train- SlotClk+
> DLActive- BWMgmt- ABWMgmt-
>                 DevCap2: Completion Timeout: Not Supported, TimeoutDis-,
> LTR+, OBFF Not Supported
>                 AtomicOpsCap: 32bit+ 64bit+ 128bitCAS-
>                 DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-,
> LTR-, OBFF Disabled
>                 AtomicOpsCtl: ReqEn-
>                 LnkSta2: Current De-emphasis Level: -3.5dB,
> EqualizationComplete-, EqualizationPhase1-
>                          EqualizationPhase2-, EqualizationPhase3-,
> LinkEqualizationRequest-
>         Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
>                 Address: 00000000feeff00c  Data: 41b2
>         Capabilities: [100 v1] Vendor Specific Information: ID=0001 Rev=1
> Len=010 <?>
>         Capabilities: [150 v2] Advanced Error Reporting
>                 UESta:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt-
> RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
>                 UEMsk:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt-
> RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
>                 UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt-
> RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
>                 CESta:  RxErr- BadTLP- BadDLLP- Rollover- Timeout-
> NonFatalErr-
>                 CEMsk:  RxErr- BadTLP- BadDLLP- Rollover- Timeout-
> NonFatalErr+
>                 AERCap: First Error Pointer: 00, GenCap+ CGenEn- ChkCap+
> ChkEn-
>         Capabilities: [328 v1] Alternative Routing-ID Interpretation (ARI)
>                 ARICap: MFVC- ACS-, Next Function: 0
>                 ARICtl: MFVC- ACS-, Function Group: 0
>         Kernel driver in use: snd_hda_intel
>         Kernel modules: snd_hda_intel

