BUG: *ERROR* No EDID read
Alex Deucher
alexdeucher at gmail.com
Tue Jul 10 19:34:29 UTC 2018
On Fri, Jul 6, 2018 at 12:44 PM, Daniel Andersson <engywook at gmail.com> wrote:
> Alright, I redid the bisect and this time ended up on
> 018d82e5f02ef3583411bcaa4e00c69786f46f19 . If I revert it, it fixes my
> issue. Reverting it on 4.18-rc2 also works.
I've gone ahead and reverted the patch.
Thanks,
Alex
>
> // Daniel
>
> On 5 July 2018 at 21:47, Daniel Andersson <engywook at gmail.com> wrote:
>> [ 0.000000] Command line: BOOT_IMAGE=/vmlinuz-linuxtest
>> root=UUID=27247597-a354-42f3-8040-caff9592a297 drm.debug=0x4 rw quiet
>> [ 0.000000] Kernel command line: BOOT_IMAGE=/vmlinuz-linuxtest
>> root=UUID=27247597-a354-42f3-8040-caff9592a297 drm.debug=0x4 rw quiet
>> [ 5.674793] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.674833] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.674887] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.674930] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.674974] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675014] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675056] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675095] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675138] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675178] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675221] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675260] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675304] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675342] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675384] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675422] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675465] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675503] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675587] [drm:bios_parser_get_firmware_info [amdgpu]] At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>> [ 5.675626] [drm:bios_parser_get_firmware_info [amdgpu]] *ERROR* At
>> bios_parser_get_firmware_info switch, got major 3 minor 1
>>
>> I don't really know what is going on. If I go back to 4.17 and apply
>> my "fix". It doesn't work. I suppose my bisect didn't get me the right
>> commit. I probably never tested that the commit, from the bisect, was
>> actually bad. I was also a little lazy and did "bisect start
>> 29dcea88779c856c7dc92040a0c01233263101d4
>> 6da6c0db5316275015e8cc2959f12a17584aeb64 -- drivers/gpu/drm/amd".
>>
>> I guess I'll try another bisect tomorrow on the entire tree, sorry for
>> the extra work.
>>
>> // Daniel
>>
>> On 5 July 2018 at 20:22, Harry Wentland <harry.wentland at amd.com> wrote:
>>> On 2018-07-05 01:43 PM, Daniel Andersson wrote:
>>>> Well, this workaround:
>>>>
>>>> diff --git a/drivers/gpu/drm/amd/display/dc/bios/bios_parser2.c
>>>> b/drivers/gpu/drm/amd/display/dc/bios/bios_parser2.c
>>>> index 10a5807a7e8b..d0f5910c906c 100644
>>>> --- a/drivers/gpu/drm/amd/display/dc/bios/bios_parser2.c
>>>> +++ b/drivers/gpu/drm/amd/display/dc/bios/bios_parser2.c
>>>> @@ -1321,6 +1321,8 @@ static enum bp_result bios_parser_get_firmware_info(
>>>> header = GET_IMAGE(struct atom_common_table_header,
>>>> DATA_TABLES(firmwareinfo));
>>>> get_atom_data_table_revision(header, &revision);
>>>> + dm_output_to_console("At bios_parser_get_firmware_info switch,
>>>> got major %d minor %d", revision.major, revision.minor);
>>>> + dm_error("At bios_parser_get_firmware_info switch, got major %d
>>>> minor %d", revision.major, revision.minor);
>>>> switch (revision.major) {
>>>> case 3:
>>>> switch (revision.minor) {
>>>> @@ -1328,7 +1330,7 @@ static enum bp_result bios_parser_get_firmware_info(
>>>> result = get_firmware_info_v3_1(bp, info);
>>>> break;
>>>> case 2:
>>>> - result = get_firmware_info_v3_2(bp, info);
>>>> + result = get_firmware_info_v3_1(bp, info);
>>>> break;
>>>> default:
>>>> break;
>>>>
>>>> "works":
>>>> [engy][~/devel/3pp/linux] ((6e65fb862064...)|BISECTING)$ xrandr
>>>> Screen 0: minimum 320 x 200, current 2560 x 1440, maximum 16384 x 16384
>>>> DisplayPort-0 connected 2560x1440+0+0 (normal left inverted right x
>>>> axis y axis) 598mm x 336mm
>>>> 2560x1440 59.95*+ 120.00 99.95 84.98 23.97
>>>> 1920x1200 59.95
>>>> 1920x1080 59.95
>>>> 1600x1200 59.95
>>>> 1680x1050 59.95
>>>> 1280x1024 59.95
>>>> 1440x900 59.95
>>>> 1280x800 59.95
>>>> 1280x720 59.95
>>>> 1024x768 59.95
>>>> 800x600 59.95
>>>> 640x480 59.95
>>>> DisplayPort-1 disconnected (normal left inverted right x axis y axis)
>>>> HDMI-A-0 disconnected (normal left inverted right x axis y axis)
>>>> HDMI-A-1 disconnected (normal left inverted right x axis y axis)
>>>>
>>>> Where does dm_error and dm_output_to_console end up?
>>>>
>>>
>>> dm_error -> DRM_ERROR will end up in dmesg as an error message
>>> dm_output_to_console -> DRM_DEBUG_KMS will end up in dmesg as a normal log message if drm.debug=0x4 is set on the kernel command line
>>>
>>> I'd be curious what the driver thinks the major and minor are.
>>>
>>> Harry
>>>
>>>> // Daniel
>>>>
>>>> On 5 July 2018 at 18:42, Deucher, Alexander <Alexander.Deucher at amd.com> wrote:
>>>>> So your vbios has table v3.1 so it should not be affected by that patch.
>>>>> Does reverting that patch actually fix the issue?
>>>>>
>>>>>
>>>>> Alex
>>>>>
>>>>> ________________________________
>>>>> From: amd-gfx <amd-gfx-bounces at lists.freedesktop.org> on behalf of Daniel
>>>>> Andersson <engywook at gmail.com>
>>>>> Sent: Thursday, July 5, 2018 12:22:17 PM
>>>>> To: Alex Deucher
>>>>> Cc: amd-gfx at lists.freedesktop.org
>>>>> Subject: Re: BUG: *ERROR* No EDID read
>>>>>
>>>>> I have not flashed any GPU BIOS. It's not a reference Vega though,
>>>>> Sapphire something. Maybe they made changes?
>>>>>
>>>>> vbios is attached.
>>>>>
>>>>> // Daniel
>>>>>
>>>>> On 5 July 2018 at 15:38, Alex Deucher <alexdeucher at gmail.com> wrote:
>>>>>> On Mon, Jul 2, 2018 at 5:39 PM, Daniel Andersson <engywook at gmail.com>
>>>>>> wrote:
>>>>>>> Sure, bisecting gets me 6e65fb862064663ad3a08f964af1e8f3f2abf688 .
>>>>>>>
>>>>>>> In drivers/gpu/drm/amd/display/dc/bios/bios_parser2.c,
>>>>>>> get_firmware_info_v3_1() works but get_firmware_info_v3_2() does not
>>>>>>> do the right thing for my Vega.
>>>>>>>
>>>>>>> Could I break my GPU if I were to set some bad/wrong frequency there?
>>>>>>
>>>>>> vega10 should not hit that new path at all. Have you edited your
>>>>>> vbios? Can you send us a copy? To get a copy of the vbios:
>>>>>>
>>>>>> Without the driver loaded:
>>>>>> (as root)
>>>>>> (use lspci to get the bus id)
>>>>>> cd /sys/bus/pci/devices/<pci bus id>
>>>>>> echo 1 > rom
>>>>>> cat rom > /tmp/vbios.rom
>>>>>> echo 0 > rom
>>>>>>
>>>>>> If the driver is loaded:
>>>>>> (as root)
>>>>>> cat /sys/kernel/debug/dri/0/amdgpu_vbios > /tmp/vbios.rom
>>>>>>
>>>>>> Alex
>>>>>>
>>>>>>>
>>>>>>> lspci:
>>>>>>> 43:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
>>>>>>> [AMD/ATI] Vega 10 XT [Radeon RX Vega 64] (rev c1) (prog-if 00 [VGA
>>>>>>> controller])
>>>>>>> Subsystem: Sapphire Technology Limited Vega 10 XT [Radeon RX Vega
>>>>>>> 64]
>>>>>>> Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
>>>>>>> Stepping- SERR- FastB2B- DisINTx+
>>>>>>> Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
>>>>>>> <TAbort- <MAbort- >SERR- <PERR- INTx-
>>>>>>> Latency: 0, Cache Line Size: 64 bytes
>>>>>>> Interrupt: pin A routed to IRQ 85
>>>>>>> Region 0: Memory at d0000000 (64-bit, prefetchable) [size=256M]
>>>>>>> Region 2: Memory at e0000000 (64-bit, prefetchable) [size=2M]
>>>>>>> Region 4: I/O ports at f000 [size=256]
>>>>>>> Region 5: Memory at ed400000 (32-bit, non-prefetchable) [size=512K]
>>>>>>> Expansion ROM at ed480000 [disabled] [size=128K]
>>>>>>> Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>>>>>> Capabilities: [50] Power Management version 3
>>>>>>> Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA
>>>>>>> PME(D0-,D1+,D2+,D3hot+,D3cold+)
>>>>>>> Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
>>>>>>> Capabilities: [64] Express (v2) Legacy Endpoint, MSI 00
>>>>>>> DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <4us, L1
>>>>>>> unlimited
>>>>>>> ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
>>>>>>> DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
>>>>>>> RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
>>>>>>> MaxPayload 256 bytes, MaxReadReq 512 bytes
>>>>>>> DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr-
>>>>>>> TransPend-
>>>>>>> LnkCap: Port #0, Speed 8GT/s, Width x16, ASPM L0s L1, Exit Latency
>>>>>>> L0s <64ns, L1 <1us
>>>>>>> ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
>>>>>>> LnkCtl: ASPM L0s L1 Enabled; RCB 64 bytes Disabled- CommClk+
>>>>>>> ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>>>>>>> LnkSta: Speed 8GT/s, Width x16, TrErr- Train- SlotClk+ DLActive-
>>>>>>> BWMgmt- ABWMgmt-
>>>>>>> DevCap2: Completion Timeout: Not Supported, TimeoutDis-, LTR+,
>>>>>>> OBFF Not Supported
>>>>>>> AtomicOpsCap: 32bit- 64bit- 128bitCAS-
>>>>>>> DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF
>>>>>>> Disabled
>>>>>>> AtomicOpsCtl: ReqEn-
>>>>>>> LnkCtl2: Target Link Speed: 8GT/s, EnterCompliance- SpeedDis-
>>>>>>> Transmit Margin: Normal Operating Range,
>>>>>>> EnterModifiedCompliance- ComplianceSOS-
>>>>>>> Compliance De-emphasis: -6dB
>>>>>>> LnkSta2: Current De-emphasis Level: -3.5dB, EqualizationComplete+,
>>>>>>> EqualizationPhase1+
>>>>>>> EqualizationPhase2+, EqualizationPhase3+,
>>>>>>> LinkEqualizationRequest-
>>>>>>> Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
>>>>>>> Address: 00000000fee00000 Data: 0000
>>>>>>> Capabilities: [100 v1] Vendor Specific Information: ID=0001 Rev=1
>>>>>>> Len=010 <?>
>>>>>>> Capabilities: [150 v2] Advanced Error Reporting
>>>>>>> UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF-
>>>>>>> MalfTLP- ECRC- UnsupReq- ACSViol-
>>>>>>> UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF-
>>>>>>> MalfTLP- ECRC- UnsupReq- ACSViol-
>>>>>>> UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+
>>>>>>> MalfTLP+ ECRC- UnsupReq- ACSViol-
>>>>>>> CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
>>>>>>> CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
>>>>>>> AERCap: First Error Pointer: 00, ECRCGenCap+ ECRCGenEn-
>>>>>>> ECRCChkCap+ ECRCChkEn-
>>>>>>> MultHdrRecCap- MultHdrRecEn- TLPPfxPres- HdrLogCap-
>>>>>>> HeaderLog: 00000000 00000000 00000000 00000000
>>>>>>> Capabilities: [200 v1] #15
>>>>>>> Capabilities: [270 v1] #19
>>>>>>> Capabilities: [2a0 v1] Access Control Services
>>>>>>> ACSCap: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd-
>>>>>>> EgressCtrl- DirectTrans-
>>>>>>> ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd-
>>>>>>> EgressCtrl- DirectTrans-
>>>>>>> Capabilities: [2b0 v1] Address Translation Service (ATS)
>>>>>>> ATSCap: Invalidate Queue Depth: 00
>>>>>>> ATSCtl: Enable+, Smallest Translation Unit: 00
>>>>>>> Capabilities: [2c0 v1] Page Request Interface (PRI)
>>>>>>> PRICtl: Enable- Reset-
>>>>>>> PRISta: RF- UPRGI- Stopped+
>>>>>>> Page Request Capacity: 00000020, Page Request Allocation: 00000000
>>>>>>> Capabilities: [2d0 v1] Process Address Space ID (PASID)
>>>>>>> PASIDCap: Exec+ Priv+, Max PASID Width: 10
>>>>>>> PASIDCtl: Enable- Exec- Priv-
>>>>>>> Capabilities: [320 v1] Latency Tolerance Reporting
>>>>>>> Max snoop latency: 0ns
>>>>>>> Max no snoop latency: 0ns
>>>>>>> Kernel driver in use: amdgpu
>>>>>>> Kernel modules: amdgpu
>>>>>>>
>>>>>>> // Daniel
>>>>>>>
>>>>>>> On 2 July 2018 at 21:29, Alex Deucher <alexdeucher at gmail.com> wrote:
>>>>>>>> On Mon, Jul 2, 2018 at 3:21 PM, Daniel Andersson <engywook at gmail.com>
>>>>>>>> wrote:
>>>>>>>>> I get:
>>>>>>>>> [drm:dc_link_detect [amdgpu]] *ERROR* No EDID read.
>>>>>>>>> on boot. This started happening on 4.17 and is still an issue on
>>>>>>>>> 4.18-rc2.
>>>>>>>>>
>>>>>>>>> I have a Vega64 connected on Display Port to a monitor(1440p). The
>>>>>>>>> monitor
>>>>>>>>> doesn't have any additional ports so I can't test HDMI.
>>>>>>>>>
>>>>>>>>> xrandr:
>>>>>>>>> Screen 0: minimum 320 x 200, current 1024 x 768, maximum 16384 x 16384
>>>>>>>>> DisplayPort-0 connected 1024x768+0+0 (normal left inverted right x axis
>>>>>>>>> y
>>>>>>>>> axis) 0mm x 0mm
>>>>>>>>> 1024x768 60.00*
>>>>>>>>> 800x600 60.32 56.25
>>>>>>>>> 848x480 60.00
>>>>>>>>> 640x480 59.94
>>>>>>>>> DisplayPort-1 disconnected (normal left inverted right x axis y axis)
>>>>>>>>> HDMI-A-0 disconnected (normal left inverted right x axis y axis)
>>>>>>>>> HDMI-A-1 disconnected (normal left inverted right x axis y axis)
>>>>>>>>>
>>>>>>>>> The 1440p resolution worked on 4.16.
>>>>>>>>
>>>>>>>> Can you use git to bisect?
>>>>>>>>
>>>>>>>> Alex
>>>> _______________________________________________
>>>> amd-gfx mailing list
>>>> amd-gfx at lists.freedesktop.org
>>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
>>>>
> _______________________________________________
> amd-gfx mailing list
> amd-gfx at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
More information about the amd-gfx
mailing list