5.13-rc3 Renoir (ChipID = 0x1636): error GPU reset, fences timed out, failed to initialize parser -125

Alex Deucher alexdeucher at gmail.com
Tue Jun 1 15:28:05 UTC 2021


On Mon, May 31, 2021 at 4:17 AM Julian Wollrath <jwollrath at web.de> wrote:
>
> Hello,
>
> on 5.13-rc3 with a 4650U (Renoir, ChipID = 0x1636) I am able to
> reliably get the GPU to reset under X11/Xorg with the amdgpu driver
> (current from git) and having persistent problems afterwards. I
> achieve this in the following way (using ROOT (root.cern.ch) to draw a
> histogram):
>
> $ root
> # TH1 *t = new TH1I("test", "test", 2, -.5, 1.5)
> # t->Fill(1)
> # t->Draw()
>
>
> this opens a window showing the histogram and results in
>
> kernel: [  370.709485] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
> kernel: [  375.820701] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
> kernel: [  375.830689] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=3783, emitted seq=3785
> kernel: [  375.831015] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 1028 thread Xorg
> :cs0 pid 1029
> kernel: [  375.831174] amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
> kernel: [  375.924610] [drm] free PSP TMR buffer
> kernel: [  375.951409] amdgpu 0000:03:00.0: amdgpu: MODE2 reset
> kernel: [  375.951531] amdgpu 0000:03:00.0: amdgpu: GPU reset succeeded, trying to resume
> kernel: [  375.951712] [drm] PCIE GART of 1024M enabled.
> kernel: [  375.951715] [drm] PTB located at 0x000000F400900000
> kernel: [  375.951957] [drm] PSP is resuming...
> kernel: [  375.971811] [drm] reserve 0x400000 from 0xf41f800000 for PSP TMR
> kernel: [  376.277979] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
> kernel: [  376.288823] amdgpu 0000:03:00.0: amdgpu: RAP: optional rap ta ucode is not available
> kernel: [  376.288830] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
> kernel: [  376.288836] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
> kernel: [  376.290539] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully!
> kernel: [  376.648574] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  376.649562] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  376.721210] [drm] kiq ring mec 2 pipe 1 q 0
> kernel: [  376.722457] [drm] DMUB hardware initialized: version=0x01020008
> kernel: [  376.936133] [drm] VCN decode and encode initialized successfully(under DPG Mode).
> kernel: [  376.936628] [drm] JPEG decode initialized successfully.
> kernel: [  376.936635] amdgpu 0000:03:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
> kernel: [  376.936639] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
> kernel: [  376.936641] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
> kernel: [  376.936643] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
> kernel: [  376.936645] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
> kernel: [  376.936646] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
> kernel: [  376.936648] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
> kernel: [  376.936649] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
> kernel: [  376.936650] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
> kernel: [  376.936652] amdgpu 0000:03:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
> kernel: [  376.936654] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
> kernel: [  376.936656] amdgpu 0000:03:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
> kernel: [  376.936657] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
> kernel: [  376.936659] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
> kernel: [  376.936660] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
> kernel: [  376.945068] amdgpu 0000:03:00.0: amdgpu: recover vram bo from shadow start
> kernel: [  376.945075] amdgpu 0000:03:00.0: amdgpu: recover vram bo from shadow done
> kernel: [  376.945081] [drm] Skip scheduling IBs!
> kernel: [  376.945084] [drm] Skip scheduling IBs!
> kernel: [  376.945119] amdgpu 0000:03:00.0: amdgpu: GPU reset(2) succeeded!

The GPU reset was successful.  You'll need to restart your GUI
environment.  Unfortunately no current desktop environments on Linux
properly handle lost contexts.

Alex


> kernel: [  376.945161] [drm] Skip scheduling IBs!
> kernel: [  376.945169] [drm] Skip scheduling IBs!
> kernel: [  376.945172] [drm] Skip scheduling IBs!
> kernel: [  376.945235] [drm] Skip scheduling IBs!
> kernel: [  376.945243] [drm] Skip scheduling IBs!
> kernel: [  376.945246] [drm] Skip scheduling IBs!
> kernel: [  376.945248] [drm] Skip scheduling IBs!
> kernel: [  376.945251] [drm] Skip scheduling IBs!
> kernel: [  376.945253] [drm] Skip scheduling IBs!
> kernel: [  376.948659] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  381.640053] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  381.641067] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  381.641510] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  382.795658] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  382.796462] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.157991] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.224317] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.225321] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.226164] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.226836] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.227294] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.227573] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  383.228525] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  391.623243] amdgpu_cs_ioctl: 13 callbacks suppressed
> kernel: [  391.623250] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  391.623979] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  396.615491] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  396.616130] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> kernel: [  398.242898] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
> [...]
>
> I guess, that this might not be detailed enough information but I am
> happy to provide more, if you tell me, what exactly is needed.
>
>
> With best regards,
> Julian Wollrath
>
> --
>  ()  ascii ribbon campaign - against html e-mail
>  /\                        - against proprietary attachments
> _______________________________________________
> amd-gfx mailing list
> amd-gfx at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx


More information about the amd-gfx mailing list