[amd-gfx] AMD Carrizo - GPU fault detected: 146 0x0842b714

Mads mads at ab3.no
Tue Jun 21 17:16:19 UTC 2016


I sent this about 1.5 hours ago, but since it hasn't arrived on the 
mailing list yet, I'm trying again...

On 2016-06-21 17:48, Mads wrote:

> On 2016-06-21 10:12, Mads wrote:
> 
> On 2016-06-21 09:39, Nicolai Hähnle wrote:
> 
> Thanks. However, I still don't think this is going to help. Your 
> earlier trace experiments showed that the problematic SDMA commands 
> came from the X server, _not_ from plasmashell.
> 
> So what we see here is likely just the first set of GPU commands sent 
> by plasmashell after the VM fault occurred. Since the plasmashell 
> process is unable to tell who caused the VM fault, it takes the blame 
> incorrectly. Are you sure the X server is using your self-compiled 
> radeonsi_dri.so and has the environment variable set? If it creates a 
> ddebug_dump, it might be somewhere else (it's based off the HOME 
> environment variable, which may be different).
> 
> I'll take a second look to see if there's an X dump there too, but 
> unfortunately it'll be about 8 hours before I have the machine at 
> hand again.
> 
> And yes, I'm sure, everything is built through portage, so there is no 
> "self-compiled" on the system per se. There's always just one lib 
> available at any time :)

You were right! X didn't have R600_DEBUG=check_vm in its environment 
(it isn't started from a login shell, so /etc/profile never gets sourced).
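
For anyone who wants to verify this on their own machine, here is a rough sketch of how the environment of a running process can be checked on Linux by reading /proc/<pid>/environ. The helper names (parse_environ, process_env, has_debug_flag) are made up for illustration, and splitting the R600_DEBUG value on non-alphanumeric separators is an assumption about how the flags are listed. Note that the file reflects the environment the process was started with, and reading it for the X server usually requires root.

```python
import re
from pathlib import Path


def parse_environ(raw):
    """Parse the NUL-separated KEY=VALUE records found in /proc/<pid>/environ."""
    env = {}
    for entry in raw.split(b"\0"):
        if b"=" not in entry:
            continue  # skip empty trailing records and malformed entries
        key, _, value = entry.partition(b"=")
        env[key.decode(errors="replace")] = value.decode(errors="replace")
    return env


def process_env(pid):
    """Read the start-up environment of a running process (Linux only)."""
    return parse_environ(Path(f"/proc/{pid}/environ").read_bytes())


def has_debug_flag(env, flag="check_vm", var="R600_DEBUG"):
    """Return True if `flag` is listed in `var`.

    Assumption: flags are separated by commas or other
    non-alphanumeric characters.
    """
    flags = re.split(r"[^A-Za-z0-9_]+", env.get(var, ""))
    return flag in flags
```

With the PID of the X server at hand (for example from `pidof X`), `has_debug_flag(process_env(pid))` tells you whether the variable actually reached the server.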

Here's what I ran:

> $ XAUTHORITY=.Xauthority DISPLAY=:0 LIBGL_DEBUG=verbose dolphin
> libGL: pci id for fd 9: 1002:9874, driver radeonsi
> libGL: OpenDriver: trying /usr/lib64/dri/tls/radeonsi_dri.so
> libGL: OpenDriver: trying /usr/lib64/dri/radeonsi_dri.so
> si_vm_fault_occured: failed to parse line '                Either 
> enable ECC checking or force module loading by setting 
> 'ecc_enable_override'.
> '
> libGL: Using DRI3 for screen 0
> Trying to convert empty KLocalizedString to QString.
> Cannot creat accessible child interface for object:  
> PlacesView(0x118d670)  index:  5
> QPixmap::scaled: Pixmap is a null pixmap
> QPixmap::scaled: Pixmap is a null pixmap
> (... etc ...)
> The X11 connection broke (error 1). Did the X11 server die?

Attaching dmesg and ddebug_dump.

- Mads
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: carrizo.dmesg
URL: <https://lists.freedesktop.org/archives/amd-gfx/attachments/20160621/799d1b8a/attachment-0002.ksh>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: X_281_00000000
URL: <https://lists.freedesktop.org/archives/amd-gfx/attachments/20160621/799d1b8a/attachment-0003.ksh>
