[Bug 110674] Crashes / Resets From AMDGPU / Radeon VII

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Aug 12 13:21:03 UTC 2019


https://bugs.freedesktop.org/show_bug.cgi?id=110674

--- Comment #80 from Tom B <tom at r.je> ---
> I tried something like that before but a huge portion of the commits in that range won't build kernels that can boot (at least on my system). I ended up resorting to trying reverting individual vega20-affecting  commits out of 5.1. See my results far above in the thread (though someone else willing to spend more time doing a deeper analysis of the code could probably take my approach much further).

That's why my focus has been finding places in the code where something
different happens based on the number of displays. Though this may be a futile
avenue of exploration as it could just be an issue of additional memory
bandwith requirements or even something that should be done differently with 2
displays that isn't.

> It does make me wonder if it's worth testing like 2 simple 1080p 60 Hz displays. Maybe that wouldn't trigger this issue. Not that that would really be of use to me. But it might help distinguish between just monitor detect generally being broken and "high monitor load" being broken . . .

This would be an interesting test but I think 1080p 60hz monitors with
displayport are fairly uncommon and I don't have any to test with. My guess is
anyone with a Radeon VII, a high end card with 16gb VRAM, is likely to have a
high end display which could equally explain why there are no reports here of
people running 1080p 60hz displays. 

My next test is going to be logging dpm_table->dpm_state.hard_min_level on line
3354 (just before it's sent to the smc) on both 5.0.13 and 5.2.7 to see if the
same hard_min_level value is sent to the smc on both kernels. This will at
least let us know whether it's something that's incorrectly setting
hard_min_level or something that prevents the smc accepting the value. My hunch
from my previous tests is that it's the latter but I'll try it and report back.

I know nothing about driver development so I have no idea how this stuff should
work, I can only compare the differences between 5.0.13 and later kernels.

Anyway, thanks everyone for your input. Any information, even on things that
you tried and didn't work, is valuable as it can help us narrow down the
problem.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/dri-devel/attachments/20190812/aaa0af91/attachment.html>


More information about the dri-devel mailing list