[Bug 99611] [SNB] GPU hang after over temperature

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue Jun 27 18:50:35 UTC 2017


https://bugs.freedesktop.org/show_bug.cgi?id=99611

--- Comment #5 from Chris Tillman <toff.tillman at gmail.com> ---
The initial attachments (already attached to the bug) are all I have. I can
report that the root cause of the overheating was found .. an overheating
internal connector in the power supply chain. The connector was replaced,
and thermal grease re-applied to the heat sink, so the machine no longer
has overheating problems.

The point of this bug was not to trouble shoot my machine; the point was
that the available measurements from coretemp were not being heeded. The
logs show that an overtemperature is reported only for a cycle until it
sets back, saying for example

"[57849.613938] CPU1: Core temperature above threshold, cpu clock throttled
(total events = 12172)"
and then almost immediately (12 microseconds) after,
"[57849.614950] CPU1: Core temperature/speed normal"

It appears from the logs that the only response to monitoring is an
immediate reset of the sensor, and that protection of the machine is not
occurring.

Chris

On Tue, Jun 27, 2017 at 9:40 AM, <bugzilla-daemon at freedesktop.org> wrote:

> Elizabeth <elizabethx.de.la.torre.mena at intel.com> changed bug 99611
> <https://bugs.freedesktop.org/show_bug.cgi?id=99611>
> What Removed Added
> Summary GPU hang after over temperature [SNB] GPU hang after over
> temperature
> Status REOPENED NEEDINFO
>
> *Comment # 4 <https://bugs.freedesktop.org/show_bug.cgi?id=99611#c4> on
> bug 99611 <https://bugs.freedesktop.org/show_bug.cgi?id=99611> from
> Elizabeth <elizabethx.de.la.torre.mena at intel.com> *
>
> (In reply to Ricardo from comment #3 <https://bugs.freedesktop.org/show_bug.cgi?id=99611#c3>)> information provided by the submitter (logs) moving bug to reopen state
>
> Good afternoon,
> Is this bug still valid? Is the problem still present? If so could you add
> logs, HW and SW information. Thank you.
> Thank you.
>
> ------------------------------
> You are receiving this mail because:
>
>    - You reported the bug.
>
>

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.
You are the QA Contact for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20170627/7f2e40ed/attachment.html>


More information about the intel-gfx-bugs mailing list