<html>
<head>
<base href="https://bugs.freedesktop.org/">
</head>
<body><span class="vcard"><a class="email" href="mailto:walicki@us.ibm.com" title="John Walicki <walicki@us.ibm.com>"> <span class="fn">John Walicki</span></a>
</span> changed
<a class="bz_bug_link
bz_status_NEW "
title="NEW - Nouveau system freeze fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]"
href="https://bugs.freedesktop.org/show_bug.cgi?id=100567">bug 100567</a>
<br>
<table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>What</th>
<th>Removed</th>
<th>Added</th>
</tr>
<tr>
<td style="text-align:right;">CC</td>
<td>
</td>
<td>walicki@us.ibm.com
</td>
</tr></table>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW - Nouveau system freeze fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]"
href="https://bugs.freedesktop.org/show_bug.cgi?id=100567#c38">Comment # 38</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW - Nouveau system freeze fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]"
href="https://bugs.freedesktop.org/show_bug.cgi?id=100567">bug 100567</a>
from <span class="vcard"><a class="email" href="mailto:walicki@us.ibm.com" title="John Walicki <walicki@us.ibm.com>"> <span class="fn">John Walicki</span></a>
</span></b>
<pre>The nouveau driver on my ThinkPad P50 (running RHEL 7.7 with a
5.2.11-1.el7.elrepo.x86_64 kernel) just hung with this same error.
Sep 5 09:04:48 jaw-p50rhel7 kernel: nouveau 0000:01:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
Sep 5 09:04:48 jaw-p50rhel7 kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Sep 5 09:04:48 jaw-p50rhel7 kernel: nouveau 0000:01:00.0: fifo: channel 2: killed
Sep 5 09:04:48 jaw-p50rhel7 kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Sep 5 09:04:48 jaw-p50rhel7 kernel: nouveau 0000:01:00.0: X[5775]: channel 2 killed!
Possibly interesting: /var/log/Xorg.0.log shows a mouse input event whose
timing matches the nouveau errors exactly.
[150241.393] AUDIT: Thu Sep 5 09:04:19 2019: 5775: client 43 disconnected
[150241.396] AUDIT: Thu Sep 5 09:04:19 2019: 5775: client 44 disconnected
[150270.473] (II) event8 - Logitech USB Receiver: SYN_DROPPED event - some input events have been lost.
The value in [brackets] is a timestamp giving the time in seconds since the
system last booted, so 150270 is 29 seconds after the line stamped 09:04:19
(150241).
09:04:19 + 29 seconds is 09:04:48, which is when the nouveau driver hung (see
the /var/log/messages timestamps above).
I am not certain whether the lost input events were a cause or an effect of
the video driver hang.
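As a quick check of the arithmetic above, here is a minimal Python sketch. The
variable names are my own, and the values are copied from the Xorg.0.log lines
quoted above. It assumes the bracketed offsets and the AUDIT wall-clock time
advance together.

```python
from datetime import datetime, timedelta

# Values copied from the Xorg.0.log lines quoted above (names are illustrative).
audit_offset = 150241.393  # [bracket] value on the 09:04:19 AUDIT line
drop_offset = 150270.473   # [bracket] value on the SYN_DROPPED line
audit_wall = datetime(2019, 9, 5, 9, 4, 19)

# Elapsed time between the two log lines, truncated to whole seconds.
elapsed = int(drop_offset - audit_offset)

# Wall-clock time of the SYN_DROPPED line, to compare with /var/log/messages.
drop_wall = audit_wall + timedelta(seconds=elapsed)

print(elapsed)                         # 29
print(drop_wall.strftime("%H:%M:%S"))  # 09:04:48
```

The result matches the 09:04:48 timestamp on the nouveau kernel messages.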
I was able to ssh into my system to reboot.
$ lspci -vv -s 01:00.0
01:00.0 VGA compatible controller: NVIDIA Corporation GM107GLM [Quadro M1000M] (rev a2) (prog-if 00 [VGA controller])
Subsystem: Lenovo Device 2230
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast &gt;TAbort- &lt;TAbort- &lt;MAbort- &gt;SERR- &lt;PERR- INTx-
Latency: 0
Interrupt: pin A routed to IRQ 129
Region 0: Memory at b2000000 (32-bit, non-prefetchable) [size=16M]
Region 1: Memory at a0000000 (64-bit, prefetchable) [size=256M]
Region 3: Memory at b0000000 (64-bit, prefetchable) [size=32M]
Region 5: I/O ports at 4000 [size=128]
Expansion ROM at 000c0000 [disabled] [size=128K]
Capabilities: &lt;access denied&gt;
Kernel driver in use: nouveau
Kernel modules: nouveau</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>