<html>
<head>
<base href="https://bugs.freedesktop.org/">
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW - Kernel panic while “ modprobe amdkfd ; modprobe -r amdkfd " ; 4.14.35 kernel ."
href="https://bugs.freedesktop.org/show_bug.cgi?id=110225">110225</a>
</td>
</tr>
<tr>
<th>Summary</th>
<td>Kernel panic while “ modprobe amdkfd ; modprobe -r amdkfd " ; 4.14.35 kernel .
</td>
</tr>
<tr>
<th>Product</th>
<td>DRI
</td>
</tr>
<tr>
<th>Version</th>
<td>XOrg git
</td>
</tr>
<tr>
<th>Hardware</th>
<td>Other
</td>
</tr>
<tr>
<th>OS</th>
<td>All
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Severity</th>
<td>major
</td>
</tr>
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Component</th>
<td>DRM/amdkfd
</td>
</tr>
<tr>
<th>Assignee</th>
<td>dri-devel@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Reporter</th>
<td>John.p.donnelly@oracle.com
</td>
</tr></table>
<p>
<div>
<pre>Hello ,
I am investigating a issue that our test group reported concerning this driver.
Their test loads and unloads every kernel module included in the Oracle
4.14.35 kernel release . You don’t even need a AMD platform . It occurs on any
Intel, or a KVM VM instance too.
Kernel panic while “ modprobe amdkfd ; modprobe -r amdkfd “
[ 329.425334] ? __slab_free+0x9b/0x2ba
[ 329.427836] ? process_slab+0x3c1/0x45c
[ 329.430336] dev_printk_emit+0x4e/0x65
[ 329.432829] __dev_printk+0x46/0x8b
[ 329.435183] _dev_info+0x6c/0x85
[ 329.437435] ? kfree+0x141/0x182
[ 329.439646] kfd_module_exit+0x37/0x39 [amdkfd]
[ 329.442258] SyS_delete_module+0x1c3/0x26f
[ 329.444722] ? entry_SYSCALL_64_after_hwframe+0xaa/0x0
[ 329.447479] ? entry_SYSCALL_64_after_hwframe+0xa3/0x0
[ 329.450206] ? entry_SYSCALL_64_after_hwframe+0x9c/0x0
[ 329.452912] ? entry_SYSCALL_64_after_hwframe+0x95/0x0
[ 329.455586] do_syscall_64+0x79/0x1ae
[ 329.457766] entry_SYSCALL_64_after_hwframe+0x151/0x0
[ 329.460369] RIP: 0033:0x7f1757a1b457
[ 329.462502] RSP: 002b:00007ffd62ce1f48 EFLAGS: 00000206 ORIG_RAX:
Looks like some memory corruption.
Sometimes the unload works but the message logged is garbage:
[root@jpd-vmbase02 ~]# modprobe -r amdkfd
[ 144.449981] ???????????? hn??蟟??xn??ן??kfd: Removed module
Is this something one of team members could have possibly corrected in an
upstream version ?
#define KFD_DRIVER_DESC "Standalone HSA driver for AMD's GPUs"
#define KFD_DRIVER_DATE "20150421"
#define KFD_DRIVER_MAJOR 0
#define KFD_DRIVER_MINOR 7
#define KFD_DRIVER_PATCHLEVEL 2
Thank you,
John</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>