RE: 答复: [PATCH] drm/amd/amdgpu: fix over-bound accessing in amdgpu_cs_wait_any_fence
He, Roger
Hongbo.He at amd.com
Fri Nov 17 05:31:57 UTC 2017
Theoretically, if first < fence_count, array[first] will not be NULL.
Hi Emily:
do you remember the issue you fixed has same error log?
Thanks
Roger(Hongbo.He)
-----Original Message-----
From: Zhou, David(ChunMing)
Sent: Friday, November 17, 2017 1:24 PM
To: Qu, Jim <Jim.Qu at amd.com>; He, Roger <Hongbo.He at amd.com>; amd-gfx at lists.freedesktop.org
Cc: Zhou, David(ChunMing) <David1.Zhou at amd.com>; Koenig, Christian <Christian.Koenig at amd.com>
Subject: Re: 答复: [PATCH] drm/amd/amdgpu: fix over-bound accessing in amdgpu_cs_wait_any_fence
Yes, As Jim pointed out, you lacks the array[] checking.
you can just change to if (first < fence_count && array[first]), otherwise it's a good fix for regression.
Regards,
David Zhou
On 2017年11月17日 13:16, Qu, Jim wrote:
> Hi Roger:
>
> - if (array[first])
> - r = array[first]->error;
> - else
> + if (first == ~0)
> r = 0;
> + else
> + r = array[first]->error;
>
> // The patch looks like change original logic that miss to check array[first].
>
> Thanks
> JimQu
>
> ________________________________________
> 发件人: amd-gfx <amd-gfx-bounces at lists.freedesktop.org> 代表 Roger He <Hongbo.He at amd.com>
> 发送时间: 2017年11月17日 13:04
> 收件人: amd-gfx at lists.freedesktop.org
> 抄送: Zhou, David(ChunMing); He, Roger; Koenig, Christian
> 主题: [PATCH] drm/amd/amdgpu: fix over-bound accessing in amdgpu_cs_wait_any_fence
>
> fix the following issue:
>
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.712090] Oops: 0000 [#2] SMP
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.712481] Modules linked in: amdgpu(OE) chash ttm(OE) drm_kms_helper(OE) drm(OE) i2c_algo_bit fb_sys_fops syscopyarea sysfillrect sysimgblt intel_rapl snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp snd_hda_codec_hdmi coretemp snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm kvm snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc snd_seq_device snd_timer aesni_intel snd mei_me mei aes_x86_64 crypto_simd serio_raw eeepc_wmi glue_helper asus_wmi sparse_keymap cryptd soundcore shpchp wmi_bmof lpc_ich mac_hid tpm_infineon nfsd auth_rpcgss nfs_acl lockd parport_pc grace ppdev sunrpc lp parport autofs4 hid_generic usbhid ahci mxm_wmi r8169 libahci hid mii wmi video
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.715120] CPU: 1 PID: 1330 Comm: deqp-vk Tainted: G D OE 4.13.0-custom #1
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.715879] Hardware name: ASUS All Series/Z87-A, BIOS 1802 01/28/2014
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.716658] task: ffff9b7115728000 task.stack: ffffb178016e0000
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.717494] RIP: 0010:amdgpu_cs_wait_fences_ioctl+0x20b/0x2e0 [amdgpu]
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.718312] RSP: 0018:ffffb178016e3cb0 EFLAGS: 00010246
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.719270] RAX: 00000000ffffffff RBX: ffffb178016e3d90 RCX: 0000000000000000
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.720247] RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff9b7116a1d8a8
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.721246] RBP: ffffb178016e3d00 R08: 00000000ffffffff R09: 0000000000000000
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.722262] R10: 000000000000ed00 R11: ffffb178016e3d90 R12: ffff9b7116a1d8a8
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.723299] R13: ffff9b7000707020 R14: 0000000000000001 R15: 0000000000000000
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.724358] FS: 00007f89f3af4740(0000) GS:ffff9b712ec80000(0000) knlGS:0000000000000000
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.725447] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.726550] CR2: ffff9b7916a1d8a0 CR3: 000000022042e000 CR4: 00000000001406e0
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.727687] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.728837] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.729992] Call Trace:
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.731193] ? amdgpu_cs_fence_to_handle_ioctl+0x1c0/0x1c0 [amdgpu]
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.732406] drm_ioctl_kernel+0x69/0xb0 [drm]
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.733626] drm_ioctl+0x2d2/0x390 [drm]
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.734883] ? amdgpu_cs_fence_to_handle_ioctl+0x1c0/0x1c0 [amdgpu]
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.736135] ? __do_fault+0x1e/0x70
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.737392] ? __handle_mm_fault+0x8ae/0x10f0
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.738665] ? apparmor_mmap_file+0x18/0x20
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.739980] amdgpu_drm_ioctl+0x4c/0x80 [amdgpu]
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.741277] do_vfs_ioctl+0x96/0x5b0
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.742582] ? handle_mm_fault+0xd3/0x1f0
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.743899] ? sched_clock+0x9/0x10
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.745224] SyS_ioctl+0x79/0x90
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.746553] ? vtime_user_exit+0x29/0x70
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.747897] do_syscall_64+0x6e/0x160
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.749247] entry_SYSCALL64_slow_path+0x25/0x25
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.750614] RIP: 0033:0x7f89f1fdff07
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.751987] RSP: 002b:00007ffd4c6262d8 EFLAGS: 00000202 ORIG_RAX: 0000000000000010
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.753407] RAX: ffffffffffffffda RBX: 0000000000000001 RCX: 00007f89f1fdff07
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.754847] RDX: 00007ffd4c6263a0 RSI: 00000000c0186452 RDI: 0000000000000005
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.756302] RBP: 00007ffd4c626310 R08: 0000000000000001 R09: 00007ffd4c62642c
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.757768] R10: 000000000000edf2 R11: 0000000000000202 R12: 00007ffd4c6264b0
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.759243] R13: 00000000000186a0 R14: 0000000000000000 R15: 00007ffd4c626700
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.760725] Code: ff ff ff e8 08 38 cd e6 eb e0 44 89 45 d4 44 89 c0 ba 01 00 00 00 48 c7 43 08 00 00 00 00 48 c7 43 10 00 00 00 00 89 13 89 43 04 <4b> 8b 04 c4 4c 63 78 58 eb a5 48 8b 4d b0 4c 8d 45 d4 ba 01 00
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.762416] RIP: amdgpu_cs_wait_fences_ioctl+0x20b/0x2e0 [amdgpu] RSP: ffffb178016e3cb0
> Nov 15 17:40:25 jenkins-MS-7984 kernel: [ 146.764058] CR2: ffff9b7916a1d8a0
>
> Change-Id: I60d90d13dda69cd8aa6396f0246379f8390e3fb1
> Signed-off-by: Roger He <Hongbo.He at amd.com>
> ---
> drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c | 6 +++---
> 1 file changed, 3 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c
> index ee77364..ad00f01 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c
> @@ -1503,10 +1503,10 @@ static int amdgpu_cs_wait_any_fence(struct amdgpu_device *adev,
> wait->out.status = (r > 0);
> wait->out.first_signaled = first;
>
> - if (array[first])
> - r = array[first]->error;
> - else
> + if (first == ~0)
> r = 0;
> + else
> + r = array[first]->error;
>
> err_free_fence_array:
> for (i = 0; i < fence_count; i++)
> --
> 2.7.4
>
> _______________________________________________
> amd-gfx mailing list
> amd-gfx at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx
More information about the amd-gfx
mailing list