Oops in 4.16 kernel amdgpu driver

sunnanyong sunnanyong at huawei.com
Mon Apr 9 02:31:11 UTC 2018


Hi All ,
      Sometimes it happens Oops in linux kernel 4.16.0 AMDGPU driver, and it nerver happen in 4.15 kernel.
Software environment:Ubuntu17.10 + newest stable kernel 4.16.0 + mesa v17.2.8
GPU: Radeon Pro WX5100      CPU: arm64
Here is the calltrace:
.
Apr  8 14:56:01 ubuntu kernel: [30040.729903] ------------[ cut here ]------------
Apr  8 14:56:01 ubuntu kernel: [30040.729914] kernel BUG at drivers/dma-buf/reservation.c:234!
Apr  8 14:56:01 ubuntu kernel: [30040.729926] Internal error: Oops - BUG: 0 [#1] SMP
Apr  8 14:56:01 ubuntu kernel: [30040.729932] Modules linked in: xt_nat xt_tcpudp veth ipt_MASQUERADE nf_nat_masquerade_ipv4 xfrm_user xfrm_algo iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter xt_conntrack nf_nat nf_conntrack br_netfilter bridge stp llc overlay snd_hda_codec_hdmi binfmt_misc nls_iso8859_1 snd_hda_intel snd_hda_codec joydev input_leds snd_hda_core snd_hwdep snd_pcm snd_timer snd soundcore ipmi_ssif shpchp ipmi_si ipmi_devintf ipmi_msghandler ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_decompress zstd_compress xxhash raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 multipath linear dm_mirror dm_region_hash dm_log raid0 ses enclosure hid_generic
Apr  8 14:56:01 ubuntu kernel: [30040.730037]  usbhid hid marvell amdgpu chash gpu_sched hibmc_drm i2c_algo_bit aes_ce_blk ttm aes_ce_cipher crc32_ce crct10dif_ce drm_kms_helper ghash_ce sha2_ce syscopyarea sha256_arm64 sysfillrect sysimgblt sha1_ce fb_sys_fops drm mpt3sas hisi_sas_v2_hw hisi_sas_main raid_class libsas ehci_platform hns_dsaf scsi_transport_sas hns_enet_drv hns_mdio hnae aes_neon_bs aes_neon_blk crypto_simd cryptd aes_arm64
Apr  8 14:56:01 ubuntu kernel: [30040.730094] CPU: 41 PID: 59025 Comm: amdgpu_cs:0 Tainted: G        W        4.16.0 #2
Apr  8 14:56:01 ubuntu kernel: [30040.730097] Hardware name: Huawei Taishan 2280 /BC11SPCD, BIOS 1.37 11/21/2017
Apr  8 14:56:01 ubuntu kernel: [30040.730100] pstate: 60400005 (nZCv daif +PAN -UAO)
Apr  8 14:56:01 ubuntu kernel: [30040.730118] pc : reservation_object_add_shared_fence+0x348/0x3b8
Apr  8 14:56:01 ubuntu kernel: [30040.730147] lr : ttm_eu_fence_buffer_objects+0x5c/0xd8 [ttm]
Apr  8 14:56:01 ubuntu kernel: [30040.730148] sp : ffff00003aae39d0
Apr  8 14:56:01 ubuntu kernel: [30040.730150] x29: ffff00003aae39d0 x28: 0000000000000000
Apr  8 14:56:01 ubuntu kernel: [30040.730154] x27: ffff809c85067c60 x26: ffff80af5e89abb8
Apr  8 14:56:01 ubuntu kernel: [30040.730158] x25: ffff809c92bb5060 x24: ffff802e08141220
Apr  8 14:56:01 ubuntu kernel: [30040.730162] x23: ffff802fb55ea000 x22: ffff809c92bb5060
Apr  8 14:56:01 ubuntu kernel: [30040.730165] x21: 0000000000000000 x20: ffff80ad27471c00
Apr  8 14:56:01 ubuntu kernel: [30040.730169] x19: ffff00003aae3b70 x18: 0000fffd38040948
Apr  8 14:56:01 ubuntu kernel: [30040.730172] x17: 0000000000000000 x16: ffff809e434b8f00
Apr  8 14:56:01 ubuntu kernel: [30040.730176] x15: 0000fffd38040940 x14: ffff809f97332600
Apr  8 14:56:01 ubuntu kernel: [30040.730179] x13: ffff809c92bb4e60 x12: 0000000000000016
Apr  8 14:56:01 ubuntu kernel: [30040.730183] x11: 0000000000000000 x10: 0000000000000040
Apr  8 14:56:01 ubuntu kernel: [30040.730186] x9 : 0000000000000000 x8 : ffff809c92bb5100
Apr  8 14:56:01 ubuntu kernel: [30040.730190] x7 : 00000000000000b8 x6 : 0000000000000001
Apr  8 14:56:01 ubuntu kernel: [30040.730193] x5 : 0000000000000003 x4 : 0000000000000002
Apr  8 14:56:01 ubuntu kernel: [30040.730196] x3 : ffff809b77ead88c x2 : 0000000000000001
Apr  8 14:56:01 ubuntu kernel: [30040.730200] x1 : 0000000000000008 x0 : 0000000000000008
Apr  8 14:56:01 ubuntu kernel: [30040.730205] Process amdgpu_cs:0 (pid: 59025, stack limit = 0x00000000469f89e7)
Apr  8 14:56:01 ubuntu kernel: [30040.730208] Call trace:
Apr  8 14:56:01 ubuntu kernel: [30040.730212]  reservation_object_add_shared_fence+0x348/0x3b8
Apr  8 14:56:01 ubuntu kernel: [30040.730225]  ttm_eu_fence_buffer_objects+0x5c/0xd8 [ttm]
Apr  8 14:56:01 ubuntu kernel: [30040.730482]  amdgpu_cs_ioctl+0x14f8/0x19f8 [amdgpu]
Apr  8 14:56:01 ubuntu kernel: [30040.730533]  drm_ioctl_kernel+0x70/0xd8 [drm]
Apr  8 14:56:01 ubuntu kernel: [30040.730578]  drm_ioctl+0x1d4/0x428 [drm]
Apr  8 14:56:01 ubuntu kernel: [30040.730771]  amdgpu_drm_ioctl+0x58/0x98 [amdgpu]
Apr  8 14:56:01 ubuntu kernel: [30040.730779]  do_vfs_ioctl+0xc4/0xb50
Apr  8 14:56:01 ubuntu kernel: [30040.730782]  SyS_ioctl+0x8c/0xa8
Apr  8 14:56:01 ubuntu kernel: [30040.730787]  el0_svc_naked+0x30/0x34
Apr  8 14:56:01 ubuntu kernel: [30040.730793] Code: 17ffff6e f9000bb3 a902dfb6 f90027ba (d4210000)
Apr  8 14:56:01 ubuntu kernel: [30040.730797] ---[ end trace 07398b112d335fc8 ]---

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/amd-gfx/attachments/20180409/b6564a95/attachment-0001.html>


More information about the amd-gfx mailing list