[bug report] accel/habanalabs: enforce release order of compute device and dma-buf
Dan Carpenter
dan.carpenter at linaro.org
Thu Jul 25 22:33:12 UTC 2024
On Thu, Jul 25, 2024 at 08:21:51AM +0000, Tomer Tayar wrote:
> On 24/07/2024 19:08, Dan Carpenter wrote:
> > Hello Tomer Tayar,
> >
> > Commit 09524eb8824e ("accel/habanalabs: enforce release order of
> > compute device and dma-buf") from Jan 22, 2023 (linux-next), leads to
> > the following Smatch static checker warning:
> >
> > drivers/accel/habanalabs/common/memory.c:1844 hl_release_dmabuf()
> > error: dereferencing freed memory 'ctx' (line 1841)
> >
> > drivers/accel/habanalabs/common/memory.c
> > 1827 static void hl_release_dmabuf(struct dma_buf *dmabuf)
> > 1828 {
> > 1829 struct hl_dmabuf_priv *hl_dmabuf = dmabuf->priv;
> > 1830 struct hl_ctx *ctx;
> > 1831
> > 1832 if (!hl_dmabuf)
> > 1833 return;
> > 1834
> > 1835 ctx = hl_dmabuf->ctx;
> > 1836
> > 1837 if (hl_dmabuf->memhash_hnode)
> > 1838 memhash_node_export_put(ctx, hl_dmabuf->memhash_hnode);
> > 1839
> > 1840 atomic_dec(&ctx->hdev->dmabuf_export_cnt);
> > 1841 hl_ctx_put(ctx);
> > ^^^
> > This will free ctx on the last reference
> >
> > 1842
> > 1843 /* Paired with get_file() in export_dmabuf() */
> > --> 1844 fput(ctx->hpriv->file_priv->filp);
> > ^^^
> > Potential use after free
>
> Thanks for notifying us about this warning.
>
> Actually, because of this commit, the call to hl_ctx_put() here cannot
> be last.
> The release of the device file has another reference decrement [
> hl_device_release() -> hl_ctx_mgr_fini() ], and this change prevents
> that release as long as a dma-buf object is alive.
Thanks for looking at this. To be honest, I'm just going to take this
on trust. ;) I looked at the code but refcounting is always a bit
tricky.
>
> However, I will revise the function to get a pointer to
> 'ctx->hpriv->file_priv->filp' before calling hl_ctx_put(), so we won't
> have the warning.
Please, don't do things just to make the static checker happy. These
refcounted use after free warnings are prone to false positives. I try
to do some sanity checking but I'm not a domain expert in this subsystem.
So, you know, just look at the warning and ignore it if it's wrong.
These warnings are a one time email. Everyone who works on the kernel
is really good about fixing bugs so we assume all old warnings are false
positives. Plus if they have questions they can search lore for this
email thread.
regards,
dan carpenter
More information about the dri-devel
mailing list