Regression on linux-next (next-20240712)

David Hildenbrand david at redhat.com
Wed Jul 17 11:00:58 UTC 2024


On 16.07.24 07:37, Borah, Chaitanya Kumar wrote:
> Hello Pei,
> 
> Hope you are doing well. I am Chaitanya from the linux graphics team in Intel.
> 
> This mail is regarding a regression we are seeing in our CI runs[1] on linux-next repository.
> 
> In version next-20240712[2], we saw the following regression (currently being masked by another regression)
> 
> `````````````````````````````````````````````````````````````````````````````````
> <4>[   14.530533] ============================================
> <4>[   14.530533] WARNING: possible recursive locking detected
> <4>[   14.530534] 6.10.0-rc7-next-20240712-next-20240712-g3fe121b62282+ #1 Not tainted
> <4>[   14.530535] --------------------------------------------
> <4>[   14.530535] (direxec)/171 is trying to acquire lock:
> <4>[   14.530536] ffff8881010725d8 (&mm->mmap_lock){++++}-{3:3}, at: unmap_single_vma+0xea/0x170
> <4>[   14.530541]
>                    but task is already holding lock:
> <4>[   14.530542] ffff8881010725d8 (&mm->mmap_lock){++++}-{3:3}, at: exit_mmap+0x6a/0x450
> <4>[   14.530545]
>                    other info that might help us debug this:
> <4>[   14.530545]  Possible unsafe locking scenario:
> `````````````````````````````````````````````````````````````````````````````````
> The detailed log can be found in [3].
> 
> After bisecting the tree, the following patch [4] seems to be the first "bad"
> commit
> 
> `````````````````````````````````````````````````````````````````````````````````````````````````````````
> commit a13252049629a8225f38a9be7d8d4fc4ff5350e8
> Author: Pei Li <peili.dev at gmail.com>
> Date:   Wed Jul 10 22:13:17 2024 -0700
> 
>      mm: fix mmap_assert_locked() in follow_pte()
> 
> `````````````````````````````````````````````````````````````````````````````````````````````````````````
> 
> We also verified that if we revert the patch the issue is not seen.
> 
> Could you please check why the patch causes this regression and provide a fix if necessary?

This is known.

There is a discussion along the original patch [1] on how to do it
differently, but we'll likely tackle the problem another way [2]. So this
patch should be dropped for now, which I think has already happened because
I can no longer spot it in mm-unstable.

[1] https://lore.kernel.org/all/20240710-bug12-v1-1-0e5440f9b8d3@gmail.com/

[2] https://lkml.kernel.org/r/20240712144244.3090089-1-peterx@redhat.com

-- 
Cheers,

David / dhildenb


