[RFC PATCH 0/3] gpu: nova-core: add basic timer subdevice implementation
Alexandre Courbot
acourbot at nvidia.com
Mon Feb 17 14:04:45 UTC 2025
Hi everyone,
This short RFC is based on top of Danilo's initial driver stub series
[1] and has for goal to initiate discussions and hopefully some design
decisions using the simplest subdevice of the GPU (the timer) as an
example, before implementing more devices allowing the GPU
initialization sequence to progress (Falcon being the logical next step
so we can get the GSP rolling).
It is kept simple and short for that purpose, and to avoid bumping into
a wall with much more device code because my assumptions were incorrect.
This is my first time trying to write Rust kernel code, and some of my
questions below are probably due to me not understanding yet how to use
the core kernel interfaces. So before going further I thought it would
make sense to raise the most obvious questions that came to my mind
while writing this draft:
- Where and how to store subdevices. The timer device is currently a
direct member of the GPU structure. It might work for GSP devices
which are IIUC supposed to have at least a few fixed devices required
to bring the GSP up ; but as a general rule this probably won't scale
as not all subdevices are present on all GPU variants, or in the same
numbers. So we will probably need to find an equivalent to the
`subdev` linked list in Nouveau.
- BAR sharing between subdevices. Right now each subdevice gets access
to the full BAR range. I am wondering whether we could not split it
into the relevant slices for each-subdevice, and transfer ownership of
each slice to the device that is supposed to use it. That way each
register would have a single owner, which is arguably safer - but
maybe not as flexible as we will need down the road?
- On a related note, since the BAR is behind a Devres its availability
must first be secured before any hardware access using try_access().
Doing this on a per-register or per-operation basis looks overkill, so
all methods that access the BAR take a reference to it, allowing to
call try_access() from the highest-level caller and thus reducing the
number of times this needs to be performed. Doing so comes at the cost
of an extra argument to most subdevice methods ; but also with the
benefit that we don't need to put the BAR behind another Arc and share
it across all subdevices. I don't know which design is better here,
and input would be very welcome.
- We will probably need sometime like a `Subdevice` trait or something
down the road, but I'll wait until we have more than one subdevice to
think about it.
The first 2 patches are small additions to the core Rust modules, that
the following patches make use of and which might be useful for other
drivers as well. The last patch is the naive implementation of the timer
device. I don't expect it to stay this way at all, so please point out
all the deficiencies in this very early code! :)
[1] https://lore.kernel.org/nouveau/20250209173048.17398-1-dakr@kernel.org/
Signed-off-by: Alexandre Courbot <acourbot at nvidia.com>
---
Alexandre Courbot (3):
rust: add useful ops for u64
rust: make ETIMEDOUT error available
gpu: nova-core: add basic timer device
drivers/gpu/nova-core/driver.rs | 4 +-
drivers/gpu/nova-core/gpu.rs | 35 ++++++++++++++-
drivers/gpu/nova-core/nova_core.rs | 1 +
drivers/gpu/nova-core/regs.rs | 43 ++++++++++++++++++
drivers/gpu/nova-core/timer.rs | 91 ++++++++++++++++++++++++++++++++++++++
rust/kernel/error.rs | 1 +
rust/kernel/lib.rs | 1 +
rust/kernel/num.rs | 32 ++++++++++++++
8 files changed, 206 insertions(+), 2 deletions(-)
---
base-commit: 6484e46f33eac8dd42aa36fa56b51d8daa5ae1c1
change-id: 20250216-nova_timer-c69430184f54
Best regards,
--
Alexandre Courbot <acourbot at nvidia.com>
More information about the Nouveau
mailing list