[Mesa-dev] [PATCH 10/11] clover: Add function for building a clover::module for non-TGSI targets
Tom Stellard
tstellar at gmail.com
Sun May 13 12:37:59 PDT 2012
On Sun, May 13, 2012 at 04:10:50PM +0200, Francisco Jerez wrote:
> Tom Stellard <tstellar at gmail.com> writes:
>
> > On Sun, May 13, 2012 at 12:40:43AM +0200, Francisco Jerez wrote:
> >>[...]
> >> I can think of two different ways this could work (your solution would
> >> be somewhere in between):
> >>
> >> - The r600g LLVM back-end could support some well-defined output object
> >> format (e.g. the one implemented by the clover::module classes), like
> >> most of the other LLVM back-ends. You probably want to do this
> >> anyway if you want to be able to compile CL programs off-line. If
> >> you do it this way, the state tracker will just call clang to
> >> completion using the appropriate target and pass the generated
> >> machine code to the pipe driver.
> >>
> >> If you think supporting different hardware versions with different
> >> ISAs would be a problem under this scheme, we could have another
> >> compute cap that would determine a specific variant of the ISA.
> >>
> >
> > I think this is the most practical of these two options. However, the
> > biggest draw back of this solution is that if a target uses a backend that
> > is not a part of the LLVM tree (the r600 backend is not in the LLVM tree,
> > but I'm working on getting it in), the backend would have to be included
> > as a part of clover and drivers using LLVM for other state tracker would
> > end up with one copy of their LLVM backend in clover and another in the
> > driver.
>
> Apparently it's already standard practice in LLVM to build back-ends as
> separate object files. I don't see why you couldn't build the r600
> back-end as a shared library. There's no reason for it to be duplicated
> in the source tree or the address space of the process it's running in.
>
I don't think this will work if clover and the back-end link
statically to the LLVM libraries, which is what Mesa does by default.
LLVM relies on global pointers to identify float types within
a module, so it doesn't work to pass a module between shared libraries that
have each linked separately with LLVM, which is what will be happening
if the Clang frontend and the backend live in separate shared objects.
> > I guess this might be able to work, but I'm not really sure why doing
> > it this way would be better than what is implemented in this patch.
> >
> I just think it would be useful by itself to have the r600 back-end able
> to emit machine code in a specific object file format, and at the same
> time, by doing so you avoid driver-specific code in the state tracker
> that does one thing or another depending on the hardware it's running
> on, which seems like a layering violation to me.
State trackers already use caps to do different things depending on
what hardware it's running. So, I'm not really sure why using
PIPE_SHADER_CAP_PREFFERED_IR is any different.
>
> >> - Another option would be to forget about driver-specific IRs in the
> >> compute API. The pipe driver would have the choice between TGSI and
> >> LLVM, if it wants LLVM, the state tracker would do roughly what
> >> you're doing here using some sort of "identity" LLVM target that
> >> would do nothing but describing the peculiarities of the hardware
> >> (e.g. endianness, widths of the supported primitive types), which in
> >> turn would be queried from the pipe driver using compute caps.
> >>
> >
> > This is the way I thought it would work initially, but the only way to
> > describe a target to Clang is by using the target triple, so passing a
> > list of caps into Clang is not going to work.
> >
> Why not? You're in control of the back-end, maybe I'm missing something
> but I don't see why the state tracker couldn't provide all the required
> parameters to the back-end as it's instantiated.
>
Clang doesn't consult the backend when it converts OpenCL C to LLVM IR,
so passing extra information to the backend won't help. We need to pass
target information directly to Clang and the only way to do this is via
the target triple.
> If this is impossible for some reason, can't we encode these parameters
> in the processor variant part of the target triple? Aren't these
> parameters supposed to be encoded as a target description string at some
> point anyway?
>
The target description string is one of the attributes on the TargetInfo
class in Clang, but there are others as well. Take a look at the
Targets.cpp file in Clang and you'll see what I'm talking about:
http://clang.llvm.org/doxygen/Targets_8cpp-source.html
> >> Personally I think the latter would be closer to ideal, but I guess it
> >> would also involve more work...
> >
> > My preference is to keep this patch as is. At this point, I'm not
> > really sure why the proposed alternate solutions would be better than
> > the current implementation, and I think if we want to support out of
> > tree backends, we will need to at least have the option of doing it this
> > way.
> >
> What bothers me of this solution is that you're pumping bytecode using a
> generic representation, but in a non-generic way, that isn't going to be
> useful for drivers other than r600 that want compute programs in the
> same representation -- at least not without a number of driver-specific
> ad-hoc changes in both LLVM and the CL state tracker.
>
This is exactly what other state trackers do. They produce a generic
representation (TGSI) in a non-generic way. For example, st/mesa might
produce a TGSI fragment shader that uses indirect addressing for r600g,
but the TGSI representation of the shader is not generic. That TGSI
program will not run on r300g, because that driver does not support
indirect addressing.
> I'm fine with the CL state tracker giving out code in LLVM IR, but, if
> it does, I really think it makes no sense for it to even know it's
> running on r600.
>
Even though LLVM is a generic representation it's not always possible
to produce an LLVM bitcode program that is guaranteed to run on every
target, so if we use LLVM IR it will have to some knowledge about that
target on which it will executed.
> > Even though I prefer this solution, ultimately I want to come up with
> > something that works for everyone even if it isn't exactly what I
> > proposed. So, I hope we will be able to work something out.
> >
> > -Tom
-Tom
More information about the mesa-dev
mailing list