[Mesa-dev] [PATCH 2/2] util/ralloc: Make sizeof(linear_header) a multiple of 8
Matt Turner
mattst88 at gmail.com
Tue Nov 13 05:02:51 UTC 2018
On Mon, Nov 12, 2018 at 3:07 PM Eric Anholt <eric at anholt.net> wrote:
>
> Matt Turner <mattst88 at gmail.com> writes:
>
> > Prior to this patch sizeof(linear_header) was 20 bytes in a
> > non-debug build on 32-bit platforms. We do some pointer arithmetic to
> > calculate the next available location with
> >
> > ptr = (linear_size_chunk *)((char *)&latest[1] + latest->offset);
> >
> > in linear_alloc_child(). The &latest[1] adds 20 bytes, so an allocation
> > would only be 4-byte aligned.
> >
> > On 32-bit SPARC a 'sttw' instruction (which stores a consecutive pair of
> > 4-byte registers to memory) requires an 8-byte aligned address. Such an
> > instruction is used to store to an 8-byte integer type, like intmax_t
> > which is used in glcpp's expression_value_t struct.
> >
> > As a result of the 4-byte alignment returned by linear_alloc_child() we
> > would generate a SIGBUS (unaligned exception) on SPARC.
> >
> > According to the GNU libc manual malloc() always returns memory that has
> > at least an alignment of 8-bytes [1]. I think our allocator should do
> > the same.
> >
> > So, simple fix with two parts:
> >
> > (1) Increase SUBALLOC_ALIGNMENT to 8 unconditionally.
> > (2) Mark linear_header with an aligned attribute, which will cause
> > its sizeof to be rounded up to that alignment. (We already do
> > this for ralloc_header)
> >
> > With this done, all Mesa's unit tests now pass on SPARC.
> >
> > [1] https://www.gnu.org/software/libc/manual/html_node/Aligned-Memory-Blocks.html
> >
> > Fixes: 47e17586924f ("glcpp: use the linear allocator for most objects")
> > Bug: https://bugs.gentoo.org/636326
> > ---
> > src/util/ralloc.c | 14 ++++++++++++--
> > 1 file changed, 12 insertions(+), 2 deletions(-)
> >
> > diff --git a/src/util/ralloc.c b/src/util/ralloc.c
> > index 745b4cf1226..fc35661996d 100644
> > --- a/src/util/ralloc.c
> > +++ b/src/util/ralloc.c
> > @@ -552,10 +552,18 @@ ralloc_vasprintf_rewrite_tail(char **str, size_t *start, const char *fmt,
> > */
> >
> > #define MIN_LINEAR_BUFSIZE 2048
> > -#define SUBALLOC_ALIGNMENT sizeof(uintptr_t)
> > +#define SUBALLOC_ALIGNMENT 8
> > #define LMAGIC 0x87b9c7d3
> >
> > -struct linear_header {
> > +struct
> > +#ifdef _MSC_VER
> > + __declspec(align(8))
> > +#elif defined(__LP64__)
> > + __attribute__((aligned(16)))
> > +#else
> > + __attribute__((aligned(8)))
> > +#endif
> > + linear_header {
> > #ifndef NDEBUG
> > unsigned magic; /* for debugging */
> > #endif
> > @@ -647,6 +655,8 @@ linear_alloc_child(void *parent, unsigned size)
> > ptr = (linear_size_chunk *)((char*)&latest[1] + latest->offset);
> > ptr->size = size;
> > latest->offset += full_size;
> > +
> > + assert((uintptr_t)&ptr[1] % SUBALLOC_ALIGNMENT == 0);
> > return &ptr[1];
> > }
>
> These patches are:
>
> Reviewed-by: Eric Anholt <eric at anholt.net>
Thanks a bunch! I hope this is useful for you on arm as well.
> However, shouldn't we also bump SUBALLOC_ALIGNMENT to 16 on LP64, too,
> if that's what glibc is doing for malloc?
16-byte alignment is necessary for SSE aligned vector load/store
instructions. I suppose we're not getting any vectorized SSE
load/store instructions to memory allocated by linear_alloc_* and
that's why we haven't seen problems?
Seems reasonable to bump it to 16-bytes.
More information about the mesa-dev
mailing list