glibc

lib/glibc

mirror of https://sourceware.org/git/glibc.git synced 2025-12-21 17:31:10 +03:00

Author	SHA1	Message	Date
Dev Jain	321e1fc73f	malloc: Enable 2MB THP by default on Aarch64 Linux supports multi-sized Transparent Huge Pages (mTHP). For the purpose of this patch description, we call the block size mapped by a non-last level pagetable level, the traditional THP size (2M for 4K basepage, 512M for 64K basepage). Linux now also supports intermediate THP sizes mapped by the last level pagetable - we call that the mTHP size. The support for mTHP in Linux has grown to be better and stable over time - applications can benefit from reduced page faults and reduced kernel memory management overhead, albeit at the cost of internal fragmentation. We have observed consistent performance boosts with mTHP with little variance. As a result, enable 2M THP by default on Aarch64. This enables THP even if user hasn't passed glibc.malloc.hugetlb=1. If user has passed it, we avoid making the system call to check the hugepage size from sysfs, and override it with the hardcoded 2MB. There are two additional benefits of this patch, if the transparent hugepage sysctl is set to madvise or always: 1) The THP size is now hardcoded to 2MB for Aarch64. This avoids a syscall for fetching the THP size from sysfs. 2) On 64K basepage size systems, the traditional THP size is 512M, which is unusable and impractical. We can instead benefit from the mTHP size of 2M. Apart from the usual benefit of THPs/mTHPs as described above, Aarch64 systems benefit from reduced TLB pressure on this mTHP size, commonly known as the "contpte" size. If the application takes a pagefault, and either the THP sysctl settings is "always", or the virtual memory area has been madvise(MADV_HUGEPAGE)'d along with sysctl being "madvise", then Linux will fault in a 2M mTHP, mapping contiguous pages into the pagetable, and painting the pagetable entries with the cont-bit. This bit is a hint to the hardware that the concerned pagetable entry maps a page which is part of a set of contiguous pages - the TLB then only remembers a single entry for this set of 2M/64K = 32 pages, because the physical address of any other page in this contiguous set is computable by the TLB cached physical address via a linear offset. Hence, what was only possible with the traditional THP size, is now possible with the mTHP size. We see a 6.25% performance improvement on SPEC. If the sysctl is set to never, no transparent hugepages will be created by the kernel. But, this patch still sets thp_pagesize = 2MB. The benefit is that on MORECORE() invocation, we extend the heap by 2MB instead of 4KB, potentially reducing the frequency of this syscall's invocation by 512x. Note that, there is no difference in cost between an sbrk(2M) and sbrk(4K); the kernel only does a virtual reservation and does not touch user physical memory. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-12-10 12:18:16 +00:00
Dev Jain	26e6e4d51e	malloc: Do not make out-of-bounds madvise call on non-aligned heap Currently, if the initial program break is not aligned to the system page size, then we align the pointer down to the page size. If there is a gap before the heap VMA, then such an adjustment means that the madvise() range now contains a gap. The behaviour in the upstream kernel is currently this: madvise() will return -ENOMEM, even though the operation will still succeed in the sense that the VM_HUGEPAGE flag will be set on the heap VMA. We must not depend on this behaviour - this is an internal kernel implementation, and earlier kernels may possibly abort the operation altogether. The other case is that there is no gap, and as a result we may end up setting the VM_HUGEPAGE flag on that other VMA too, which is an unnecessary side effect. Let us fix this by aligning the pointer up to the page size. We should also subtract the pointer difference from the size, because if we don't, since the pointer is now aligned up, the size may cross the heap VMA, thus leading to the same problem but at the other end. There is no need to check this new size against mp_.thp_pagesize to decide whether to make the madvise() call. The reason we make this check at the start of madvise_thp() is to check whether the size of the VMA is enough to map THPs into it. Since that check has passed, all that we need to ensure now is that q + size does not cross the heap VMA. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-12-10 12:18:16 +00:00
Wilco Dijkstra	7f670284d8	malloc: Use _int_free_chunk in tcache_thread_shutdown Directly call _int_free_chunk during tcache shutdown to avoid recursion. Calling __libc_free on a block from tcache gets flagged as a double free, and tcache_double_free_verify checks every tcache chunk (quadratic overhead). Reviewed-by: Arjun Shankar <arjun@redhat.com>	2025-11-20 12:28:46 +00:00
Justin King	56549264d1	malloc: add free_sized and free_aligned_sized from C23 Signed-off-by: Justin King <jcking@google.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-11-19 13:47:53 -03:00
Adhemerval Zanella	f91abbde02	malloc: Remove unused tcache_set_inactive clang warns that this function is not used. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2025-10-29 12:53:53 -03:00
Dev Jain	b2b4b46a52	malloc: fix large tcache code to check for exact size match The tcache is used for allocation only if an exact match is found. In the large tcache code added in commit `cbfd798810`, we currently extract a chunk of size greater than or equal to the size we need, but don't check strict equality. This patch fixes that behaviour. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-10-24 16:55:02 +00:00
DJ Delorie	2bf2188fae	malloc: avoid need for tcache == NULL checks Avoid needing to check for tcache == NULL by initializing it to a dummy read-only tcache structure. This dummy is all zeros, so logically it is both full (when you want to put) and empty (when you want to get). Also, there are two dummies, one used for "not yet initialized" and one for "tunables say we shouldn't have a tcache". The net result is twofold: 1. Checks for tcache == NULL may be removed from the fast path. Whether this makes the fast path faster when tcache is disabled is TBD, but the normal case is tcache enabled. 2. no memory for tcache is allocated if tunables disable caching. Co-authored-by: Florian Weimer <fweimer@redhat.com> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-10-21 16:51:03 -04:00
Adhemerval Zanella	41e27c400d	malloc: Use INT_ADD_OVERFLOW instead of __builtin_add_overflow_p clang does not support the __builtin__overflow_p builtins, on gcc the macros will call __builtin__overflow_p. Reviewed-by: Collin Funk <collin.funk1@gmail.com>	2025-10-20 11:33:54 -03:00
Wilco Dijkstra	e974b1b7eb	malloc: Cleanup _int_memalign Cleanup _int_memalign. Simplify the logic. Add a seperate check for mmap. Only release the tail chunk if it is at least MINSIZE. Use the new mmap abstractions. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-10-17 17:03:54 +00:00
Dev Jain	fa5d1b5419	malloc: Do not call madvise if oldsize >= THP size Linux handles virtual memory in Virtual Memory Areas (VMAs). The madvise(MADV_HUGEPAGE) call works on a VMA granularity, which sets the VM_HUGEPAGE flag on the VMA. If this VMA or a portion of it is mremapped to a different location, Linux will create a new VMA, which will have the same flags as the old one. This implies that the VM_HUGEPAGE flag will be retained. Therefore, if we can guarantee that the old VMA was marked with VM_HUGEPAGE, then there is no need to call madvise_thp() in mremap_chunk(). The old chunk comes from a heap or non-heap allocation, both of which have already been enlightened for THP. This implies that, if THP is on, and the size of the old chunk is greater than or equal to thp_pagesize, the VMA to which this chunk belongs to, has the VM_HUGEPAGE flag set. Hence in this case we can avoid invoking the madvise() syscall. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-10-08 12:59:30 +00:00
Wilco Dijkstra	88de32a070	malloc: Improve mmap interface Add mmap_set_chunk() to create a new chunk from an mmap block. Remove set_mmap_is_hp() since it is done inside mmap_set_chunk(). Rename prev_size_mmap() to mmap_base_offset(). Cleanup comments. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-10-08 12:59:30 +00:00
William Hunt	849a274531	malloc: Cleanup macros, asserts and sysmalloc_mmap_fallback Refactor malloc.c to remove dead code, create macros to abstract duplicated code, and cleanup sysmalloc_mmap_fallback to remove logic not related to the mmap call. Change the return type of mmap_base to uintptr_t since this allows using operations on the return value, and avoids casting in both calls in mremap_chunk and munmap_chunk. Cleanup sysmalloc_mmap_fallback. Remove unused parameters nb, oldsize and av. Remove redundant overflow check and instead use size_t for all parameters except extra_flags to prevent overflows. Move logic not concerned with the mmap call itself outside the function after both calls to sysmalloc_mmap_fallback are made; this means move code for naming the VMA and marking the arena being extended as non-contiguous to the calling code to be handled in the case that the mmap is successful. Calculate the fallback size from nb to avoid modifying size after it has been set for MORECORE. Remove unused noncontiguous macro. Remove redundant assert for checking unreachable option for global_max_fast. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-10-03 16:34:10 +00:00
Wilco Dijkstra	19442c052c	malloc: Cleanup libc_realloc Minor cleanup of libc_realloc: remove unnecessary special cases for mmap, move ar_ptr initialization, first check for oldmem == NULL. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-09-10 09:18:06 +00:00
Wilco Dijkstra	210ee29503	atomics: Remove unused atomics Remove all unused atomics. Replace uses of catomic_increment and catomic_decrement with atomic_fetch_add_relaxed which maps to a standard compiler builtin. Relaxed memory ordering is correct for simple counters since they only need atomicity. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-09-10 09:18:06 +00:00
Samuel Thibault	245ea60b0e	malloc: check "negative" tcache_key values by hand instead of undefined cases from casting uintptr_t into intptr_t.	2025-09-09 23:05:00 +02:00
Adhemerval Zanella	b9fe06a8a8	malloc: Fix Os build on some ABIs I have not checked with all versions for all ABIs, but I saw failures with gcc-14 on arm, alpha, hppa, i686, sparc, sh4, and microblaze. Reviewed-by: Collin Funk <collin.funk1@gmail.com>	2025-09-08 08:21:48 -03:00
Wilco Dijkstra	921e251e8f	malloc: Support hugepages in mremap_chunk Add mremap_chunk support for mmap()ed chunks using hugepages by accounting for their alignment, to prevent the mremap call failing in most cases where the size passed is not a hugepage size multiple. It also improves robustness for reallocating hugepages since mremap is much less likely to fail, so running out of memory when reallocating a larger size and having to copy the old contents after mremap fails is also less likely. To track whether an mmap()ed chunk uses hugepages, have a flag in the lowest bit of the mchunk_prev_size field which is set after a call to sysmalloc_mmap, and accessed later in mremap_chunk. Create macros for getting and setting this bit, and for mapping the bit off when accessing the field for mmap()ed chunks. Since the alignment cannot be lower than 8 bytes, this flag cannot affect the alignment data. Add malloc/tst-tcfree4-malloc-check to the tests-exclude-malloc-check list as malloc-check prevents the tcache from being used to store chunks. This test caused failures due to a bug in mem2chunk_check to be fixed in a later patch. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-08-27 13:07:09 +00:00
Wilco Dijkstra	614cfd0f8a	malloc: Change mmap chunk layout Change the mmap chunk layout to be identical to a normal chunk. This makes it safe for tcache to hold mmap chunks and simplifies size calculations in memsize and musable. Add mmap_base() and mmap_size() macros to simplify code. Reviewed-by: Cupertino Miranda <cupertino.miranda@oracle.com>	2025-08-27 11:41:58 +00:00
Samuel Thibault	8543577b04	malloc: Fix checking for small negative values of tcache_key tcache_key is unsigned so we should turn it explicitly to signed before taking its absolute value.	2025-08-10 23:45:35 +02:00
Samuel Thibault	2536c4f858	malloc: Make sure tcache_key is odd enough We want tcache_key not to be a commonly-occurring value in memory, so ensure a minimum amount of one and zero bits. And we need it non-zero, otherwise even if tcache_double_free_verify sets e->key to 0 before calling __libc_free, it gets called again by __libc_free, thus looping indefinitely. Fixes: `c968fe5062` ("malloc: Use tailcalls in __libc_free")	2025-08-10 09:44:08 +02:00
Wilco Dijkstra	a5e9269f51	malloc: Fix MALLOC_DEBUG MALLOC_DEBUG only works on locked arenas, so move the call to check_inuse_chunk from __libc_free() to _int_free_chunk(). Regress now passes if MALLOC_DEBUG is enabled. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-08-08 14:00:43 +00:00
Wilco Dijkstra	94ebcfc4f2	malloc: Remove use of __curbrk Remove an odd use of __curbrk and use MORECORE (0) instead. This fixes Hurd build since it doesn't define this symbol. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-08-08 13:59:31 +00:00
Wilco Dijkstra	7ab623afb9	Revert "Remove use of __curbrk." This reverts commit `1ee0b771a9`.	2025-08-04 17:31:56 +00:00
Wilco Dijkstra	91a7726374	Revert "Improve MALLOC_DEBUG" This reverts commit `4b3e65682d`.	2025-08-04 17:31:54 +00:00
Wilco Dijkstra	3191dda282	Revert "Use _int_free_chunk in tcache_thread_shutdown" This reverts commit `05ef6a4974`.	2025-08-04 17:31:49 +00:00
Wilco Dijkstra	1bf4a379e8	Revert "malloc: Cleanup libc_realloc" This reverts commit `dea1e52af3`.	2025-08-04 17:31:45 +00:00
Wilco Dijkstra	8c2b6e528d	Revert "Change mmap representation" This reverts commit `4b74591022`.	2025-08-04 17:31:40 +00:00
Wilco Dijkstra	1ee0b771a9	Remove use of __curbrk.	2025-08-04 17:13:55 +00:00
Wilco Dijkstra	4b3e65682d	Improve MALLOC_DEBUG	2025-08-04 17:13:55 +00:00
Wilco Dijkstra	05ef6a4974	Use _int_free_chunk in tcache_thread_shutdown	2025-08-04 17:13:55 +00:00
Wilco Dijkstra	dea1e52af3	malloc: Cleanup libc_realloc Minor cleanup of libc_realloc: remove unnecessary special cases for mmap, move ar_ptr initialization, first check for oldmem == NULL.	2025-08-04 17:13:55 +00:00
Wilco Dijkstra	4b74591022	Change mmap representation	2025-08-04 17:13:55 +00:00
Wilco Dijkstra	35a7a7ab99	malloc: Cleanup sysmalloc_mmap Cleanup sysmalloc_mmap - simplify padding since it is always a constant. Remove av parameter which is only used in do_check_chunk, but since it may be NULL for mmap, it will cause a crash in checking mode. Remove the odd check on mmap in do_check_chunk. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-08-02 15:21:16 +00:00
Wilco Dijkstra	b68b125ad1	malloc: Improve checked_request2size Change checked_request2size to return SIZE_MAX for huge inputs. This ensures large allocation requests stay large and can't be confused with a small allocation. As a result several existing checks against PTRDIFF_MAX become redundant. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-08-02 14:38:35 +00:00
Wilco Dijkstra	21fda179c2	malloc: Cleanup madvise defines Remove redundant ifdefs for madvise/THP. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-08-02 14:19:50 +00:00
Wilco Dijkstra	ad4caba414	malloc: Fix MAX_TCACHE_SMALL_SIZE MAX_TCACHE_SMALL_SIZE should use chunk size since it is used after checked_request2size. Increase limit of tcache_max_bytes by 1 since all comparisons use '<'. As a result, the last tcache entry is now used as expected. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-08-02 14:16:24 +00:00
William Hunt	9097cbf5d8	malloc: Enable THP always support on hugetlb tunable Enable support for THP always when glibc.malloc.hugetlb=1, as the tunable currently only gives explicit support in malloc for the THP madvise mode by aligning to a huge page size. Add a thp_mode parameter to mp_ and check in madvise_thp whether the system is using madvise mode, otherwise the `__madvise` call is useless. Set the thp_mode to be unsupported by default, but if the hugetlb tunable is set this updates thp_mode. Performance of xalancbmk improves by 4.9% on Neoverse V2 when THP always mode is set on the system and glibc.malloc.hugetlb=1. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-07-29 15:05:51 +00:00
Wilco Dijkstra	089b4fb90f	malloc: Remove redundant NULL check Remove a redundant NULL check from tcache_get_n. Reviewed-by: Cupertino Miranda <cupertino.miranda@oracle.com>	2025-07-29 14:11:58 +00:00
Cupertino Miranda	0263528f8d	malloc: fix definition for MAX_TCACHE_SMALL_SIZE Reviewed-by: Arjun Shankar <arjun@redhat.com>	2025-07-14 19:44:48 +02:00
Wilco Dijkstra	1061b75412	malloc: Cleanup tcache_init() Cleanup tcache_init() by using the new __libc_malloc2 interface. Reviewed-by: Cupertino Miranda <cupertino.miranda@oracle.com>	2025-06-26 15:08:17 +00:00
William Hunt	9a5a7613ac	malloc: replace instances of __builtin_expect with __glibc_unlikely Replaced all instances of __builtin_expect to __glibc_unlikely within malloc.c and malloc-debug.c. This improves the portability of glibc by avoiding calls to GNU C built-in functions. Since all the expected results from calls to __builtin_expect were 0, __glibc_likely was never used as a replacement. Multiple calls to __builtin_expect within a single if statement have been replaced with one call to __glibc_unlikely, which wraps every condition. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-06-26 15:07:53 +00:00
William Hunt	d1ad959b00	malloc: refactored aligned_OK and misaligned_chunk Renamed aligned_OK to misaligned_mem as to be similar to misaligned_chunk, and reversed any assertions using the macro. Made misaligned_chunk call misaligned_mem after chunk2mem rather than bitmasking with the malloc alignment itself, since misaligned_chunk is meant to test the data chunk itself rather than the header, and the compiler will optimise the addition so the ternary operator is not needed. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-06-26 14:57:53 +00:00
Wilco Dijkstra	ba32fd7d04	malloc: Cleanup _mid_memalign Remove unused 'address' parameter from _mid_memalign and callers. Fix off-by-one alignment calculation in __libc_pvalloc. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-06-18 13:37:00 +00:00
Cupertino Miranda	cbfd798810	malloc: add tcache support for large chunk caching Existing tcache implementation in glibc seems to focus in caching smaller data size allocations, limiting the size of the allocation to 1KB. This patch changes tcache implementation to allow to cache any chunk size allocations. The implementation adds extra bins (linked-lists) which store chunks with different ranges of allocation sizes. Bin selection is done in multiples in powers of 2 and chunks are inserted in growing size ordering within the bin. The last bin contains all other sizes of allocations. This patch although by default preserves the same implementation, limitting caches to 1KB chunks, it now allows to increase the max size for the cached chunks with the tunable glibc.malloc.tcache_max. It also now verifies if chunk was mmapped, in which case __libc_free will not add it to tcache. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-06-16 12:05:22 +00:00
Wilco Dijkstra	7e10e30e64	malloc: Count tcache entries downwards Currently tcache requires 2 global variable accesses to determine whether a block can be added to the tcache. Change the counts array to 'num_slots' to indicate the number of entries that could be added. If 'num_slots' reaches zero, no more blocks can be added. If the entries pointer is not NULL, at least one block is available for allocation. Now each tcache bin can support a different maximum number of entries, and they can be individually switched on or off (a zero initialized num_slots+entry means the tcache bin is not available for free or malloc). Reviewed-by: DJ Delorie <dj@redhat.com>	2025-06-03 17:16:39 +00:00
Wilco Dijkstra	36189c76fb	malloc: Improve performance of __libc_calloc Improve performance of __libc_calloc by splitting it into 2 parts: first handle the tcache fastpath, then do the rest in a separate tailcalled function. This results in significant performance gains since __libc_calloc doesn't need to setup a frame. On Neoverse V2, bench-calloc-simple improves by 5.0% overall. Bench-calloc-thread 1 improves by 24%. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-05-14 09:22:32 +00:00
Wilco Dijkstra	25d37948c9	malloc: Improve malloc initialization Move malloc initialization to __libc_early_init. Use a hidden __ptmalloc_init for initialization and a weak call to avoid pulling in the system malloc in a static binary. All previous initialization checks can now be removed. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2025-05-12 16:10:28 +00:00
David Lau	eff1f680cf	malloc: Improved double free detection in the tcache The previous double free detection did not account for an attacker to use a terminating null byte overflowing from the previous chunk to change the size of a memory chunk is being sorted into. So that the check in 'tcache_double_free_verify' would pass even though it is a double free. Solution: Let 'tcache_double_free_verify' iterate over all tcache entries to detect double frees. This patch only protects from buffer overflows by one byte. But I would argue that off by one errors are the most common errors to be made. Alternatives Considered: Store the size of a memory chunk in big endian and thus the chunk size would not get overwritten because entries in the tcache are not that big. Move the tcache_key before the actual memory chunk so that it does not have to be checked at all, this would work better in general but also it would increase the memory usage. Signed-off-by: David Lau <david.lau@fau.de> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2025-05-12 11:58:30 +00:00
Wilco Dijkstra	5d10174581	malloc: Inline tcache_try_malloc Inline tcache_try_malloc into calloc since it is the only caller. Also fix usize2tidx and use it in __libc_malloc, __libc_calloc and _mid_memalign. The result is simpler, cleaner code. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-05-01 20:01:53 +00:00
Cupertino Miranda	1c9ac027a5	malloc: move tcache_init out of hot tcache paths This patch moves any calls of tcache_init away after tcache hot paths. Since there is no reason to initialize tcaches in the hot path and since we need to be able to check tcache != NULL in any case, because of tcache_thread_shutdown function, moving tcache_init away from hot path can only be beneficial. The patch also removes the initialization of tcaches within the __libc_free call. It only makes sense to initialize tcaches for the thread after it calls one of the allocation functions. Also the patch removes the save/restore of errno from tcache_init code, as it is no longer needed.	2025-04-16 13:09:16 +00:00

1 2 3 4 5 ...

542 Commits