The current implementation relies on setting the rounding mode
(FE_TOWARDZERO) around different calculations to obtain correctly
rounded results.  On most CPUs this adds significant performance
overhead: getting/setting the floating-point status is typically a
slow instruction, it may force a pipeline flush, and it blocks some
compiler assumptions/optimizations across the rounding-mode change.
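For illustration only, a minimal sketch of the kind of pattern being
removed (not the actual glibc code; compute_toward_zero and op are
hypothetical names), assuming <fenv.h> rounding-mode control:

  #include <fenv.h>

  /* Hypothetical sketch: temporarily force FE_TOWARDZERO around a
     computation.  The fegetround/fesetround pair is what is typically
     slow and what prevents the compiler from optimizing across the
     rounding-mode change.  */
  static double
  compute_toward_zero (double (*op) (double), double x)
  {
    int saved = fegetround ();
    fesetround (FE_TOWARDZERO);
    double r = op (x);
    fesetround (saved);
    return r;
  }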
The original implementation adds tests to handle underflow in corner
cases, whereas this implementation uses a different strategy: it
inspects both the mantissa and the result to determine whether the
result can be affected by double rounding.
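As a rough illustration of that kind of check (a sketch under the
assumption that a binary32 result is obtained by rounding a binary64
intermediate; double_rounding_possible is a hypothetical helper, not
the actual glibc code):

  #include <stdint.h>
  #include <string.h>

  /* For a binary64 value y with a normal binary32 result, double
     rounding can only change the final answer when the 29 mantissa
     bits below binary32 precision form the halfway pattern 0x10000000
     (subnormal results need separate handling).  */
  static inline int
  double_rounding_possible (double y)
  {
    uint64_t u;
    memcpy (&u, &y, sizeof u);
    return (u & 0x1fffffff) == 0x10000000;
  }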
I tested this implementation on various targets (x86_64, i686, arm,
aarch64, powerpc), including some runs with the relevant compiler
instructions manually disabled.
Performance-wise, it shows large improvements:
reciprocal-throughput   master  patched  improvement
x86_64 [1]               58.09     7.96        7.33x
i686 [1]                279.41    16.97       16.46x
aarch64 [2]              26.09     4.10        6.35x
armhf [2]                30.25     4.20        7.18x
powerpc [3]               9.46     1.46        6.45x

latency                 master  patched  improvement
x86_64                   64.50    14.25        4.53x
i686                    304.39    61.04        4.99x
aarch64                  27.71     5.74        4.82x
armhf                    33.46     7.34        4.55x
powerpc                  10.96     2.65        4.13x
Checked on x86_64-linux-gnu and i686-linux-gnu with --disable-multi-arch,
and on arm-linux-gnueabihf.
[1] gcc 15.2.1, Zen3
[2] gcc 15.2.1, Neoverse N1
[3] gcc 15.2.1, POWER10
Signed-off-by: Szabolcs Nagy <nsz@gcc.gnu.org>
Co-authored-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Co-authored-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>
Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>