lib/zstd

mirror of https://github.com/facebook/zstd.git synced 2025-07-28 00:01:53 +03:00

Author	SHA1	Message	Date
Dominik Loidolt	4be08ba122	fuzz: Fix FUZZ_malloc_rand() to return non-NULL for zero-size allocations The FUZZ_malloc_rand() function was incorrectly always returning NULL for zero-size allocations. The random offset generated by FUZZ_dataProducer_int32Range() was not being added to the pointer variable, causing the function to always return (void *)0.	2025-06-05 17:28:30 +02:00
Yann Collet	04a2a0219c	update type names naming convention: Type names should start with a Capital letter (after the prefix)	2024-12-29 14:25:33 -08:00
Yann Collet	72ce56b527	fixed another invalid scenario compressSequencesAndLiterals() doesn't support sequence validation	2024-12-23 21:15:50 -08:00
Yann Collet	7b294caf46	add one valid test case ZSTD_compressSequencesAndLiterals() may return a specific error code when data to compress is non-compressible.	2024-12-23 19:43:17 -08:00
Yann Collet	f8725e80cc	added fuzzer test for compressSequencesAndLiterals() piggy-backing onto existing compressSequences() fuzzer test	2024-12-23 18:42:51 -08:00
Yann Collet	5164d44dab	change advanced parameter name: ZSTD_c_repcodeResolution and updated its documentation. Note: older name ZSTD_c_searchForExternalRepcodes remains supported via #define	2024-12-20 10:36:59 -08:00
Yann Collet	c97522f7fb	codemod: ZSTD_sequenceFormat_e -> ZSTD_SequenceFormat_e since it's a type name. Note: in contrast with previous names, this one is on the Public API side. So there is a #define, so that existing programs using ZSTD_sequenceFormat_e still work.	2024-12-20 10:36:56 -08:00
Yann Collet	bbaba45589	change experimental parameter name from ZSTD_c_useBlockSplitter to ZSTD_c_splitAfterSequences.	2024-10-31 13:43:40 -07:00
Yann Collet	41d870fbbf	updated regression tests results	2024-10-17 11:06:26 -07:00
Quentin Boswank	f19c98228f	Fix $filter and Msys/Cygwin - switched the pattern and input of $filter into the right places - added pattern wildcard to MSYS_NT & CYGWIN_NT as they change with Windows versions - correctly identify MSYS2, even in an env like MINGW64	2024-06-05 18:37:27 +02:00
Nick Terrell	81a5e5d438	[fuzz] Turn off -Werror by default This was causing OSS-Fuzz errors, due to compiler differences. * Fix the issue * Also turn off -Werror so we don't fail fuzzer builds for warnings * Turn on -Werror in our CI	2024-03-26 16:34:36 -07:00
Elliot Gorokhovsky	dc1f7b560b	fix -Werror=pointer-arith in fuzzers (#3983)	2024-03-21 15:16:38 -04:00
Yann Collet	6679d0ca7b	Merge pull request #3982 from embg/fuzzer_readme Document the process for adding a new fuzzer	2024-03-21 11:10:03 -07:00
Yann Collet	1d3f664fce	Merge pull request #3979 from yoniko/Werror-fuzz Fail on errors when building fuzzers	2024-03-21 10:41:34 -07:00
Nick Terrell	731f4b70fc	Fix & fuzz ZSTD_generateSequences This function was seriously flawed: * It didn't do output bounds checks * It produced invalid sequences when an uncompressed or RLE block was emitted * It produced invalid sequences when the block splitter was enabled * It produced invalid sequences when ZSTD_c_targetCBlockSize was enabled I've attempted to fix these issues, but this function is just a bad idea, so I've marked it as deprecated and unsafe. We should replace it with `ZSTD_extractSequences()` which operates on a compressed frame.	2024-03-21 07:18:05 -07:00
Elliot Gorokhovsky	741b87bbe1	Fuzzing and bugfixes for magicless-format decoding (#3976) * fuzzing and bugfixes for magicless format * reset dctx before each decompression * do not memcmp empty buffers * nit: decompressor errata	2024-03-20 19:22:34 -04:00
Elliot Gorokhovsky	f62b2663b9	Add docs on how to add a new fuzzer	2024-03-19 14:05:23 -07:00
Yonatan Komornik	3487a60950	Fail on errors when building fuzzers Fails on errors when building fuzzers with `fuzz.py` (adds `Werror`). Currently allows `declaration-after-statement`, `c++-compat` and `deprecated` as they are abundant in code (some fixes to `declaration-after-statement` are presented in this commit).	2024-03-18 15:51:28 -07:00
Yonatan Komornik	6a0052a409	Fix bugs in simple decompression fuzzer (#3978) Fixes 2 issues in `simple_decompress.c`: 1. Wrong type used for storing the results of `ZSTD_findDecompressedSize` resulting in never matching to `ZSTD_CONTENTSIZE_ERROR` or `ZSTD_CONTENTSIZE_UNKNOWN`. 2. Experimental API is used (`ZSTD_findDecompressedSize`) without defining `ZSTD_STATIC_LINKING_ONLY`.	2024-03-18 15:36:40 -07:00
Elliot Gorokhovsky	f65b9e27ce	Exercise ZSTD_findDecompressedSize() in the simple decompression fuzzer (#3959) * Improve decompression fuzzer * Fix legacy frame header fuzzer crash, add unit test	2024-03-12 17:07:06 -04:00
Yann Collet	e385c3dd46	Merge pull request #3753 from facebook/make2 minor Makefile refactoring	2024-03-03 19:13:00 -08:00
Yann Collet	6719794379	fixed some regressionTests but not all	2024-02-23 18:48:29 -08:00
Yann Collet	695d154cac	fuzz: control debuglevel from Makefile and make the compilation faster	2024-02-08 16:23:52 -08:00
Yann Collet	c1e588fcb4	Merge pull request #3771 from DimitriPapadopoulos/codespell Fix new typos found by codespell	2023-10-07 19:29:41 -07:00
Nick Terrell	43118da8a7	Stop suppressing pointer-overflow UBSAN errors * Remove all pointer-overflow suppressions from our UBSAN builds/tests. * Add `ZSTD_ALLOW_POINTER_OVERFLOW_ATTR` macro to suppress pointer-overflow at a per-function level. This is a superior approach because it also applies to users who build zstd with UBSAN. * Add `ZSTD_wrappedPtr{Diff,Add,Sub}()` that use these suppressions. The end goal is to only tag these functions with `ZSTD_ALLOW_POINTER_OVERFLOW`. But we can start by annotating functions that rely on pointer overflow, and gradually transition to using these. * Add `ZSTD_maybeNullPtrAdd()` to simplify pointer addition when the pointer may be `NULL`. * Fix all the fuzzer issues that came up. I'm sure there will be a lot more, but these are the ones that came up within a few minutes of running the fuzzers, and while running GitHub CI.	2023-09-28 17:35:05 -04:00
Dimitri Papadopoulos	fe34776c20	Fix new typos found by codespell	2023-09-23 18:56:01 +02:00
Yann Collet	f4dbfce79c	define LIB_SRCDIR and LIB_BINDIR	2023-09-12 13:46:03 -07:00
Nick Terrell	61efb2a047	Add ZSTD_d_maxBlockSize parameter Reduces memory when blocks are guaranteed to be smaller than allowed by the format. This is useful for streaming compression in conjunction with ZSTD_c_maxBlockSize. This PR saves 2 * (formatMaxBlockSize - paramMaxBlockSize) when streaming. Once it is rebased on top of PR #3616 it will save 3 * (formatMaxBlockSize - paramMaxBlockSize).	2023-04-17 22:06:44 -07:00
Nick Terrell	e72e13ac6c	[oss-fuzz] Fix simple_round_trip fuzzer with overlapping decompression When `ZSTD_c_maxBlockSize` is set, we weren't computing the decompression margin correctly, leading to `dstSize_tooSmall` errors. Fix that computation. This is just a bug in the fuzzer, not a bug in the library itself. Credit to OSS-Fuzz	2023-04-13 10:14:29 -07:00
daniellerozenblit	fcaf06ddb4	Check that `dest` is valid for decompression (#3555) * add check for valid dest buffer and fuzz on random dest ptr when malloc 0 * add uptrval to linux-kernel * remove bin files * get rid of uptrval * restrict max pointer value check to platforms where sizeof(size_t) == sizeof(void*)	2023-03-31 23:00:55 -07:00
Elliot Gorokhovsky	a810e1eeb7	Provide an interface for fuzzing sequence producer plugins	2023-03-28 12:02:57 -07:00
Elliot Gorokhovsky	ff42ed1582	Rename "External Matchfinder" to "Block-Level Sequence Producer" (#3484) * change "external matchfinder" to "external sequence producer" * migrate contrib/ to new naming convention * fix contrib build * fix error message * update debug strings * fix def of invalid sequences in zstd.h * nit * update CHANGELOG * fix .gitignore	2023-02-09 17:01:17 -05:00
Elliot Gorokhovsky	7f8189ca57	add ZSTD_c_fastExternalSequenceParsing cctxParam	2023-02-01 09:09:53 -08:00
Elliot Gorokhovsky	64052ef57d	Guard against invalid sequences from external matchfinders (#3465)	2023-01-31 13:55:48 -05:00
Nick Terrell	8957fef554	[huf] Add generic C versions of the fast decoding loops Add generic C versions of the fast decoding loops to serve architectures that don't have an assembly implementation. Also allow selecting the C decoding loop over the assembly decoding loop through a zstd decompression parameter `ZSTD_d_disableHuffmanAssembly`. I benchmarked on my Intel i9-9900K and my Macbook Air with an M1 processor. The benchmark command forces zstd to compress without any matches, using only literals compression, and measures only Huffman decompression speed: ``` zstd -b1e1 --compress-literals --zstd=tlen=131072 silesia.tar ``` The new fast decoding loops outperform the previous implementation uniformly, but don't beat the x86-64 assembly. Additionally, the fast C decoding loops suffer from the same stability problems that we've seen in the past, where the assembly version doesn't. So even though clang gets close to assembly on x86-64, it still has stability issues. \| Arch \| Function \| Compiler \| Default (MB/s) \| Assembly (MB/s) \| Fast (MB/s) \| \|---------\|----------------\|--------------\|----------------\|-----------------\|-------------\| \| x86-64 \| decompress 4X1 \| gcc-12.2.0 \| 1029.6 \| 1308.1 \| 1208.1 \| \| x86-64 \| decompress 4X1 \| clang-14.0.6 \| 1019.3 \| 1305.6 \| 1276.3 \| \| x86-64 \| decompress 4X2 \| gcc-12.2.0 \| 1348.5 \| 1657.0 \| 1374.1 \| \| x86-64 \| decompress 4X2 \| clang-14.0.6 \| 1027.6 \| 1659.9 \| 1468.1 \| \| aarch64 \| decompress 4X1 \| clang-12.0.5 \| 1081.0 \| N/A \| 1234.9 \| \| aarch64 \| decompress 4X2 \| clang-12.0.5 \| 1270.0 \| N/A \| 1516.6 \|	2023-01-25 13:47:51 -08:00
Danielle Rozenblit	7d600c628a	fix bound check for ZSTD_copySequencesToSeqStoreNoBlockDelim()	2023-01-24 06:40:40 -08:00
Danielle Rozenblit	7fc00c18b8	calloc dictionary in sequence compression fuzzer rather than generating a random buffer	2023-01-23 10:42:09 -08:00
Danielle Rozenblit	f75afb613f	merge dev	2023-01-23 08:12:19 -08:00
Danielle Rozenblit	638d502002	modify sequence compression api fuzzer	2023-01-23 07:55:11 -08:00
Nick Terrell	329169189c	Replace Huffman boolean args with flags bit set	2023-01-20 14:12:53 -08:00
Nick Terrell	0cc1b0cb22	Delete unused Huffman functions Remove all Huffman functions that aren't used by zstd.	2023-01-20 14:12:53 -08:00
Elliot Gorokhovsky	f593e54ee1	Enable if == 1 rather than if == 0 Co-authored-by: Nick Terrell <nickrterrell@gmail.com>	2023-01-20 11:41:53 -05:00
Elliot Gorokhovsky	3f9f568aa6	Fuzz the external matchfinder API	2023-01-19 13:33:25 -08:00
daniellerozenblit	dc1c6cc5df	Merge pull request #3418 from daniellerozenblit/fuzz-max-block-size Fuzz on maxBlockSize	2023-01-19 08:18:04 -05:00
Nick Terrell	5b266196a4	Add support for in-place decompression * Add a function and macro ZSTD_decompressionMargin() that computes the decompression margin for in-place decompression. The function computes a tight margin that works in all cases, and the macro computes an upper bound that will only work if flush isn't used. * When doing in-place decompression, make sure that our output buffer doesn't overlap with the input buffer. This ensures that we don't decide to use the portion of the output buffer that overlaps the input buffer for temporary memory, like for literals. * Add a simple unit test. * Add in-place decompression to the simple_round_trip and stream_round_trip fuzzers. This should help verify that our margin stays correct.	2023-01-12 16:28:08 -08:00
Danielle Rozenblit	1fffcfe01d	update minimum threshold for max block size	2023-01-11 11:09:57 -08:00
Daniel Kutenin	ca2ff788df	Make the producer use the same amount of entropy	2023-01-11 10:09:19 -08:00
Daniel Kutenin	3ac0b91302	Fix fuzzing with ZSTD_MULTITHREAD At Google we fuzz zstd without ZSTD_MULTITHREAD but we want inputs to be as reproducible as possible. It allows us to test new fuzzing methods for our fuzz team internally and have more horsepower to find bugs	2023-01-11 10:09:19 -08:00
Danielle Rozenblit	fe08137d9a	resolve max block value in cctx and use when calculating the max block size	2023-01-09 07:53:53 -08:00
Danielle Rozenblit	908e812733	initial commit	2023-01-04 13:01:54 -08:00

210 Commits