lib/zstd

mirror of https://github.com/facebook/zstd.git synced 2025-07-28 00:01:53 +03:00

Author	SHA1	Message	Date
Yann Collet	39c091bc9e	Merge pull request #4397 from xiaoge1001/free Fix several locations with potential memory leak	2025-06-09 10:06:36 -07:00
shixuantong	de8d9e8914	Fix several locations with potential memory leak	2025-06-09 21:23:23 +08:00
Yann Collet	5b6fbf5f96	Merge pull request #4392 from mugitya03/MLK Fix potential memory leak in function `benchMem`	2025-06-08 12:38:31 -07:00
jinyaoguo	dd4cee9190	Release resources before returning In main, resources were freed on the success path but not in the error path. This change ensures all allocated resources are released before returning.	2025-06-03 15:28:11 -04:00
jinyaoguo	4618b255ea	Fix memory leak in function benchMem `speedPerRound` is allocated at the start of benchMem to collect per-round speeds, but is never freed, causing a leak on each invocation.	2025-05-25 15:21:23 -04:00
Dave Vasilevsky	448a09ff78	seekable_format: Fix conversion warnings in parallel_compression	2025-05-07 22:01:49 -07:00
Dave Vasilevsky	13cb7a10ae	seekable_format: Add test for parallel_compression memory usage Use ulimit to fail the test if we use O(filesize) memory, rather than O(threads).	2025-05-07 22:01:49 -07:00
Dave Vasilevsky	01c973de8d	seekable_format: Fix race in parallel_processing There was no memory barrier between writing and reading `done`, which would allow reordering to cause races. With so little data to handle after each job completes, we might as well just join.	2025-05-07 22:01:49 -07:00
Dave Vasilevsky	6fc8455a72	seekable_format: Cleanup POOL in parallel_compression	2025-05-07 22:01:49 -07:00
Dave Vasilevsky	2d4cff69c4	seekable_format: Make parallel_compression use memory properly Previously, parallel_compression would only handle each job's results after ALL jobs were successfully queued. This caused all src/dst buffers to remain in memory until then! It also polled to check whether a job completed, which is racy without any memory barrier. Now, we flush results as a side effect of completing a job. Completed frames are placed in an ordered linked-list, and any eligible frames are flushed. This may be zero or multiple frames, depending on the order in which jobs finish. This design also makes it simple to support streaming input, so that is now available. Just pass `-` as the filename, and stdin/stdout will be used for I/O.	2025-05-07 22:01:49 -07:00
Dave Vasilevsky	f5b6531902	seekable_format: Link against multi-threaded libzstd.a Some of these examples are intended to be parallel, and don't make sense to link against single-threaded libzstd. The filename of mt and nomt libzstd are identical, so it's still possible to link against the single-threaded one, just harder.	2025-05-07 22:01:49 -07:00
Dave Vasilevsky	6b0039abcf	seekable_format: Build with $(MAKE) This passes make flags, such as `-jN` for building in parallel, to the underlying make.	2025-05-07 22:01:49 -07:00
Sergey Senozhatsky	0727fc5ff9	lib: zstd: export API needed for dictionary support Patch series "zram: introduce custom comp backends API", v7. This series introduces support for run-time compression algorithms tuning, so users, for instance, can adjust compression/acceleration levels and provide pre-trained compression/decompression dictionaries which certain algorithms support. At this point we stop supporting (old/deprecated) comp API. We may add new acomp API support in the future, but before that zram needs to undergo some major rework (we are not ready for async compression). Some benchmarks for reference (look at column #2) * init zstd /sys/block/zram0/mm_stat 1750659072 504622188 514355200 0 514355200 1 0 34204 34204 * init zstd dict=/home/ss/zstd-dict-amd64 /sys/block/zram0/mm_stat 1750650880 465908890 475398144 0 475398144 1 0 34185 34185 * init zstd level=8 dict=/home/ss/zstd-dict-amd64 /sys/block/zram0/mm_stat 1750654976 430803319 439873536 0 439873536 1 0 34185 34185 * init lz4 /sys/block/zram0/mm_stat 1750646784 664266564 677060608 0 677060608 1 0 34288 34288 * init lz4 dict=/home/ss/lz4-dict-amd64 /sys/block/zram0/mm_stat 1750650880 619990300 632102912 0 632102912 1 0 34278 34278 * init lz4hc /sys/block/zram0/mm_stat 1750630400 609023822 621232128 0 621232128 1 0 34288 34288 * init lz4hc dict=/home/ss/lz4-dict-amd64 /sys/block/zram0/mm_stat 1750659072 505133172 515231744 0 515231744 1 0 34278 34278 Recompress init zram zstd (prio=0), zstd level=5 (prio 1), zstd with dict (prio 2) * zstd /sys/block/zram0/mm_stat 1750982656 504630584 514269184 0 514269184 1 0 34204 34204 * idle recompress priority=1 (zstd level=5) /sys/block/zram0/mm_stat 1750982656 488645601 525438976 0 514269184 1 0 34204 34204 * idle recompress priority=2 (zstd dict) /sys/block/zram0/mm_stat 1750982656 460869640 517914624 0 514269184 1 0 34185 34204 This patch (of 24): We need to export a number of API functions that enable advanced zstd usage - C/D dictionaries, dictionaries sharing between contexts, etc. Link: https://lkml.kernel.org/r/20240902105656.1383858-1-senozhatsky@chromium.org Link: https://lkml.kernel.org/r/20240902105656.1383858-2-senozhatsky@chromium.org Signed-off-by: Sergey Senozhatsky <senozhatsky@chromium.org> Cc: Nick Terrell <terrelln@fb.com> Cc: Minchan Kim <minchan@kernel.org> Cc: Sergey Senozhatsky <senozhatsky@chromium.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2025-02-28 17:09:46 -05:00
Yann Collet	bbf2801ddd	try to fix the meson windows build apparently could be just a missing include	2025-02-27 15:45:59 -08:00
Giovanni Cabiddu	92be4be810	[linux] Expose ZSTD_compressSequencesAndLiterals() in the kernel Make the function ZSTD_compressSequencesAndLiterals() available in kernel space. This will be used by Intel QAT driver. Additionally, (1) expose the function ZSTD_CCtx_setParameter(), which is required to set parameters before calling ZSTD_compressSequencesAndLiterals(), (2) update the build process to include `compress/zstd_preSplit.o` and (3) replace `asm/unaligned.h` with `linux/unaligned.h`. Signed-off-by: Giovanni Cabiddu <giovanni.cabiddu@intel.com>	2025-01-27 15:24:22 +00:00
Sean Bartell	80af41e08a	[contrib/pzstd] Prevent hangs when there are errors When two threads are using a WorkQueue and the reader thread exits due to an error, it must call WorkQueue::finish() to wake up the writer thread. Otherwise, if the queue is full and the writer thread is waiting for a free slot, it could hang forever. This can happen in practice when decompressing a large, corrupted file that does not contain pzstd skippable frames.	2025-01-13 17:24:41 -05:00
Victor Zhang	a610550e2c	Merge pull request #4218 from facebook/externC Move #includes out of `extern "C"` blocks	2025-01-07 10:06:08 -08:00
Victor Zhang	6b046f5841	PR feedback	2025-01-02 15:05:58 -08:00
Victor Zhang	54c3d998a0	Support for libc variants without fseeko/ftello Some older Android libc implementations don't support `fseeko` or `ftello`. This commit adds a new compile-time macro `LIBC_NO_FSEEKO` as well as a usage in CMake for old Android APIs.	2025-01-02 14:02:10 -08:00
Yann Collet	04a2a0219c	update type names naming convention: Type names should start with a Capital letter (after the prefix)	2024-12-29 14:25:33 -08:00
Yann Collet	56cfb7816a	codemod: ZSTD_paramSwitch_e -> ZSTD_ParamSwitch_e	2024-12-20 10:36:58 -08:00
Victor Zhang	c727d5cd67	Remove unnecessary extern C declarations from programs/ and contrib/	2024-12-19 16:20:23 -08:00
Victor Zhang	fc726da774	Move #includes out of `extern "C"` blocks Do some include shuffling for `**.h` files within lib, programs, tests, and zlibWrapper. `lib/legacy` and `lib/deprecated` are untouched. `#include`s within `extern "C"` blocks in .cpp files are untouched. todo: shuffling for `xxhash.h`	2024-12-17 17:55:07 -08:00
Robert Rose	b683c0dbe2	prevent possible segfault when creating seek table Add a check whether the seek table of a `ZSTD_seekable` is initialized before creating a new seek table from it. Return `NULL`, if the check fails.	2024-11-25 08:57:25 +01:00
Yann Collet	2e02cd330d	inform manual users that it's automatically generated suggested by @Eugeny1	2024-10-31 15:06:48 -07:00
inventor500	9215de52c7	Included suggestion from @neheb	2024-09-25 09:51:05 -07:00
inventor500	a8b544d460	Fixed warning when compiling pzstd with CPPFLAGS=-Wunused-result and CXXFLAGS=-std=c++17	2024-09-25 09:51:05 -07:00
Joseph Lee	1f72f52bc1	gen_html: Fix build error in windows	2024-07-03 12:26:15 +09:00
Elliot Gorokhovsky	3242ac598e	Expose size estimation helpers for Linux (#4064)	2024-06-14 14:38:48 -04:00
Alexander Kanavin	cd4dba74de	pzstd: use c++14 without conditions Doing this check with a direct c++ snippet is prone to portability problems: - \043 is not portable between shells: dash expands it to #, bash does not; - using # directly works with make 4.3 but does not with make 4.2. Let's just use the c++ version that covers both the code and the gtest.	2024-03-18 12:15:26 -04:00
Nick Terrell	c2d470581e	[linux] Remove usage of deprecated function ZSTD_resetDStream() is deprecated and replaced by ZSTD_DCtx_reset(). This removes deprecation warnings from the kernel build. This change is a no-op, see the docs suggesting this replacement. `fcbf2fde9a/lib/zstd.h (L2655-L2663)`	2023-11-17 09:54:10 -08:00
Nick Terrell	a419265d30	[linux] Backport intptr_t removal Linux started providing intptr_t in <linux/types.h> so we no longer need to define it here. https://lkml.kernel.org/r/ed66b9e4-1fb7-45be-9bb9-d4bc291c691f@p183	2023-11-17 09:54:10 -08:00
Yann Collet	c692b8d12d	Merge pull request #3752 from paulmenzel/fix-pzstd-makefile Fix pzstd Makefile to allow setting `DESTDIR` and `BINDIR` separately	2023-10-07 19:32:22 -07:00
Dimitri Papadopoulos	585aaa0ed3	Do not test WIN32, instead test _WIN32 To the best of my knowledge: * `_WIN32` and `_WIN64` are defined by the compiler, * `WIN32` and `WIN64` are defined by the user, to indicate whatever the user chooses them to indicate. They mean 32-bit and 64-bit Windows compilation by convention only. See: https://accu.org/journals/overload/24/132/wilson_2223/ Windows compilers in general, and MSVC in particular, have been defining `_WIN32` and `_WIN64` for a long time, provably at least since Visual Studio 2015, and in practice as early as in the days of 16-bit Windows. See: https://learn.microsoft.com/en-us/cpp/preprocessor/predefined-macros?view=msvc-140 https://learn.microsoft.com/en-us/windows/win32/winprog64/the-tools Tests used to be inconsistent, sometimes testing `_WIN32`, sometimes `_WIN32` and `WIN32`. This brings consistency to Windows detection.	2023-09-23 19:03:18 +02:00
Paul Menzel	d55ebb5718	[pzstd]: Fix `DESTDIR` handling to allow setting `BINDIR` Currently, setting `BINDIR` and `DESTDIR` separately is not possible, so the command below fails, as BINDIR is set explicitly: $ make -j80 install PREFIX=/usr EPREFIX=/usr BINDIR=/usr/bin SBINDIR=/usr/sbin LIBEXECDIR=/usr/libexec SYSCONFDIR=/etc SHAREDSTATEDIR=/var LOCALSTATEDIR=/var LIBDIR=/usr/lib INCLUDEDIR=/usr/include DATAROOTDIR=/usr/share DATADIR=/usr/share INFODIR=/usr/share/info LOCALEDIR=/usr/share/locale MANDIR=/usr/share/man DOCDIR=/usr/share/doc/zstd DESTDIR=/dev/shm/bee-pmenzel/zstd/zstd-1.5.5-0/image -C contrib/pzstd DESTDIR=/dev/shm/bee-pmenzel/zstd/zstd-1.5.5-0/image make: Entering directory '/dev/shm/bee-pmenzel/zstd/zstd-1.5.5-0/source/contrib/pzstd' CFLAGS=" -I../../lib -I../../lib/common -I../../programs -I. -DNDEBUG -O3 -Wall -Wextra -Wno-deprecated-declarations " LDFLAGS=" -O3 -Wall -Wextra -pedantic " make -C ../../lib libzstd.a make[1]: Entering directory '/dev/shm/bee-pmenzel/zstd/zstd-1.5.5-0/source/lib' make[1]: Leaving directory '/dev/shm/bee-pmenzel/zstd/zstd-1.5.5-0/source/lib' g++ main.o ../../programs/util.o Options.o Pzstd.o SkippableFrame.o ../../lib/libzstd.a -O3 -Wall -Wextra -pedantic -pthread -o pzstd install -d -m 755 /usr/bin/ install -m 755 pzstd /usr/bin/pzstd install: cannot create regular file '/usr/bin/pzstd': Permission denied make: *** [Makefile:116: install] Error 1 So, do not prefix `BINDIR` with `DESTDIR`, and adapt all paths for installation. This is more common, and, for example, `programs/Makefile` does the same. Fixes: `8b4e84249b` ("[pzstd] Fix Makefile")	2023-09-12 19:15:25 +02:00
Yann Collet	d9645327b3	fixed MEM_STATIC already defined in Linux Kernel mode	2023-06-14 20:07:18 -07:00
Yoni Gilad	649a9c85c3	seekable_format: Add unit test for multiple decompress calls This does the following: 1. Compress test data into multiple frames 2. Perform a series of small decompressions and seeks forward, checking that compressed data wasn't reread unnecessarily. 3. Perform some seeks forward and backward to ensure correctness.	2023-03-29 21:35:52 -07:00
Yoni Gilad	618bf84e0d	seekable_format: Prevent rereading frame when seeking forward When decompressing a seekable file, if seeking forward within a frame (by issuing multiple ZSTD_seekable_decompress calls with a small gap between them), the frame will be unnecessarily reread from the beginning. This patch makes it continue using the current frame data and simply skip over the unneeded bytes.	2023-03-29 21:24:12 -07:00
W. Felix Handte	cbe0f0e435	Switch Strategies: Only Set `-std=c++11` When Default is Older	2023-03-27 18:37:19 -04:00
W. Felix Handte	1b8bddc41e	[contrib/pzstd] Detect and Select Maximum Available C++ Standard Rather than remove the flag entirely, as proposed in #3499, this commit uses the newest C++ standard the compiler supports. This retains the selection of using only standardized features (excluding GNU extensions) and keeps the recency requirements of the codebase explicit. Tested with various versions of `g++` and `clang++`.	2023-03-27 11:24:47 -04:00
Yann Collet	dd8cb5a0f1	added documentation for the seekable format and notably provide additional context for the Maximum Frame Size parameter. requested by @P-E-Meunier at `1df9f36c6c (commitcomment-103856979)`.	2023-03-10 15:54:31 -08:00
Yann Collet	1df9f36c6c	Improved seekable format ingestion speed for small frame size As reported by @P-E-Meunier in https://github.com/facebook/zstd/issues/2662#issuecomment-1443836186, seekable format ingestion speed can be particularly slow when selected `FRAME_SIZE` is very small, especially in combination with the recent row_hash compression mode. The specific scenario mentioned was `pijul`, using frame sizes of 256 bytes and level 10. This is improved in this PR, by providing approximate parameter adaptation to the compression process. Tested locally on a M1 laptop, ingestion of `enwik8` using `pijul` parameters went from 35sec. (before this PR) to 2.5sec (with this PR). For the specific corner case of a file full of zeroes, this is even more pronounced, going from 45sec. to 0.5sec. These benefits are unrelated to (and come on top of) other improvement efforts currently being made by @yoniko for the row_hash compression method specifically. The `seekable_compress` test program has been updated to allow setting compression level, in order to produce these performance results.	2023-03-09 18:00:30 -08:00
W. Felix Handte	cd9486031d	Also Pin Dockerfile Dependency Hashes	2023-03-09 17:01:22 -05:00
Nick Terrell	6313a58e45	[linux-kernel] Fix assert definition Backport upstream fix of the assert definition. This code is currently unused, and can be enabled for testing, which is why it wasn't caught. https://lore.kernel.org/lkml/20230129131436.1343228-1-j.neuschaefer@gmx.net/	2023-03-07 16:53:36 -08:00
Yonatan Komornik	c78f434aa4	Fix zstd-dll build missing dependencies (#3496) * Fixes zstd-dll build (https://github.com/facebook/zstd/issues/3492): - Adds pool.o and threading.o dependency to the zstd-dll target - Moves custom allocation functions into header to avoid needing to add dependency on common.o - Adds test target for zstd-dll - Adds github workflow that builds zstd-dll	2023-02-12 12:32:31 -08:00
Elliot Gorokhovsky	ff42ed1582	Rename "External Matchfinder" to "Block-Level Sequence Producer" (#3484) * change "external matchfinder" to "external sequence producer" * migrate contrib/ to new naming convention * fix contrib build * fix error message * update debug strings * fix def of invalid sequences in zstd.h * nit * update CHANGELOG * fix .gitignore	2023-02-09 17:01:17 -05:00
Yann Collet	4de9d637e8	minor: fix missing newline character in help page	2023-02-08 15:56:49 -08:00
Danielle Rozenblit	63042f1f11	fix 32bit build errors in zstd seekable	2023-01-24 15:53:59 -08:00
Nick Terrell	0cc1b0cb22	Delete unused Huffman functions Remove all Huffman functions that aren't used by zstd.	2023-01-20 14:12:53 -08:00
Elliot Gorokhovsky	2a402626dd	External matchfinder API (#3333) * First building commit with sample matchfinder * Set up ZSTD_externalMatchCtx struct * move seqBuffer to ZSTD_Sequence* * support non-contiguous dictionary * clean up parens * add clearExternalMatchfinder, handle allocation errors * Add useExternalMatchfinder cParam * validate useExternalMatchfinder cParam * Disable LDM + external matchfinder * Check for static CCtx * Validate mState and mStateDestructor * Improve LDM check to cover both branches * Error API with optional fallback * handle RLE properly for external matchfinder * nit * Move to a CDict-like model for resource ownership * Add hidden useExternalMatchfinder bool to CCtx_params_s * Eliminate malloc, move to cwksp allocation * Handle CCtx reset properly * Ensure seqStore has enough space for external sequences * fix capitalization * Add DEBUGLOG statements * Add compressionLevel param to matchfinder API * fix c99 issues and add a param combination error code * nits * Test external matchfinder API * C90 compat for simpleExternalMatchFinder * Fix some @nocommits and an ASAN bug * nit * nit * nits * forward declare copySequencesToSeqStore functions in zstd_compress_internal.h * nit * nit * nits * Update copyright headers * Fix CMake zstreamtest build * Fix copyright headers (again) * typo * Add externalMatchfinder demo program to make contrib * Reduce memory consumption for small blockSize * ZSTD_postProcessExternalMatchFinderResult nits * test sum(matchlen) + sum(litlen) == srcSize in debug builds * refExternalMatchFinder -> registerExternalMatchFinder * C90 nit * zstreamtest nits * contrib nits * contrib nits * allow block splitter + external matchfinder, refactor * add windowSize param * add contrib/externalMatchfinder/README.md * docs * go back to old RLE heuristic because of the first block issue * fix initializer element is not a constant expression * ref contrib from zstd.h * extremely pedantic compiler warning fix, meson fix, typo fix * Additional docs on API limitations * minor nits * Refactor maxNbSeq calculation into a helper function * Fix copyright	2022-12-28 16:45:14 -05:00


802 Commits
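
Illustrative note on commit 2d4cff69c4: its message describes flushing results as a side effect of completing a job, with each completion flushing zero or more frames depending on the order in which jobs finish. The sketch below is a rough Python illustration of that ordering idea only, not the code from `parallel_compression`. The class name `OrderedFlusher`, its methods, and the frame contents are hypothetical, and a dict keyed by job index stands in for the ordered linked list the commit mentions. It is single-threaded; a multi-threaded version would presumably need a lock around `complete`.

```python
from typing import Callable, Dict, List


class OrderedFlusher:
    """Flush job results in submission order, even if jobs finish out of order.

    Hypothetical helper for illustration; not part of zstd.
    """

    def __init__(self, sink: Callable[[bytes], None]) -> None:
        self._sink = sink
        # Finished frames that are still waiting for an earlier job to finish.
        self._pending: Dict[int, bytes] = {}
        # Index of the next frame that is allowed to be written.
        self._next_to_flush = 0

    def complete(self, job_index: int, frame: bytes) -> int:
        """Record a finished frame, then flush every frame that is now eligible.

        Returns how many frames this call flushed (possibly zero).
        """
        self._pending[job_index] = frame
        flushed = 0
        while self._next_to_flush in self._pending:
            self._sink(self._pending.pop(self._next_to_flush))
            self._next_to_flush += 1
            flushed += 1
        return flushed


# Jobs 0..3 finish in the order 2, 0, 1, 3.
out: List[bytes] = []
flusher = OrderedFlusher(out.append)
counts = [flusher.complete(i, f"frame{i}".encode()) for i in (2, 0, 1, 3)]
```

Here job 2 finishes first and flushes nothing, job 0 flushes only itself, and job 1 releases both itself and the waiting job 2, so buffers are held only until their predecessors complete rather than until every job has been queued.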