mariadb-columnstore-engine

mirror of https://github.com/mariadb-corporation/mariadb-columnstore-engine.git synced 2025-11-02 06:13:16 +03:00

Author	SHA1	Message	Date
Leonid Fedorov	f7118b53a8	Turn on ASAN for unitests (#2719 ) Fix asan error on compression tests Fix warn of nonreturn function	2023-02-02 15:08:01 +02:00
Roman Nozdrin	9746a2572b	This commit adds pattern match feature using MPark's library (#2665 )	2022-12-20 19:00:32 +03:00
Leonid Fedorov	37fd915a08	Serg`s patch for develop-6 revised for develop https://github.com/mariadb-corporation/mariadb-columnstore-engine/pull/2614	2022-11-09 22:41:38 +00:00
NTH19	7d76dc4534	AUX column scan(MCOL-5021) effectively disables vectorized scanning on ARM platforms. This patch resolves this issue and unifies AUX column processing at x86 and ARM using tempate class SimdProcessor. The patch also replaces uint16_t mask previously used in column.cpp and SimProcessor code with a native masks that platform uses, e.g. __m128i or __m128 on x86 and variety of masks on ARM. To unify the processing I introduced a new filtering Compare Operator - COMPARE_NULLEQ. with a 'c1 IS NULL semantics'.	2022-10-07 10:32:54 +00:00
NTH19	3ef706b054	fix micro benchmark	2022-08-29 19:30:45 +08:00
mariadb-AndreyPiskunov	0863ecd279	Replace getBinaryField	2022-08-25 18:21:43 +03:00
Sergei Golubchik	a7a9ccf889	Serg dev (#2504 ) * more build dependencies * fix for cmake < 3.11 It cannot do ADD_LIBRARY(... ALIAS ...) on IMPORTED targets * another fix for cmake 3.10.2 It doesn't know about CMAKE_CXX_STANDARD=20, let's add the correct flag manually * gcc 8 on aarch64 utils/common/simd_arm.h:241:16: error: need ‘typename’ before ‘simd::TypeToVecWrapperType<T>::WrapperType’ because ‘simd::TypeToVecWrapperType<T>’ is a dependent scope	2022-08-15 13:35:30 +03:00
Roman Nozdrin	f40175dc32	This patch solves the ocassional issues with the FairThreadPool unit test	2022-08-11 16:56:17 +00:00
Roman Nozdrin	a9d8924683	MCOL-5166 This patch adds support for in-memory communication b/w EM to PP via a shared queue in DEC class JobList low-level code relateod to primitive jobs now uses shared pointers instead of ByteStream refs talking to DEC b/c same-node EM-PP communication now goes over a queue in DEC instead of a network hop. PP now has a separate thread that processes the primitive job messages from that DEC queue.	2022-08-04 18:51:31 +03:00
NTH19	19ca844cd1	support_max_min	2022-08-04 16:16:38 +03:00
Andrey Piskunov	589b786fda	Don't ignore null or empty in calculation	2022-08-04 16:16:38 +03:00
Andrey Piskunov	04ac04ff74	Temporary test fix	2022-08-04 16:16:38 +03:00
Andrey Piskunov	5c6cd2cca3	use vect update for everything except TEXT	2022-08-04 16:16:38 +03:00
Andrey Piskunov	225f54fd79	Tests for simd min/max	2022-08-04 16:16:38 +03:00
Andrey Piskunov	b8200acd3b	Don't ignore null or empty in calculation	2022-08-04 16:16:38 +03:00
Andrey Piskunov	2a7da39610	Temporary test fix	2022-08-04 16:16:38 +03:00
Andrey Piskunov	c4df7925d1	use vect update for everything except TEXT	2022-08-04 16:16:38 +03:00
Andrey Piskunov	1681edaca0	Tests for simd min/max	2022-08-04 16:16:38 +03:00
Roman Nozdrin	3b87532413	Revert "This patch disables FairThreadPool to double check if this feature contributes to multiple strange side-effects and ocassional failed MTR tests" This reverts commit `b78cbffa93`.	2022-07-22 14:04:06 +00:00
Roman Nozdrin	b78cbffa93	This patch disables FairThreadPool to double check if this feature contributes to multiple strange side-effects and ocassional failed MTR tests	2022-07-20 11:17:19 +00:00
Leonid Fedorov	1cd382ba3b	Clang warning fix	2022-07-16 16:26:10 +03:00
Leonid Fedorov	140770d6f4	Delete tests/shared_components_tests.cpp, erase legacy code from tests/primitives_scan_bench.cpp, option to run benchmarks from build/bootstrap_mcs.sh	2022-07-15 15:56:24 +00:00
Leonid Fedorov	56b01fdefc	Workaround for gtest compile bug	2022-07-11 22:27:25 +02:00
Roman Nozdrin	0907ca414f	MCOL-5044 This patch simplifies addJob interfaces removing extra bool that control mutex locking, adds additional nullptr dereference check in removeJobs and fixes FairThreadPool hashmap iter invalidation issues	2022-07-09 12:50:30 +00:00
Roman Nozdrin	6cff14997d	Revert "This reverts MCOL-5044 AKA FairThreadPool that breaks regr test002" This reverts commit `61359119ad`.	2022-07-09 12:38:51 +00:00
NTH19	a4842ef998	rename	2022-06-24 16:53:02 +08:00
NTH19	4c0b8fd829	simd of arm neon unit testing pass unit test for simdprocessor add test cases implement specific _mm_movemask for different types float movemask change rename	2022-06-24 11:24:59 +08:00
Leonid Fedorov	3638f4ac8c	Replace gtest_discovery_tests with gtests_add_tests Despite we have another number of tests in result, they all still run gtests_add_test cannot parse TYPED_TEST_SUITE one by one and run them in one bunch	2022-06-13 15:05:10 +00:00
Roman Nozdrin	61359119ad	This reverts MCOL-5044 AKA FairThreadPool that breaks regr test002 This reverts commit `e40c16bd56`, reversing changes made to `18e6b1d77b`.	2022-06-10 14:17:59 +00:00
Roman Nozdrin	fd8ba33f21	MCOL-5044 This patch replaces PriorityThreadPool with FairThreadPool that uses a simple operations + morsel size weight model to equally allocate CPU b/w parallel query morsels. This patch delivers better parallel query timings distribution(timings graph resembles normal distribution with a bigger left side thus more queries runs faster comparing with PrioThreadPool-based single-node installation). See changes in batchprimitiveprocessor-jl.h and comments in fair_threadpool.h for important implementation details	2022-06-03 10:08:12 +00:00
Roman Nozdrin	0f0b3a2bed	Disable FairThreadPool unit tests in develop-6 b/c its unit test segfaults in containers	2022-06-02 17:05:30 +00:00
Roman Nozdrin	c92dc08264	MCOL-5044 Initial version of a fair thread pool PP now uses PriorityThreadPool that arbitrary picks another jobs pack to run. This scheduling discipline tend to run portions of a single query forcing other simultaneous queries to wait. In result parallel queries timings variance is high. The FairThreadPool picks the job with the smallest amount of work done so far(see the code for details)	2022-06-02 17:05:12 +00:00
Roman Nozdrin	4c26e4f960	MCOL-4912 This patch introduces Extent Map index to improve EM scaleability EM scaleability project has two parts: phase1 and phase2. This is phase1 that brings EM index to speed up(from O(n) down to the speed of boost::unordered_map) EM lookups looking for <dbroot, oid, partition> tuple to turn it into LBID, e.g. most bulk insertion meta info operations. The basis is boost::shared_managed_object where EMIndex is stored. Whilst it is not debug-friendly it allows to put a nested structs into shmem. EMIndex has 3 tiers. Top down description: vector of dbroots, map of oids to partition vectors, partition vectors that have EM indices. Separate EM methods now queries index before they do EM run. EMIndex has a separate shmem file with the fixed id MCS-shm-00060001.	2022-05-04 12:59:16 +00:00
Roman Nozdrin	7cdc914b4e	MCOL-4809 This patch introduces vectorized scanning/filtering for short CHAR/VARCHAR columns Short CHAR/VARCHAR column values contain integer-encoded strings. After certain manipulations(orderSwap(strnxfrm(str))) the values become integers that preserve original strings order relation according to a certain translation rules(collation). Prepared values are ready to be SIMD-processed.	2022-04-01 10:28:33 +00:00
Leonid Fedorov	c847f6ce25	Fix segfault for vector scan tests on clang	2022-03-25 13:49:25 +00:00
Leonid Fedorov	fbd043b036	Fixing alightment for clang tests of rowgroup	2022-03-23 14:29:19 +00:00
Roman Nozdrin	b46f4b42b3	MCOL-4809 Vectorized comparison operations unit tests This commit replaces system googletest with 0.11.1 version compiled from sources to enable typed tests feature	2022-02-25 14:32:47 +03:00
Leonid Fedorov	3919c541ac	New warnfixes (#2254 ) * Fix clang warnings * Remove vim tab guides * initialize variables * 'strncpy' output truncated before terminating nul copying as many bytes from a string as its length * Fix ISO C++17 does not allow 'register' storage class specifier for outdated bison * chars are unsigned on ARM, having if (ival < 0) always false * chars are unsigned by default on ARM and comparison with -1 if always true	2022-02-17 13:08:58 +03:00
Roman Nozdrin	c79dfc4925	MCOL-4809 This patch adds support for float data types filtering and scanning vectorization	2022-02-03 16:38:56 +00:00
Leonid Fedorov	04752ec546	clang format apply	2022-01-21 16:43:49 +00:00
Leonid Fedorov	01f3ceb437	replace header guards with #pragma once	2022-01-21 15:24:58 +00:00
Roman Nozdrin	af36f9940f	This patch introduces support for scanning/filtering vectorized execution for numeric-based data types TEXT, CHAR, VARCHAR, FLOAT and DOUBLE are not yet supported by vectorized path This patch introduces an example for Google benchmarking suite to measure a perf diff b/w legacy scan/filtering code and the templated version	2021-12-10 10:30:00 +00:00
Roman Nozdrin	3de038c1da	MCOL-4876 This patch enables continues buffer to be used by ColumnCommand and aligns BPP::blockData that in most cases was unaligned	2021-10-06 09:23:40 +00:00
Roman Nozdrin	4cb9fe4850	This patch migrates filtering UT to ctest and elimites static files dependencies of the UT	2021-10-05 15:03:18 +00:00
Roman Nozdrin	67c85dae15	MCOL-4809 The patch replaces legacy scanning/filtering code with a number of templates that simplifies control flow removing needless expressions	2021-09-06 17:04:52 +00:00
Leonid Fedorov	f584e90718	Drone build run unittests	2021-08-03 05:36:05 +03:00
Leonid Fedorov	73e710ed52	Add ctest for google unittests	2021-08-02 19:41:04 +03:00
Denis Khalikov	cc1c3629c5	MCOL-987 Add LZ4 compression. * Adds CompressInterfaceLZ4 which uses LZ4 API for compress/uncompress. * Adds CMake machinery to search LZ4 on running host. * All methods which use static data and do not modify any internal data - become `static`, so we can use them without creation of the specific object. This is possible, because the header specification has not been modified. We still use 2 sections in header, first one with file meta data, the second one with pointers for compressed chunks. * Methods `compress`, `uncompress`, `maxCompressedSize`, `getUncompressedSize` - become pure virtual, so we can override them for the other compression algos. * Adds method `getChunkMagicNumber`, so we can verify chunk magic number for each compression algo. * Renames "s/IDBCompressInterface/CompressInterface/g" according to requirement.	2021-07-06 18:04:37 +03:00
Denis Khalikov	5d497e8821	MCOL-4566: Add rebuildEM tool support to work with compressed files. * This patch adds rebuildEM tool support to work with compressed files. * This patch increases a version of the file header. Note: Default version of the `rebuildEM` tool was using very old API, those functions are not present currently. So `rebuildEM` will not work with files created without compression, because we cannot deduce some info which are needed to create column extent.	2021-04-02 10:55:01 +03:00
Roman Nozdrin	508d5455a8	Merge pull request #1795 from denis0x0D/MCOL-4566/CompressedHeader MCOL-4566: Extend CompressedDBFileHeader struct with new fields.	2021-03-08 12:24:59 +03:00

1 2 3

120 Commits