mariadb-columnstore-engine

mirror of https://github.com/mariadb-corporation/mariadb-columnstore-engine.git synced 2025-08-08 14:22:09 +03:00

Author	SHA1	Message	Date
David.Hall	2020f35e88	Mcol 5092 MODA uses wrong column width for some types (#2450 ) * MCOL-5092 Ensure column width is correct for datatype Change MODA return type to STRING Modify MODA to handle every numeric type * MCOL-5162 MODA to support char and varchar with collation support Fixes to the aggregate bit functions When we fixed the storage sign issue for MCOL-5092, it uncovered a problem in the bit aggregates (bit_and, bit_or and bit_xor). These aggregates should always return UBIGINT, but they relied on the type of the argument column, which gave bad results.	2022-08-11 15:16:11 -05:00
Denis Khalikov	61cf18b92d	[MCOL-5167] Add support for an ON clause filter on a table which is not involved in the join. This patch adds support for an ON clause filter on a table which is not involved in a particular join by disabling the `merge optimization` for those particular cases. The `merge optimization` is an optimization in which CS tries to create a single BPP join with one `large side` table and multiple `small side` tables; in this case we cannot apply an FE filter if the filter requires columns from a `small side` table which is not involved in that particular join.	2022-08-10 10:46:43 +00:00
Gagan Goel	cbfdae3481	MCOL-5021 Code changes based on review feedback.	2022-08-05 14:40:50 -04:00
Gagan Goel	11b7ee2f11	MCOL-5021 Disallow the following ALTER TABLE ADD COLUMN statement: ALTER TABLE calpontsys.systable ADD COLUMN (auxcolumnoid INT NOT NULL DEFAULT 0);	2022-08-05 14:40:50 -04:00
Gagan Goel	1355237ca3	MCOL-5021 Some minor fixes.	2022-08-05 14:40:50 -04:00
Gagan Goel	94e9f55940	MCOL-5021 Add a new member function to the DBRM class, DBRM::addToLBIDList(). This function iterates over lbidList (populated by an earlier call to DBRM::getUncommittedExtentLBIDs()) to find those LBIDs which belong to the AUX column. It then finds the corresponding LBIDs for all other columns which belong to the same table as the AUX LBID and appends them to lbidList. The updated lbidList is used by invalidateUncommittedExtentLBIDs() to update the casual partitioning information. DBRM::addToLBIDList() only comes into play in case of a transaction ROLLBACK.	2022-08-05 14:40:50 -04:00
Gagan Goel	9b6d3c3870	MCOL-5021 Add support for AUX column in the client code calling CalpontSystemCatalog::columnRIDs().	2022-08-05 14:40:49 -04:00
Gagan Goel	439db48c5a	MCOL-5021 Add support for the AUX column in TRUNCATE table processing.	2022-08-05 14:40:49 -04:00
Gagan Goel	ea1861fdb5	MCOL-5021 Add a new function to CalpontSystemCatalog class, isAUXColumnOID(), to check if a given OID is an auxiliary column OID.	2022-08-05 14:40:49 -04:00
Gagan Goel	262cd5c501	MCOL-5021 Remove hard-coded values for data type, column width and compression type for the AUX column, and replace them with constants defined in the execplan namespace.	2022-08-05 14:40:49 -04:00
Gagan Goel	2280b1dd25	MCOL-5021 Add support for the AUX column in ExeMgr and PrimProc. In the joblist code, in addition to sending the lbid of the SCAN column, we also send the corresponding lbid of the AUX column to PrimProc. In the primitives processor code in PrimProc, we load the AUX column block (8192 rows since the AUX column is implemented as a 1-byte UNSIGNED TINYINT) into memory and then pass it down to the low-level scanning (vectorized scanning as applicable) routine to build a non-Empty mask for the block being processed to filter out DELETED rows based on comparison of the AUX block row to the empty magic value for the AUX column.	2022-08-05 14:40:49 -04:00
Gagan Goel	86df9a972c	MCOL-5021 Add prototype support for the AUX column in CREATE/DROP DDL commands, single and multi-value INSERTs, cpimport, and DELETE.	2022-08-05 14:40:49 -04:00
David.Hall	d3b57ec767	MCOL-4800 emit error if IN filter > 65535 entries (#2480 ) * MCOL-4800 emit error if IN filter > 65535 entries	2022-08-04 19:21:58 +03:00
Roman Nozdrin	a9d8924683	MCOL-5166 This patch adds support for in-memory communication between EM and PP via a shared queue in the DEC class. JobList low-level code related to primitive jobs now uses shared pointers instead of ByteStream refs when talking to DEC, because same-node EM-PP communication now goes over a queue in DEC instead of a network hop. PP now has a separate thread that processes the primitive job messages from that DEC queue.	2022-08-04 18:51:31 +03:00
Denis Khalikov	e519cd7486	[MCOL-5061] Fix wrong `join id` assignment for the views. (#2474 ) This patch fixes a wrong `join id` assignment for `TupleHashJoinStep` in a view. After MCOL-334 CS assigns '-1' as the `join id` for `TupleHashJoinStep` in a view, and in this case we cannot apply a filter for a specific `Join step`, which is associated with a `join id`, for 2 reasons: 1. Filters for all `TupleHashJoinSteps` are associated with the same `join id`, which is '-1'. 2. When CS creates a `joinIdIndexMap` it eliminates all `join ids` which are less than or equal to 0. This patch also fixes some tests for the view, which were generating wrong results.	2022-07-25 20:02:02 +03:00
Denis Khalikov	636e60b5f9	[MCOL-4699] Add support for circular outer joins.	2022-07-19 21:47:36 +03:00
David.Hall	08bef648b3	Mcol 5074 Case with In and aggregates asserts (#2435 ) * MCOL-5074 CASE with IN and aggregate asserts gwip-scsp wasn't set and buildPredicateItem() was called which assumes it is set. Added code to set properly in this case	2022-07-11 16:20:15 -05:00
Leonid Fedorov	39c43a0f70	<unnamed>.execplan::CalpontSystemCatalog::TableName::create_date' may be used uninitialized	2022-07-11 22:27:25 +02:00
Roman Nozdrin	6cff14997d	Revert "This reverts MCOL-5044 AKA FairThreadPool that breaks regr test002" This reverts commit `61359119ad`.	2022-07-09 12:38:51 +00:00
david.hall	c71d11cb3f	Restore calonlinealter	2022-07-06 09:22:49 -05:00
Leonid Fedorov	242769d542	Mistype bug error handler fix	2022-07-05 18:48:30 +03:00
Roman Nozdrin	a3bc3de5f4	Merge pull request #2432 from mariadb-corporation/dataload-raw MCOL-5013: Load Data from S3 into Columnstore	2022-07-05 13:06:53 +03:00
Roman Nozdrin	38c4b973dd	Merge pull request #2421 from denis0x0D/MCOL-4778 [MCOL-4778] Return if we have an error in push_down_init.	2022-07-04 21:16:13 +03:00
Leonid Fedorov	110d9cfab5	Review fixes	2022-07-04 19:52:37 +03:00
Leonid Fedorov	f5b2a6885f	MCOL-5013: Load Data from S3 into Columnstore. Introduced a UDF and a stored procedure. Usage: set columnstore_s3_key='<s3_key>'; set columnstore_s3_secret='<s3_secret>'; set columnstore_s3_region='region'; and then use the UDF: select columnstore_dataload("<tablename>", "<filename>", "<bucket>", "<db_name>"); For the UDF, db_name can be omitted, in which case the current connection db is used. Alternatively, use the stored procedure: call calpontsys.columnstore_load_from_s3("<tablename>", "<filename>", "<bucket>", "<db_name>");	2022-07-04 19:52:37 +03:00
Roman Nozdrin	1624c347f6	MCOL-5152 This patch enables PP to put ByteStreams into DEC input queue directly for a local PP-EM connection	2022-07-04 09:06:40 +00:00
Roman Nozdrin	fcf8596089	Merge pull request #2403 from denis0x0D/MCOL-5109 [MCOL-5109] Make PPS as singleton	2022-06-21 16:17:05 +03:00
Denis Khalikov	e8f83121d2	[MCOL-4778] Return if we have an error in push_down_init.	2022-06-21 00:06:25 +03:00
david.hall	9a24934728	MCOL-4841 remove BOOST_BIND_GLOBAL_PLACEHOLDERS drone has this defined on the command line	2022-06-14 16:16:38 -05:00
david.hall	6d47529499	Merge branch 'develop' into MCOL-4841	2022-06-14 14:41:41 -05:00
david.hall	d4cf894edc	MCOL-4841 fix some compiler issues	2022-06-14 14:32:01 -05:00
Roman Nozdrin	61359119ad	This reverts MCOL-5044 AKA FairThreadPool that breaks regr test002 This reverts commit `e40c16bd56`, reversing changes made to `18e6b1d77b`.	2022-06-10 14:17:59 +00:00
David.Hall	272246e9fa	Merge branch 'develop' into MCOL-4841	2022-06-09 16:58:33 -05:00
Roman Nozdrin	e40c16bd56	Merge pull request #2404 from drrtuy/MCOL-5044-dev MCOL-5044 FairThreadPool implementation	2022-06-09 22:23:52 +03:00
david.hall	3b6449842f	Merge branch 'develop' into MCOL-4841 # Conflicts: # exemgr/main.cpp # oam/etc/Columnstore.xml.singleserver # primitives/primproc/primproc.cpp	2022-06-09 10:07:26 -05:00
Denis Khalikov	467fe0b401	[MCOL-5109] Make a singleton from ServicePrimProc. This patch makes a singleton from ServicePrimProc.	2022-06-07 13:27:45 +03:00
Andrey Piskunov	c5fa27475d	Welford algorithm for STD and VAR. The naive algorithm for calculating STD and VAR is subject to catastrophic cancellation. The well-known Welford's algorithm is used instead.	2022-06-03 15:29:30 +03:00
Roman Nozdrin	fd8ba33f21	MCOL-5044 This patch replaces PriorityThreadPool with FairThreadPool, which uses a simple operations + morsel size weight model to equally allocate CPU between parallel query morsels. This patch delivers a better parallel query timings distribution (the timings graph resembles a normal distribution with a bigger left side, thus more queries run faster compared with a PrioThreadPool-based single-node installation). See changes in batchprimitiveprocessor-jl.h and comments in fair_threadpool.h for important implementation details.	2022-06-03 10:08:12 +00:00
Roman Nozdrin	f29d5e7869	MCOL-4912 This patch adds some forgotten MDB functions	2022-05-27 16:27:07 +00:00
benthompson15	e147184b8d	MCOL-5065: return values of getSystemReady/getSystemQueryReady should be > 0 (#2354 )	2022-05-10 12:32:17 -05:00
Roman Nozdrin	4c26e4f960	MCOL-4912 This patch introduces an Extent Map index to improve EM scalability. The EM scalability project has two parts: phase1 and phase2. This is phase1, which brings an EM index to speed up (from O(n) down to the speed of boost::unordered_map) EM lookups that turn a <dbroot, oid, partition> tuple into an LBID, e.g. most bulk insertion meta info operations. The basis is boost::shared_managed_object where EMIndex is stored. Whilst it is not debug-friendly, it allows nested structs to be put into shmem. EMIndex has 3 tiers. Top-down description: vector of dbroots, map of oids to partition vectors, partition vectors that have EM indices. Separate EM methods now query the index before they do an EM run. EMIndex has a separate shmem file with the fixed id MCS-shm-00060001.	2022-05-04 12:59:16 +00:00
David.Hall	bbb168a846	Mcol 4560 (#2337 ) * MCOL-4560 remove unused xml entries and code that references it. There is reader code and variables for some of these settings, but nobody uses them.	2022-04-18 18:00:17 -04:00
Roman Nozdrin	e174696351	MCOL-5001 This patch merges the ExeMgr and PrimProc runtimes. EM and PP are the most resource-hungry runtimes. The merge makes it possible to control their cumulative resource consumption and thread allocation, and enables zero-copy data exchange between local EM and PP facilities.	2022-04-04 11:46:33 +00:00
Commander thrashdin	f28e00c206	No repeating code in client_udfs + better test	2022-03-28 21:48:47 +03:00
Commander thrashdin	8d31478b72	Added mcsUDFs to install.sh+removed nonexistent fn	2022-03-28 21:48:47 +03:00
Commander thrashdin	749b8f16ee	Added mcs-named UDFs to cpp	2022-03-28 21:48:46 +03:00
Leonid Fedorov	65252df4f6	C++20 fixes	2022-03-28 12:32:29 +00:00
Leonid Fedorov	29679e91ec	Clang warnfixes (#2310 )	2022-03-21 13:19:55 -05:00
Serguey Zefirov	53b9a2a0f9	MCOL-4580 extent elimination for dictionary-based text/varchar types. The idea is relatively simple: encode prefixes of collated strings as integers and use them to compute extents' ranges. Then we can eliminate extents with strings. The actual patch does have all the code there but misses one important step: we do not keep the collation index, we keep the charset index. Because of this, some of the tests in the bugfix suite fail and thus the main functionality is turned off. The reason this patch is put into a PR at all is that it contains changes that make CHAR/VARCHAR columns unsigned. This change is needed for the vectorization work.	2022-03-02 23:53:39 +03:00
Leonid Fedorov	b5ccd52c09	OpenSSL 3 support for Columnstore	2022-02-18 14:07:23 +00:00
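
Note on commit c5fa27475d (Welford algorithm for STD and VAR): the following is a minimal sketch of Welford's single-pass update, included only to illustrate why it avoids the catastrophic cancellation of the naive sum-of-squares formula. The struct and member names are hypothetical and are not taken from the ColumnStore sources; the actual aggregate code may be organized quite differently.

```cpp
#include <cmath>
#include <cstdint>

// Hypothetical accumulator (illustrative names, not the ColumnStore API).
// Keeps a running mean and the sum of squared deviations from that mean,
// so it never subtracts two large, nearly equal quantities.
struct WelfordAccumulator
{
  uint64_t count = 0;
  double mean = 0.0;
  double m2 = 0.0;  // sum of (x - mean)^2 over the values seen so far

  // Welford update step for one new value.
  void add(double x)
  {
    ++count;
    const double delta = x - mean;               // deviation from the old mean
    mean += delta / static_cast<double>(count);  // new running mean
    m2 += delta * (x - mean);                    // old deviation times new deviation
  }

  // Population variance (divide by n); 0 for an empty input.
  double variancePopulation() const
  {
    return count > 0 ? m2 / static_cast<double>(count) : 0.0;
  }

  // Sample variance (divide by n - 1); 0 when fewer than two values were seen.
  double varianceSample() const
  {
    return count > 1 ? m2 / static_cast<double>(count - 1) : 0.0;
  }

  double stddevPopulation() const { return std::sqrt(variancePopulation()); }
  double stddevSample() const { return std::sqrt(varianceSample()); }
};
```

Feeding values with a large common offset (for example around 1e9 with a spread of a few units) is the case where the naive formula E[x^2] - E[x]^2 tends to lose most of its significant digits, while the update above works only with small deviations.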

1737 Commits
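
Note on commit 53b9a2a0f9 (MCOL-4580): the idea of encoding string prefixes as integers to compute extent ranges can be sketched as below. This is an illustration under stated assumptions, not the patch itself: it assumes a plain byte-wise (binary) ordering rather than real collation weights, an 8-byte prefix, and hypothetical names (`encodePrefix`, `ExtentRange`, `mayContain`) that do not come from the ColumnStore sources.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>

// Pack the first 8 bytes of a string into a big-endian integer, padding short
// strings with zero bytes. Under byte-wise ordering, a <= b implies
// encodePrefix(a) <= encodePrefix(b), so integer ranges bound string ranges.
// Assumption: binary ordering; a collation-aware version would encode
// collation weights instead of raw bytes.
inline uint64_t encodePrefix(const std::string& s)
{
  uint64_t v = 0;
  for (std::size_t i = 0; i < 8; ++i)
  {
    v <<= 8;
    if (i < s.size())
      v |= static_cast<unsigned char>(s[i]);
  }
  return v;
}

// Hypothetical per-extent min/max of encoded prefixes.
struct ExtentRange
{
  uint64_t lo = UINT64_MAX;
  uint64_t hi = 0;

  // Widen the range to cover one more stored string.
  void update(const std::string& s)
  {
    const uint64_t e = encodePrefix(s);
    if (e < lo) lo = e;
    if (e > hi) hi = e;
  }
};

// For an equality predicate: false means the extent certainly does not hold
// the value and can be skipped; true means it still has to be scanned.
inline bool mayContain(const ExtentRange& r, const std::string& needle)
{
  const uint64_t e = encodePrefix(needle);
  return r.lo <= e && e <= r.hi;
}
```

Because only a prefix is encoded, the check is conservative: distinct strings that share their first 8 bytes map to the same integer, so a `true` result never guarantees a match, but a `false` result is safe to act on.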