mariadb-columnstore-engine

mirror of https://github.com/mariadb-corporation/mariadb-columnstore-engine.git synced 2025-07-29 08:21:15 +03:00

Author	SHA1	Message	Date
Gagan Goel	b3a560300c	Revert "Merge pull request #2022 from mariadb-corporation/bar-develop-MCOL-4791" This reverts commit `4016e25e5b`, reversing changes made to `85435f6b1e`.	2021-07-13 11:06:56 +00:00
Gagan Goel	90e5218c71	Revert "Merge pull request #2027 from mariadb-corporation/bar-develop-MCOL-4791" This reverts commit `643c06b7fe`, reversing changes made to `c679b11ef6`.	2021-07-13 10:56:23 +00:00
Roman Nozdrin	866dc25729	Merge pull request #1842 from denis0x0D/MCOL-987_LZ MCOL-987 LZ4 compression support.	2021-07-07 13:13:18 +03:00
Roman Nozdrin	7b4f759592	Merge pull request #2032 from drrtuy/MCOL-4802 MCOL-4802 Removed ByteStream methods for bool and add some logging in…	2021-07-07 13:03:54 +03:00
Roman Nozdrin	fb5ba84212	MCOL-4802 Removed ByteStream methods for bool manipulations and add some logging into I_S.columnstore_files	2021-07-07 07:16:30 +00:00
Alexander Barkov	9794f24369	MCOL-4801 Replace Row methods getStringLength() and getStringPointer() to getConstString()	2021-07-06 21:15:32 +04:00
Denis Khalikov	cc1c3629c5	MCOL-987 Add LZ4 compression. * Adds CompressInterfaceLZ4 which uses LZ4 API for compress/uncompress. * Adds CMake machinery to search LZ4 on running host. * All methods which use static data and do not modify any internal data - become `static`, so we can use them without creation of the specific object. This is possible, because the header specification has not been modified. We still use 2 sections in header, first one with file meta data, the second one with pointers for compressed chunks. * Methods `compress`, `uncompress`, `maxCompressedSize`, `getUncompressedSize` - become pure virtual, so we can override them for the other compression algos. * Adds method `getChunkMagicNumber`, so we can verify chunk magic number for each compression algo. * Renames "s/IDBCompressInterface/CompressInterface/g" according to requirement.	2021-07-06 18:04:37 +03:00
Gagan Goel	8520f87237	MCOL-641 Cleanup.	2021-07-06 09:01:49 +00:00
Alexander Barkov	46ee6b3968	A clean-up for MCOL-4791 Fix ColumnCommand fudged data type format to clearly identify CHAR vs VARCHAR The "mIsDict" member was not copied in ColumnCommand::duplicate. Thus store() erroneously entered colCompare() for dictionary commands.	2021-07-04 09:48:33 +04:00
Alexander Barkov	e8126bede5	MCOL-4791 Fix ColumnCommand fudged data type format to clearly identify CHAR vs VARCHAR	2021-07-02 12:42:03 +04:00
Roman Nozdrin	2de4888899	Merge pull request #1990 from drrtuy/MCOL-4173_9 MCOL-4173 This patch adds support for wide-DECIMAL INNER, OUTER, SEMI…	2021-06-24 16:15:07 +03:00
Roman Nozdrin	6620d873fd	Merge pull request #1927 from denis0x0D/MCOL-4407 MCOL-4407 and condtion does not work when HWM > columnstore_string_san_threshold - 1	2021-06-24 15:58:35 +03:00
Roman Nozdrin	bed0b7c6bc	MCOL-4173 This patch adds support for wide-DECIMAL INNER, OUTER, SEMI, functional JOINs based on top of TypelessData	2021-06-24 08:07:23 +00:00
Denis Khalikov	8dd2f2937c	MCOL-4407 and condtion does not work when HWM > columnstore_string_scan_threshold - 1	2021-06-21 14:04:26 +03:00
Alexander Barkov	b3d6f62964	MCOL-4753 Performance problem in Typeless join	2021-06-10 09:26:26 +00:00
Roman Nozdrin	42e710f817	Merge pull request #1942 from mariadb-corporation/bar-develop-compile-10.6 Fixing 10.6 + develop compilation failure	2021-05-25 14:31:37 +03:00
Alexander Barkov	9608533d92	MCOL-4734 Compilation failure: MariaDB-10.6 + ColumnStore-develop mcsconfig.h and my_config.h have the following pre-processor definitions: 1. Conflicting definitions coming from the standard cmake definitions: - PACKAGE - PACKAGE_BUGREPORT - PACKAGE_NAME - PACKAGE_STRING - PACKAGE_TARNAME - PACKAGE_VERSION - VERSION 2. Conflicting definitions of other kinds: - HAVE_STRTOLL - this is a dirt in MariaDB headers. Should be fixed in the server code. my_config.h erroneously performs "#define HAVE_STRTOLL" instead of "#define HAVE_STRTOLL 1". in some cases. The former is not CMake compatible style. The latter is. 3. Non-conflicting definitions: Otherwise, mcsconfig.h and my_config.h should be mutually compatible, because both are generated by cmake on the same host machine. So they should have exactly equal definitions like "HAVE_XXX", "SIZEOF_XXX", etc. Observations: - It's OK to include both mcsconfig.h and my_config.h providing that we suppress duplicate definition of the above conflicting types #1 and #2. - There is no a need to suppress duplicate definitions mentioned in #3, as they are compatible! - my_sys.h and m_ctype.h must always follow a CMake configuation header, either my_config.h or mcsconfig.h (or both). They must never be included without any preceeding configuration header. This change make sure that we resolve conflicts by: - either disallowing inclusion of mcsconfig.h and my_config.h at the same time - or by hiding conflicting definitions #1 and #2 (with their later restoring). - also, by making sure that my_sys.h and m_ctype.h always follow a CMake configuration file. Details: - idb_mysql.h can now only be included only after my_config.h An attempt to use idb_mysql.h with mcsconfig.h instead of my_config.h is caught by the "#error" preprocessor directive. - mariadb_my_sys.h can now be only included after mcsconfig.h. An attempt to use mariadb_my_sys.h without mcscofig.h (e.g. with my_config.h) is also caught by "#error". - collation.h now can now be included in two ways. It now has the following effective structure: #if defined(PREFER_MY_CONFIG_H) && defined(MY_CONFIG_H) // Remember current conflicting definitions on the preprocessor stack // Undefine current conflicting definitions #endif #include "mcsconfig.h" #include "m_ctype.h" #if defined(PREFER_MY_CONFIG_H) && defined(MY_CONFIG_H) # Restore conflicting definitions from the preprocessor stack #endif and can be included as follows: a. using only mcsconfig.h as a configuration header: // my_config.h must not be included so far #include "collation.h" b. using my_config.h as the first included configuration file: #define PREFER_MY_CONFIG_H // Force conflict resolution #include "my_config.h" // can be included directly or indirectly ... #include "collation.h" Other changes: - Adding helper header files utils/common/mcsconfig_conflicting_defs_remember.h utils/common/mcsconfig_conflicting_defs_restore.h utils/common/mcsconfig_conflicting_defs_undef.h to perform conflict resolution easier. - Removing `#include "collation.h"` from a number of files, as it's automatically included from rowgroup.h. - Removing redundant `#include "utils_utf8.h"`. This change is not directly related to the problem being fixed, but it's nice to remove redundant directives for both collation.h and utils_utf8.h from all the files that do not really need them. (this change could probably have gone as a separate commit) - Changing my_init() to MY_INIT(argv[0]) in the MCS services sources. After the fix of the complitation failure it appeared that ColumnStore services compiled with the debug build crash due to recent changes in safemalloc. The crash happened in strcmp() with `my_progname` as an argument (where my_progname is a mysys global variable). This problem should probably be fixed on the server side as well to avoid passing NULL. But, the majority of MariaDB executable programs also use MY_INIT(argv[0]) rather than my_init(). So let's make MCS do like the other programs do.	2021-05-25 12:34:36 +04:00
Alexander Barkov	284fc51bb7	MCOL-4726 Wrong result of WHERE char1_col='A'	2021-05-21 14:40:16 +04:00
Roman Nozdrin	757f8d00a5	A plugable PoorManProfiler singleton	2021-04-14 10:54:46 +00:00
Gagan Goel	3ed1b26a2a	Merge pull request #1856 from mariadb-corporation/bar-develop-MCOL-4361 MCOL-4361 Replace pow(10.0, (double)scale) expressions with a static …	2021-04-13 07:01:33 -04:00
Alexander Barkov	362bfcd15e	MCOL-4361 Replace pow(10.0, (double)scale) expressions with a static dictionary lookup.	2021-04-09 12:41:04 +04:00
Roman Nozdrin	895cbbe2d1	This patch revives PP poorman's profiling using StopWatch class	2021-04-08 12:10:06 +00:00
Alexander Barkov	765858bc5b	MCOL-4498 LIKE is not collation aware	2021-03-22 20:42:01 +04:00
David.Hall	b35e1ee395	Merge pull request #1769 from mariadb-corporation/bar-develop-MCOL-4527 A join patch for MCOL-4527 (a performance hack) and MCOL-4539 (a bug …	2021-02-17 13:31:35 -06:00
Alexander Barkov	5bcc1cd1f0	A join patch for MCOL-4527 (a performance hack) and MCOL-4539 (a bug fix) - MCOL-4527 Simple query performace is degraded between 5.4 and 5.5 xxx_nopad_bin collations are now around 30% faster on simple queries like: SELECT * FROM t1 WHERE short_char_column_nopad_bin = 'literal' The gain is achieved by comparing two short CHAR values as uint64_t. Note, this patch does not affect xxx_bin collations! It wouldn't be correct to apply the same improvement for xxx_bin collations (i.e. with PAD SPACE attribute), because it would change the way how trailing spaces are compared. - MCOL-4539 WHERE short_char_column='literal' ignores the collation on a huge table Only the first thread used a correct collation when performing: WHERE short_char_char='literal' Other (15) threads used the server default collation, because the charsetNumber attribute was not copyed during cloning. - This patch also adds mtr/basic/suite.opt, so "mtr" can run without --extern.	2021-02-16 18:45:18 +04:00
benthompson15	afa88866bb	MCOL-4483: Fix and consolidate log files and cpimport logging.	2021-02-12 15:40:16 -06:00
benthompson15	846f7fb29b	MCOL-4193: Delete unused OAM and applications, ProcMon, ProcMgr, and no longer build all tools for packages	2021-02-08 17:51:09 -06:00
Roman Nozdrin	5fce19df0a	MCOL-4412 Introduce TypeHandler::getEmptyValueForType to return const ptr for an empty value WE changes for SQL DML and DDL operations Changes for bulk operations Changes for scanning operations Cleanup	2021-01-18 12:30:17 +00:00
Gagan Goel	a91fb15b07	Add PrimProc support for selective block loading for 16-byte columns.	2020-12-11 14:23:45 -05:00
Alexander Barkov	b08d719593	A cleanup for MCOL-4064 Make JOIN collation aware A non-JOIN condition like `WHERE c1=c2` (with c1 and c2 being columns of the same table) was not collation-aware yet after the main patches for MCOL-4064. Additionally fixing StrFilterCmd::compare*() to address this.	2020-12-08 16:43:07 +04:00
Alexander Barkov	4f6b6a8871	MCOL-4417 Non-equality comparison operators do not work well with NOPAD collations	2020-12-04 08:46:46 +04:00
Alexander Barkov	c6158eee31	Part#1 MCOL-4064 Make JOIN collation aware Making field1=field2 collation aware for long CHAR/VARCHAR.	2020-12-04 08:41:26 +04:00
Alexander Barkov	52c5af054a	Part#2 MCOL-495 Make string comparison not case sensitive Fixing field='str' for short (non-Dict) CHAR and VARCHAR data types.	2020-12-04 08:40:29 +04:00
Alexander Barkov	0ff6a6ec20	Part#1 MCOL-495 Make string comparison not case sensitive Fixing field='str' for long (Dict) string data types.	2020-12-04 07:49:00 +04:00
Alexander Barkov	2ea73846b9	MCOL-4422 Remove mariadb.h and my_sys.h dependency from collation.h	2020-11-30 14:26:35 +04:00
Roman Nozdrin	454ec4be99	Merge pull request #1613 from tntnatbry/MCOL-641-alter-table-fix MCOL-641 Fix alter table add wide decimal column.	2020-11-26 18:17:01 +03:00
Gagan Goel	c5d4a918ee	MCOL-4188 Regression fixes for MCOL-641. 1. In TupleAggregateStep::configDeliveredRowGroup(), use jobInfo.projectionCols instead of jobInfo.nonConstCols for setting scale and precision if the source column is wide decimal. 2. Tighten rules for wide decimal processing. Specifically: a. Replace (precision > INT64MAXPRECISION) checks with (precision > INT64MAXPRECISION && precision <= INT128MAXPRECISION) b. At places where (colWidth == MAXDECIMALWIDTH) is not enough to determine if a column is wide decimal or not, also add a check on type being DECIMAL/UDECIMAL.	2020-11-24 20:15:33 -05:00
Gagan Goel	995cadef2d	MCOL-641 Fix alter table add wide decimal column. This patch also removes CalpontSystemCatalog::BINARY and ddlpackage::DDL_BINARY that were added during the initial stages of the work on MCOL-641.	2020-11-20 19:49:54 -05:00
Roman Nozdrin	15b1bfa709	Fix fallthrough compilation warnings	2020-11-18 13:53:15 +00:00
Roman Nozdrin	3eb26c0d4a	MCOL-4313 Introduced TSInt128 that is a storage class for int128 Removed uint128 from joblist/lbidlist.* Another toString() method for wide-decimal that is EMPTY/NULL aware Unified decimal processing in WF functions Fixed a potential issue in EqualCompData::operator() for wide-decimal processing Fixed some signedness warnings	2020-11-18 13:53:15 +00:00
Alexander Barkov	129d5b5a0f	MCOL-4174 Review/refactor frontend/connector code	2020-11-18 13:53:15 +00:00
Roman Nozdrin	844472d812	MCOL-4313 Very fragile but high speed approach with inline ASM GCC compiler uses aligned versions of SIMD instructions expecting aligned memory blocks that is hard to implement now	2020-11-18 13:52:20 +00:00
Roman Nozdrin	1588ebe439	MCOL-641 Clean up primitives code Add int128_t support into ByteStream Fixed UTs broken after collation patch	2020-11-18 13:52:19 +00:00
Gagan Goel	d3bc68b02f	MCOL-641 Refactor initial extent elimination support. This commit also adds support in TupleHashJoinStep::forwardCPData, although we currently do not support wide decimals as join keys. Row estimation to determine large-side of the join is also updated.	2020-11-18 13:52:19 +00:00
Gagan Goel	6aea838360	MCOL-641 Add support for functions (Part 2).	2020-11-18 13:51:55 +00:00
Roman Nozdrin	b5534eb847	MCOL-641 Refactored MultiplicationOverflowCheck but it still has flaws. Introduced fDecimalOverflowCheck to enable/disable overflow check. Add support into a FunctionColumn. Low level scanning crashes on medium sized data sets.	2020-11-18 13:47:45 +00:00
Gagan Goel	74b64eb4f1	MCOL-641 1. Add support for int128_t in ParsedColumnFilter. 2. Set Decimal precision in SimpleColumn::evaluate(). 3. Add support for int128_t in ConstantColumn. 4. Set IDB_Decimal::s128Value in buildDecimalColumn(). 5. Use width 16 as first if predicate for branching based on decimal width.	2020-11-18 13:47:45 +00:00
Roman Nozdrin	b09f3088ca	MCOL-641 Initial version of Math operations for wide decimal.	2020-11-18 13:47:44 +00:00
Gagan Goel	62d0c82d75	MCOL-641 1. Templatized convertValueNum() function. 2. Allocate int128_t buffers in batchprimitiveprocessor if a query involves wide decimal columns.	2020-11-18 13:47:44 +00:00
Gagan Goel	824615a55b	MCOL-641 Refactor empty value implementation in writeengine.	2020-11-18 13:47:44 +00:00

1 2 3 4 5 ...

306 Commits