mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-08-05 13:16:09 +03:00

Author	SHA1	Message	Date
Kristian Nielsen	c73c6aea63	MDEV-33426: Aria temptables wrong thread-specific memory accounting in slave thread Aria temporary tables account allocated memory as specific to the current THD. But this fails for slave threads, where the temporary tables need to be detached from any specific THD. Introduce a new flag to mark temporary tables in replication as "global", and use that inside Aria to not account memory allocations as thread specific for such tables. Based on original suggestion by Monty. Reviewed-by: Monty <monty@mariadb.org> Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2024-02-16 12:48:30 +01:00
Oleksandr Byelkin	fa69b085b1	Merge branch '11.3' into 11.4	2024-02-15 13:53:21 +01:00
Nikita Malyavin	4246c0fa01	MDEV-22597 Add views for periods in information_schema Two new information_schema views are added: * PERIOD table -- columns TABLE_CATALOG, TABLE_SCHEMA, TABLE_NAME, PERIOD_NAME, START_COLUMN_NAME, END_COLUMN_NAME. * KEY_PERIOD_USAGE -- works similar to KEY_COLUMN_USAGE, but for periods. Columns CONSTRAINT_CATALOG, CONSTRAINT_SCHEMA, CONSTRAINT_NAME, TABLE_CATALOG, TABLE_SCHEMA, TABLE_NAME, PERIOD_NAME Two new columns are added to the COLUMNS view: IS_SYSTEM_TIME_PERIOD_START, IS_SYSTEM_TIME_PERIOD_END - contain YES/NO.	2024-02-12 22:26:06 +01:00
Marko Mäkelä	86c2c89743	Merge 10.6 into 10.11	2024-02-08 15:04:46 +02:00
Marko Mäkelä	466069b184	Merge 10.5 into 10.6	2024-02-08 10:38:53 +02:00
Marko Mäkelä	0381921e26	MDEV-33277 In-place upgrade causes invalid AUTO_INCREMENT values MDEV-33308 CHECK TABLE is modifying .frm file even if --read-only As noted in commit `d0ef1aaf61`, MySQL as well as older versions of MariaDB server would during ALTER TABLE ... IMPORT TABLESPACE write bogus values to the PAGE_MAX_TRX_ID field to pages of the clustered index, instead of letting that field remain 0. In commit `8777458a6e` this field was repurposed for PAGE_ROOT_AUTO_INC in the clustered index root page. To avoid trouble when upgrading from MySQL or older versions of MariaDB, we will try to detect and correct bogus values of PAGE_ROOT_AUTO_INC when opening a table for the first time from the SQL layer. btr_read_autoinc_with_fallback(): Add the parameters to mysql_version,max to indicate the TABLE_SHARE::mysql_version of the .frm file and the maximum value allowed for the type of the AUTO_INCREMENT column. In case the table was originally created in MySQL or an older version of MariaDB, read also the maximum value of the AUTO_INCREMENT column from the table and reset the PAGE_ROOT_AUTO_INC if it is above the limit. dict_table_t::get_index(const dict_col_t &) const: Find an index that starts with the specified column. ha_innobase::check_for_upgrade(): Return HA_ADMIN_FAILED if InnoDB needs upgrading but is in read-only mode. In this way, the call to update_frm_version() will be skipped. row_import_autoinc(): Adjust the AUTO_INCREMENT column at the end of ALTER TABLE...IMPORT TABLESPACE. This refinement was suggested by Debarun Banerjee. The changes outside InnoDB were developed by Michael 'Monty' Widenius: Added print_check_msg() service for easy reporting of check/repair messages in ENGINE=Aria and ENGINE=InnoDB. Fixed that CHECK TABLE do not update the .frm file under --read-only. Added 'handler_flags' to HA_CHECK_OPT as a way for storage engines to store state from handler::check_for_upgrade(). Reviewed by: Debarun Banerjee	2024-02-08 10:35:45 +02:00
Oleksandr Byelkin	d21cb43db1	Merge branch '11.2' into 11.3	2024-02-04 16:42:31 +01:00
Sergei Golubchik	79580f4f96	Merge branch '11.1' into 11.2	2024-02-02 17:43:57 +01:00
Sergei Golubchik	b6680e0101	Merge branch '11.0' into 11.1	2024-02-02 11:30:47 +01:00
Marko Mäkelä	9374772ecd	Merge 10.11 into 11.0	2024-01-19 09:07:48 +02:00
Marko Mäkelä	ad13fb36bf	Merge 10.6 into 10.11	2024-01-17 17:37:15 +02:00
Yuchen Pei	931df937e9	MDEV-32559 failing spider signal_ddl_recovery_done callback should result in spider deinit Since `0930eb86cb`, system table creation needed for spider init is delayed to the signal_ddl_recovery_done callback. Since it is part of the init, failure should result in spider deinit. We also remove the call to spider_init_system_tables() from spider_db_init(), as it was removed in the commit mentioned above and accidentally restored in a merge.	2024-01-16 17:17:50 +11:00
Oleksandr Byelkin	67a0224a3e	Merge branch '11.2' into 11.3	2023-11-19 08:44:35 +01:00
Nikita Malyavin	929532a942	fix uninitialized field warnings	2023-11-19 08:38:09 +01:00
Oleksandr Byelkin	34272bd6a5	Merge branch '11.2' into 11.3	2023-11-14 18:33:03 +01:00
Nikita Malyavin	23f9e34256	MDEV-32444 Data from orphaned XA transaction is lost after online alter XA support for online alter was totally missing. Tying on binlog_hton made this hardly visible: simply having binlog_commit called from xa_commit made an impression that it will automagically work for online alter, which turns out wrong: all binlog does is writes "XA END" into trx cache and flushes it to a real binlog. In comparison, online alter can't do the same, since online replication happens in a single transaction. Solution: make a dedicated XA support. * Extend struct xid_t with a pointer to Online_alter_cache_list * On prepare: move online alter cache from THD::ha_data to XID passed * On XA commit/rollback: use the online alter cache stored in this XID. This makes us pass xid_cache_element->xid to xa_commit/xa_rollback instead of lex->xid * Use manual memory management for online alter cache list, instead of mem_root allocation, since we don't have mem_root connected to the XA transaction.	2023-11-04 11:53:28 +04:00
Nikita Malyavin	cb52174693	online alter: extract the source to a separate file Move all the functions dedicated to online alter to a newly created online_alter.cc. With that, make many functions static and simplify the static functions naming. Also, rename binlog_log_row_online_alter -> online_alter_log_row.	2023-11-02 22:58:03 +04:00
Nikita Malyavin	830bdfccbd	MDEV-32126 Assertion fails upon online ALTER and binary log enabled Assertion `!writer.checksum_len \|\| writer.remains == 0' fails upon concurrent online ALTER and transactions with failing statements and binary log enabled. Also another assertion, `pos != (~(my_off_t) 0)', fails in my_seek, upon reinit_io_cache, on a simplified test. This means that IO_CACHE wasn't properly initialized, or had an error before. The overall problem is a deep interference with the effect of an installed binlog_hton: the assumption about that thd->binlog_get_cache_mngr() is, sufficiently, NULL, when we shouldn't run the binlog part of binlog_commit/binlog_rollback, is wrong: as turns out, sometimes the binlog handlerton can be not installed in current thd, but binlog_commit can be called on behalf of binlog, as in the bug reported. One separate condition found is XA recovery of the orphaned transaction, when binlog_commit is also called, but it has nothing to do with online alter. Solution: Extract online alter operations into a separate handlerton.	2023-11-02 22:58:03 +04:00
Nikita Malyavin	28b4037242	Merge branch '11.2' into 11.3	2023-09-21 14:15:04 +04:00
Alexander Barkov	8ad1e26b1b	MDEV-32081 Remove my_casedn_str() from get_canonical_filename() - Moving get_canonical_filename() from a public function to a method in handler. - Adding a helper method is_canonical_filename() to handler. - Adding helper methods left(), substr(), starts_with() to Lex_cstring. - Adding helper methods is_sane(), buffer_overlaps(), max_data_size() to CharBuffer. - Adding append_casedn() to CharBuffer. It implements the main functionality that replaces the being removed my_casedn_str() call. - Adding a class Table_path_buffer, a descendant of CharBuffer with size FN_REFLEN. - Changing get_canonical_filename() to get a pointer to Table_path_buffer instead just a pointer to char. - Changing the data type of the "path" parameter and the return type of get_canonical_filename() from char* to Lex_cstring.	2023-09-04 09:36:44 +04:00
Alexander Barkov	21218d3c9e	MDEV-31986 Remove old check_db_name() from make_table_name_list() - Replacing the old style inplace check_db_name() in make_table_name_list() to the new style non-modifying code - Adding "const" qualifier to the "db" parameter to ha_discover_table_names() and its dependency functions.	2023-08-23 08:12:47 +04:00
Sergei Golubchik	18ddde4826	Merge branch '11.1' into 11.2	2023-08-18 00:59:16 +02:00
Nikita Malyavin	30c965f866	MDEV-31777 ER_GET_ERRNO upon online alter on CONNECT table Forbid Online for CONNECT.	2023-08-15 14:00:28 +02:00
Nikita Malyavin	70491fb07b	MDEV-31677 Assertion failed upon online ALTER with binlog_row_image=NOBLOB Make binlog_prepare_row_images accept image type as an argument.	2023-08-15 14:00:28 +02:00
Nikita Malyavin	b3f988d260	Add const to get_foreign_key_list/get_parent_foreign_key_list	2023-08-15 10:16:13 +02:00
Nikita Malyavin	c76072db93	MDEV-31033 ER_KEY_NOT_FOUND upon online COPY ALTER on a partitioned table The row events were applied "twice": once for the ha_partition, and one more time for the underlying storage engine. There's no such problem in binlog/rpl, because ha_partiton::row_logging is normally set to false. The fix makes the events replicate only when the handler is a root handler. We will try to guess this by comparing it to table->file. The same approach is used in the MDEV-21540 fix, `231feabd`. The assumption is made, that the row methods are only called for table->file (and never for a cloned handler), hence the assertions are added in ha_innobase and ha_myisam to make sure that this is true at least for those engines Also closes MDEV-31040, however the test is not included, since we have no convenient way to construct a deterministic version.	2023-08-15 10:16:13 +02:00
Sergei Golubchik	ea46fdcea4	cleanup, remove dead code	2023-08-15 10:16:12 +02:00
Sergei Golubchik	64b55151f4	separate online_alter_cache_data from binlog_cache_data	2023-08-15 10:16:12 +02:00
Sergei Golubchik	d767ed5c89	remove handler::open_read_view() ht->start_consistent_snapshot() is also not a way, because some engines (e.g. rocksdb) only do it readonly. instead, downgrade the lock after reading the first row (which implicitly opens a read view).	2023-08-15 10:16:11 +02:00
Nikita Malyavin	ab4bfad206	MDEV-16329 [5/5] ALTER ONLINE TABLE * Log rows in online_alter_binlog. * Table online data is replicated within dedicated binlog file * Cached data is written on commit. * Versioning is fully supported. * Works both wit and without binlog enabled. * For now savepoints setup is forbidden while ONLINE ALTER goes on. Extra support is required. We can simply log the SAVEPOINT query events and replicate them together with row events. But it's not implemented for now. * Cache flipping: We want to care for the possible bottleneck in the online alter binlog reading/writing in advance. IO_CACHE does not provide anything better that sequential access, besides, only a single write is mutex-protected, which is not suitable, since we should write a transaction atomically. To solve this, a special layer on top Event_log is implemented. There are two IO_CACHE files underneath: one for reading, and one for writing. Once the read cache is empty, an exclusive lock is acquired (we can wait for a currently active transaction finish writing), and flip() is emitted, i.e. the write cache is reopened for read, and the read cache is emptied, and reopened for writing. This reminds a buffer flip that happens in accelerated graphics (DirectX/OpenGL/etc). Cache_flip_event_log is considered non-blocking for a single reader and a single writer in this sense, with the only lock held by reader during flip. An alternative approach by implementing a fair concurrent circular buffer is described in MDEV-24676. * Cache managers: We have two cache sinks: statement and transactional. It is important that the changes are first cached per-statement and per-transaction. If a statement fails, then only statement data is rolled back. The transaction moves along, however. Turns out, there's no guarantee that TABLE well persist in thd->open_tables to the transaction commit moment. If an error occurs, tables from statement are purged. Therefore, we can't store te caches in TABLE. Ideally, it should be handlerton, but we cut the corner and store it in THD in a list.	2023-08-15 10:16:11 +02:00
Nikita Malyavin	d2d0995cf2	MDEV-16329 [4/5] Refactor MYSQL_BIN_LOG: extract Event_log ancestor Event_log is supposed to be a basic logging class that can write events in a single file. MYSQL_BIN_LOG in comparison will have: * rotation support * index files * purging * gtid and transactional information handling. * is dedicated for a general-purpose binlog	2023-08-15 10:16:11 +02:00
Nikita Malyavin	6427e343cf	MDEV-16329 [3/5] use binlog_cache_data directly in most places * Eliminate most usages of THD::use_trans_table. Only 3 left, and they are at quite high levels, and really essential. * Eliminate is_transactional argument when possible. Lots of places are left though, because of some WSREP error handling in MYSQL_BIN_LOG::set_write_error. * Remove junk binlog functions from THD * binlog_prepare_pending_rows_event is moved to log.cc inside MYSQL_BIN_LOG and is not anymore template. Instead it accepls event factory with a type code, and a callback to a constructing function in it.	2023-08-15 10:16:11 +02:00
Nikita Malyavin	429f635f30	MDEV-16329 [2/5] refactor binlog and cache_mngr pump up binlog and cache manager to level of binlog_log_row_internal	2023-08-15 10:16:11 +02:00
Oleksandr Byelkin	f5fae75652	Merge branch '11.0' into 11.1	2023-08-09 08:25:14 +02:00
Oleksandr Byelkin	51f9d62005	Merge branch '10.11' into 11.0	2023-08-09 07:53:48 +02:00
Oleksandr Byelkin	036df5f970	Merge branch '10.10' into 10.11	2023-08-08 14:57:31 +02:00
Oleksandr Byelkin	34a8e78581	Merge branch '10.6' into 10.9	2023-08-04 08:01:06 +02:00
Oleksandr Byelkin	5ea5291d97	Merge branch '10.5' into 10.6	2023-08-04 07:52:54 +02:00
Sergei Golubchik	61acb43689	MDEV-31822 ALTER TABLE ENGINE=x started failing instead of producing warning on unsupported TRANSACTIONAL=1 make TRANSACTIONAL table option behave similar to other engine-defined table options. If the engine doesn't suport it: * if specified expicitly in CREATE or ALTER - it's ER_UNKNOWN_OPTION * an error or a warning depending on sql_mode IGNORE_BAD_TABLE_OPTIONS * in ALTER TABLE from the engine that suppors it to the engine that doesn't - silently preserved (no warning) * it is commented out in SHOW CREATE unless IGNORE_BAD_TABLE_OPTIONS	2023-08-02 14:45:31 +02:00
Sergei Golubchik	ab1191c039	cleanup: key->key_create_info.check_for_duplicate_indexes -> key->old mark old keys in the ALTER TABLE with the `old` flag, not with the `key_create_info.check_for_duplicate_indexes`. This allows to mark old foreign keys too.	2023-08-01 22:43:16 +02:00
Marko Mäkelä	e81fa34502	Merge 11.1 into 11.2	2023-07-26 15:49:24 +03:00
Marko Mäkelä	c6ac1e39b6	Merge 11.0 into 11.1	2023-07-26 15:13:43 +03:00
Marko Mäkelä	f2b4972bd4	Merge 10.11 into 11.0	2023-07-26 15:13:06 +03:00
Marko Mäkelä	bce3ee704f	Merge 10.10 into 10.11	2023-07-26 14:44:43 +03:00
Alexander Barkov	75f25e4ca7	MDEV-30164 System variable for default collations This patch adds a way to override default collations (or "character set collations") for desired character sets. The SQL standard says: > Each collation known in an SQL-environment is applicable to one > or more character sets, and for each character set, one or more > collations are applicable to it, one of which is associated with > it as its character set collation. In MariaDB, character set collations has been hard-coded so far, e.g. utf8mb4_general_ci has been a hard-coded character set collation for utf8mb4. This patch allows to override (globally per server, or per session) character set collations, so for example, uca1400_ai_ci can be set as a character set collation for Unicode character sets (instead of compiled xxx_general_ci). The array of overridden character set collations is stored in a new (session and global) system variable @@character_set_collations and can be set as a comma separated list of charset=collation pairs, e.g.: SET @@character_set_collations='utf8mb3=uca1400_ai_ci,utf8mb4=uca1400_ai_ci'; The variable is empty by default, which mean use the hard-coded character set collations (e.g. utf8mb4_general_ci for utf8mb4). The variable can also be set globally by passing to the server startup command line, and/or in my.cnf.	2023-07-17 14:56:17 +04:00
Marko Mäkelä	7cde5c539b	Merge 10.6 into 10.9	2023-07-10 11:22:21 +03:00
Monty	99bd226059	MDEV-31558 Add InnoDB engine information to the slow query log The new statistics is enabled by adding the "engine", "innodb" or "full" option to --log-slow-verbosity Example output: # Pages_accessed: 184 Pages_read: 95 Pages_updated: 0 Old_rows_read: 1 # Pages_read_time: 17.0204 Engine_time: 248.1297 Page_read_time is time doing physical reads inside a storage engine. (Writes cannot be tracked as these are usually done in the background). Engine_time is the time spent inside the storage engine for the full duration of the read/write/update calls. It uses the same code as 'analyze statement' for calculating the time spent. The engine statistics is done with a generic interface that should be easy for any engine to use. It can also easily be extended to provide even more statistics. Currently only InnoDB has counters for Pages_% and Undo_% status. Engine_time works for all engines. Implementation details: class ha_handler_stats holds all engine stats. This class is included in handler and THD classes. While a query is running, all statistics is updated in the handler. In close_thread_tables() the statistics is added to the THD. handler::handler_stats is a pointer to where statistics should be collected. This is set to point to handler::active_handler_stats if stats are requested. If not, it is set to 0. handler_stats has also an element, 'active' that is 1 if stats are requested. This is to allow engines to avoid doing any 'if's while updating the statistics. Cloned or partition tables have the pointer set to the base table if status are requested. There is a small performance impact when using --log-slow-verbosity=engine: - All engine calls in 'select' will be timed. - IO calls for InnoDB reads will be timed. - Incrementation of counters are done on local variables and accesses are inline, so these should have very little impact. - Statistics has to be reset for each statement for the THD and each used handler. This is only 40 bytes, which should be neglectable. - For partition tables we have to loop over all partitions to update the handler_status as part of table_init(). Can be optimized in the future to only do this is log-slow-verbosity changes. For this to work we have to update handler_status for all opened partitions and also for all partitions opened in the future. Other things: - Added options 'engine' and 'full' to log-slow-verbosity. - Some of the new files in the test suite comes from Percona server, which has similar status information. - buf_page_optimistic_get(): Do not increment any counter, since we are only validating a pointer, not performing any buf_pool.page_hash lookup. - Added THD argument to save_explain_data_intern(). - Switched arguments for save_explain_.*_data() to have always THD first (generates better code as other functions also have THD first).	2023-07-07 12:53:18 +03:00
Marko Mäkelä	3883eb63dc	Merge 11.0 into 11.1	2023-06-08 14:09:21 +03:00
Marko Mäkelä	5fb2c031f7	Merge 10.11 into 11.0	2023-06-08 13:49:48 +03:00
Monty	ded4ed3220	MDEV-30944 Range_rowid_filter::fill() leaves file->keyread at MAX_KEY This test case exposed 2 different bugs: - When replacing a range with an index scan on a covering key in test_if_skip_sort_order() we didn't disable filtering. Filtering does not make much sense in this case. - Fixed by disabling filtering in this case. - Range_rowid_filter::fill() did not take into account that keyread could already active, which caused an assert when it tried to activate another keyread. - Fixed by remembering old keyread state at start and restoring it at end. Other things: - ha_start_keyread() allowed multiple calls. This is wrong, especially as we do no check if the index changed! I added an assert() to ensure that we don't call it there is already an active keyread. - ha_end_keyread() always called ha_extra(), even if keyread was not active. Added a check to avoid the extra call.	2023-06-07 18:44:12 +03:00

1 2 3 4 5 ...

2264 Commits