mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-07-24 19:42:23 +03:00

Author	SHA1	Message	Date
Marko Mäkelä	43465352b9	Merge 11.4 into 11.6	2024-10-03 16:09:56 +03:00
Marko Mäkelä	12a91b57e2	Merge 10.11 into 11.2	2024-10-03 13:24:43 +03:00
Monty	dfdedd46e4	MDEV-32188 make TIMESTAMP use whole 32-bit unsigned range This patch extends the timestamp from 2038-01-19 03:14:07.999999 to 2106-02-07 06:28:15.999999 for 64 bit hardware and OS where 'long' is 64 bits. This is true for 64 bit Linux but not for Windows. This is done by treating the 32 bit stored int as unsigned instead of signed. This is safe as MariaDB has never accepted dates before the epoch (1970). The benefit of this approach that for normal timestamp the storage is compatible with earlier version. However for tables using system versioning we before stored a timestamp with the year 2038 as the 'max timestamp', which is used to detect current values. This patch stores the new 2106 year max value as the max timestamp. This means that old tables using system versioning needs to be updated with mariadb-upgrade when moving them to 11.4. That will be done in a separate commit.	2024-05-27 12:39:02 +02:00
Nikita Malyavin	5669cb7323	fix race in the MDEV-32614 test Sometimes 'continue' signal could be missed.	2024-05-03 23:01:10 +02:00
Nikita Malyavin	ba6f9943b2	online alter: show examined rows in the progress report	2024-01-31 22:04:39 +01:00
Nikita Malyavin	50095046f3	MDEV-32614 LeakSanitizer errors in copy_data_between_tables The memory leak occurs on error when backup_reset_alter_copy_lock fails with timeout. This leads to the alter rollback, but flush_unused is not called. Move table flushing on error handling to a single place and mind more possible failures this time.	2024-01-30 02:48:02 +01:00
Nikita Malyavin	c2e16b3ad5	MDEV-32803 Assertion `total == 0' failed in Event_log::write_cache_raw A second DML in a transaction for a table of non-rollbackable engine leads to a cache corruption, because the cache wasn't reset after a statement end, but also wasn't destroyed. This patch resets the cache for a reuse by subsequent statements in current transaction.	2024-01-30 02:48:02 +01:00
Nikita Malyavin	e6acddf121	main.alter_table_online_debug: fix race in XA tests	2023-11-14 15:41:53 +01:00
Nikita Malyavin	d59d883631	MDEV-32771 Server crash upon online alter with concurrent XA In case of a non-recovery XA rollback/commit in the same connection, thon->rollback is called instead of rollback_by_xid, Though previously, thd_ha_data was moved to thd->transaction->xid_state.xid in hton->prepare. Like it wasn't enough, XA PREPARE can be skipped upon user and thus we can end up in hton->commit/rollback with and unprepared XA, so checking xid_state.is_explicit_XA is not enough -- we should check xid_state.get_state_code() == XA_PREPARED, which will also guarantee is_explicit_XA() == true.	2023-11-14 15:41:53 +01:00
Nikita Malyavin	f7646d890b	main.alter_table_online_debug: remove explicit innodb	2023-11-04 11:53:28 +04:00
Nikita Malyavin	23f9e34256	MDEV-32444 Data from orphaned XA transaction is lost after online alter XA support for online alter was totally missing. Tying on binlog_hton made this hardly visible: simply having binlog_commit called from xa_commit made an impression that it will automagically work for online alter, which turns out wrong: all binlog does is writes "XA END" into trx cache and flushes it to a real binlog. In comparison, online alter can't do the same, since online replication happens in a single transaction. Solution: make a dedicated XA support. * Extend struct xid_t with a pointer to Online_alter_cache_list * On prepare: move online alter cache from THD::ha_data to XID passed * On XA commit/rollback: use the online alter cache stored in this XID. This makes us pass xid_cache_element->xid to xa_commit/xa_rollback instead of lex->xid * Use manual memory management for online alter cache list, instead of mem_root allocation, since we don't have mem_root connected to the XA transaction.	2023-11-04 11:53:28 +04:00
Nikita Malyavin	830bdfccbd	MDEV-32126 Assertion fails upon online ALTER and binary log enabled Assertion `!writer.checksum_len \|\| writer.remains == 0' fails upon concurrent online ALTER and transactions with failing statements and binary log enabled. Also another assertion, `pos != (~(my_off_t) 0)', fails in my_seek, upon reinit_io_cache, on a simplified test. This means that IO_CACHE wasn't properly initialized, or had an error before. The overall problem is a deep interference with the effect of an installed binlog_hton: the assumption about that thd->binlog_get_cache_mngr() is, sufficiently, NULL, when we shouldn't run the binlog part of binlog_commit/binlog_rollback, is wrong: as turns out, sometimes the binlog handlerton can be not installed in current thd, but binlog_commit can be called on behalf of binlog, as in the bug reported. One separate condition found is XA recovery of the orphaned transaction, when binlog_commit is also called, but it has nothing to do with online alter. Solution: Extract online alter operations into a separate handlerton.	2023-11-02 22:58:03 +04:00
Nikita Malyavin	46ee272a10	MDEV-32100 Online ALTER TABLE ends with 1032 under some isolation levels 1032 (Can't find record) could be emitted when ALTER TABLE is execued vs concurrent DELETE/UPDATE/other DML that would require search on the online ALTER's side. Innodb's INPLACE, in comparison, creates a new trx_t and uses it in scope of the alter table context. ALTER TABLE class of statements (i.g. CREATE INDEX, OPTIMIZE, etc.) is expected to be unaffected by the value of current session's transaction isolation. This patch save-and-restores thd->tx_isolation and sets in to ISO_REPEATABLE_READ for almost a whole mysql_alter_table duration, to avoid any possible side-effect of it. This should be primarily done before the lock_tables call, to initialize the storage engine's local value correctly during the store_lock() call. sql_table.cc: set thd->tx_isolation to ISO_REPEATABLE_READ in mysql_alter_table and then restore it to the original value in the end of the call.	2023-11-02 22:58:02 +04:00
Nikita Malyavin	8aa1a9e6a7	MDEV-31812 Add switch to old_mode to disable non-locking ALTER Add LOCK_ALTER_TABE_COPY bit to old_mode. Disables online copy by default, but still allows to force it with explicit lock=none	2023-08-15 14:00:28 +02:00
Nikita Malyavin	44ca37ef17	MDEV-31631 Adding auto-increment to table with history online misbehaves Adding an auto_increment column online leads to an undefined behavior. Basically any DEFAULTs that depend on a row order in the table, or on the non-deterministic (in scope of the ALTER TABLE statement) function is UB. For example, NOW() is considered generally non-deterministic (Item_func_now_utc is marked with VCOL_NON_DETERMINISTIC), but it's fixed in scope of a single statement. Same for any other function that depends only on the session/status vars apart from its arguments. Only two UB cases are known: * adding new AUTO_INCREMENT column. Modifying the existing column may be fine under certain circumstances, see MDEV-31058. * adding new column with DEFAULT(nextval(...)). Modifying the existing column is possible, since its value will be always present in the online event, except for the NULL -> NOT NULL modification	2023-08-15 14:00:28 +02:00
Nikita Malyavin	e026a366bf	MDEV-31776 Online ALTER reports the number of affected rows incorrectly Add a new virtual function that will increase the inserted rows count for the insert log event and decrease it for the delete event. Reuses Rows_log_event::m_row_count on the replication side, which was only set on the logging side.	2023-08-15 14:00:28 +02:00
Nikita Malyavin	70491fb07b	MDEV-31677 Assertion failed upon online ALTER with binlog_row_image=NOBLOB Make binlog_prepare_row_images accept image type as an argument.	2023-08-15 14:00:28 +02:00
Nikita Malyavin	d5e59c983f	MDEV-31646 Online alter applies binlog cache limit to cache writes 1. Make online disk writes unlimited, same as filesort does. 2. Make proper error handling -- in 32-bit build IO_CACHE capacity limit is 4GB, so it is quite possible to overfill there. 3. Event_log::write_cache complicated with event reparsing, and as it was proven by QA, contains some mistakes. Rewrite introbuce a simpler and much faster version, not featuring reparsing and therefore copying a whole buffer at once. This also disables checksums and crypto. 4. Handle read_log_event errors correctly: error returned is -1 (eof signal for alter table), and my_error is not called. Call my_error and always return 1. There's no test for this, since it shouldn't happen, see the next bullet. 5. An event could be written partially in case of error, if it's bigger than the IO_CACHE buffer. Restore the position where it was before the error was emitted. As a result, online alter is untied of several binlog variables, which was a second aim of this patch.	2023-08-15 13:59:07 +02:00
Nikita Malyavin	d7b0c6d8a8	MDEV-30987 main.alter_table_online times out with view-protocol disabie view-protocol for certain places: 1. While a transaction is in progress: a view gets locked for a transaction duration, so DROP VIEW from a servicing conneciton deadlocks. We can't just disable servicing connection, as CREATE VIEW and DROP VIEW commit implicitly. 2. For querying Opened_table_definitions: the actual query executed is `SELECT * FROM mysqltest_tmp_v`, then the view is dropped	2023-08-15 10:16:13 +02:00
Nikita Malyavin	55d1645d5b	MDEV-31059 "Slave SQL" errors upon concurrent DML and erroneous ALTER Skip more rpl-related error handling. Also move the error check inside if (table) -- otherwise the error should be handled already.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	500379cf49	unpack_row: unpack a correct number of fields	2023-08-15 10:16:13 +02:00
Nikita Malyavin	af82d56a51	unpack_row: set the correct fields in has_value_set for online alter	2023-08-15 10:16:13 +02:00
Nikita Malyavin	ecb9db4c3d	MDEV-30949 Direct leak in binlog_online_alter_end_trans when committing a big transaction, online_alter_cache_log creates a cache file. It wasn't properly closed, which was spotted by a memory leak from my_register_filename. A temporary file also remained open. Binlog wasn't affected by this, since it features its own file management. A proper closing is calling close_cached_file. It deinits io_cache and closes the underlying file. After closing, the file is expected to be deleted automagically.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	0695f7dd7b	MDEV-31043 ER_KEY_NOT_FOUND upon concurrent ALTER and transaction Non-transactional engines changes are visible immediately the row operation succeeds, so we should apply the changes to the online buffer immediately after the statement ends.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	c76072db93	MDEV-31033 ER_KEY_NOT_FOUND upon online COPY ALTER on a partitioned table The row events were applied "twice": once for the ha_partition, and one more time for the underlying storage engine. There's no such problem in binlog/rpl, because ha_partiton::row_logging is normally set to false. The fix makes the events replicate only when the handler is a root handler. We will try to guess this by comparing it to table->file. The same approach is used in the MDEV-21540 fix, `231feabd`. The assumption is made, that the row methods are only called for table->file (and never for a cloned handler), hence the assertions are added in ha_innobase and ha_myisam to make sure that this is true at least for those engines Also closes MDEV-31040, however the test is not included, since we have no convenient way to construct a deterministic version.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	5361b87093	add partition test	2023-08-15 10:16:13 +02:00
Nikita Malyavin	3ad0e7edd1	MDEV-30924 Server crashes in MYSQL_LOG::is_open upon ALTER vs FUNCTION ASAN showed use-after-free in binlog_online_alter_end_trans, during running through thd->online_alter_cache_list. In online_alter_binlog_get_cache_data, new_cache_data was allocated on thd->mem_root, in case of autocommit=1, but this mem_root could be freed in sp_head::execute, upon using stored functions. It appears that thd->transaction->mem_root exists even in single-stmt transaction mode (i.e autocommit=1), so it can be used in all cases. This mem_root will remain valid till the end of transaction, including commit phase.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	6b35d6a909	MDEV-30925 Assertion failed in translog_write_record in ONLINE ALTER + Aria This is the corner case of ONLINE ALTER vs ha_maria vs App-time Periods. When a Delete_rows_event (or update) is executed, a lookup handler may be created, normally to serve long unique index needs, by a call of handler::prepare_for_insert. This function also creates a lookup handler if an application-time period exists in a table. A difference with a usual call of prepare_for_insert is that transactions are disabled for this table during ALTER TABLE. See mysql_trans_prepare_alter_copy_data call in copy_data_between_tables. Then, ha_maria calls _ma_tmp_disable_logging_for_table during ha_maria::external_lock. It never happened so before, that two handlers would be created for write to a single ha_maria table under transactions disabled. Hence, the fix handles this scenario. It could be done otherwise, by not creating this lookup handler (since it's not used anyway during ONLINE ALTER), but architecturally, two handlers should be supported. Avoiding the creation of lookup handler could be done here additionally, but with a cost of slowing down other more generic cases, with an additional check of online alter table active.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	6f78efc01c	MDEV-30902 Server crash in LEX::first_lists_tables_same ONLINE ALTER TABLE uses binlog events like the replication does. Before it was never used outside of replication, so significant change was required. For example, a single event had a statement-like befavior: it locked the tables, opened it, and closed them in the end. But for ONLINE ALTER we use preopened table. A crash scenario is following: lex->query_tables was set to NULL in restore_empty_query_table_list when alter event is applied. Then lex->query_tables->prev_global was write-accessed in LEX::first_lists_tables_same, leading to a segfault. In replication restore_empty_query_table_list would mean resetting lex before next query or event. In ONLINE ALTER TABLE we reuse a locked table between the events, so we should avoid it. Here the need to reset lex state (or close the tables) can be determined by nonzero rgi->tables_to_lock_count. If no table is locked, then event doesn't own the tables. The same was already done before for rgi->slave_close_thread_tables call.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	41697008fe	MDEV-29069 follow-up: improve DEFAULT rules previously, fields with DEFAULTs were allowed just when expression is deterministic. In case of online alter, we should recursively check that underlying fields of expression also either have explicit values, or have DEFAULT following this validity rule.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	2be4c836e5	MDEV-29069 ER_KEY_NOT_FOUND on online autoinc addition + concurrent DELETE We can't rely on keys formed with columns that were added during this ALTER. These columns can be set with non-deterministic values, which can end up with broken or incorrect search. The same applies to the keys that contain reliable columns, but also have bogus ones. Using them can narrow the search, but they're also ignored. Also, added columns shouldn't be considered during the record match. To determine them, table->has_value_set bitmap is used. To fill has_value_set bitmap in the find_key call, extra unpack_row call has been added. For replication case, extra replica columns can be considered for this case. We try to ignore them, too.	2023-08-15 10:16:13 +02:00
Nikita Malyavin	754c8dab52	MDEV-29038 XA assertions failing in binlog_rollback and binlog_commit ONLINE ALTER TABLE adds binlog handlerton into ha_list, so any rollback command can end up calling binlog_rollback having no cache_mngr, if binlog is not enabled. The assertion should be fixed in the same manner as DBUG_ASSERT(WSREP(thd))	2023-08-15 10:16:12 +02:00
Nikita Malyavin	6e0f456090	MDEV-29013 ER_KEY_NOT_FOUND/lock timeout upon online alter with long unique 1. ER_KEY_NOT_FOUND general replcation problem, already fixed earlier. test added. 2. ER_LOCK_WAIT_TIMEOUT This is a long unique specific problem. Sometimes, lookup_handler is created for to->file. To properly free it, ha_reset should be called. It is usually done by calling close_thread_table, but ALTER TABLE makes it differently. Hence, a single ha_reset call is added to mysql_alter_table. Also, event_mem_root is removed. Normally, no per-event data should be allocated on thd->mem_root, that would mean a leak. And otherwise, lookup_handler is lazily allocated, but its lifetime matches statement, not event.	2023-08-15 10:16:12 +02:00
Nikita Malyavin	da5df33927	rpl: check should go after defaults and vcols update	2023-08-15 10:16:12 +02:00
Sergei Golubchik	aa1a2507f5	MDEV-29067 Online alter ignores check constraint violation	2023-08-15 10:16:12 +02:00
Sergei Golubchik	472c3d082f	don't do ALTER IGNORE TABLE online because online means we'll apply events from the binlog, and ignore means that bad rows will be skipped. So a bad Write_row_log_event will be skipped and a following Update_row_log_event will fail to apply.	2023-08-15 10:16:12 +02:00
Nikita Malyavin	aa9e173e9e	MDEV-29021 add test case from MDEV-29013 This test case was also fixed by adding update_virtual_columns.	2023-08-15 10:16:12 +02:00
Nikita Malyavin	b0db7239b1	Do not ignore sql_mode when replicating Division by zero is a good example. sql_mode is basically ignored by replication, see Bug#56662. The behavior for ONLINE should remain the same as for non-ONLINE ALTER.	2023-08-15 10:16:12 +02:00
Nikita Malyavin	93fb92d3f9	MDEV-29021 ALTER TABLE fails when a stored virtual column is dropped+added We shouldn't rely on `fill_extra_persistent_columns`, as it only updates fields which have an index > cols->n_bits (replication bitmap width). Actually, it should never be used, as its approach is error-prone. Normal update_virtual_fields+update_default_fields should be done.	2023-08-15 10:16:12 +02:00
Nikita Malyavin	2ed03a41e6	MDEV-28930 ALTER TABLE Deadlocks with parallel TL_WRITE ALTER ONLINE TABLE acquires table with TL_READ. Myisam normally acquires TL_WRITE for DML, which makes it hang until table is freed. We deadlock once ALTER upgrades its MDL lock. Solution: Unlock table earlier. We don't need to hold TL_READ once we finished copying. Relay log replication requires no data locks on `from` table.	2023-08-15 10:16:12 +02:00
Sergei Golubchik	8fbdc76038	MDEV-28967 Assertion `marked_for_write_or_computed()' failed in Field_new_decimal::store_value / online_alter_read_from_binlog` in the catch-up phase of the online alter we apply row events, they're unpacked into `from->record[0]` and then converted to `to->record[0]`. This needs all fields of `from` to be in the `write_set`. Although practically `Field::unpack()` does not assert the `write_set`, and `Field::reset()` - used when a field value is not present in the after-image - also doesn't assert the `write_set` for many types, `Field_new_decimal::reset()` does.	2023-08-15 10:16:12 +02:00
Sergei Golubchik	01b3cb2f46	cleanup: whitespace, etc	2023-08-15 10:16:12 +02:00
Sergei Golubchik	93049e3de6	MDEV-28959 Online alter ignores strict table mode * don't disable warnings when catching up * do propagate warnings up like copy_data_between_tables() does	2023-08-15 10:16:12 +02:00
Sergei Golubchik	754333e6ad	test rename alter_table_online -> alter_table_online_debug	2023-08-15 10:16:12 +02:00

44 Commits