mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-11-10 23:02:54 +03:00

Author	SHA1	Message	Date
sjaakola	a1e70388c4	MDEV-24966 Galera multi-master regression After the merging of MDEV-24915, 10.6 branch has regressions with handling of concurrent write load against two or more cluster nodes. These regressions may surface as cluster hanging, node crashes or data inconsistency. With some test scenarios, the only visible symptom could be that the BF victim aborting happens only by innodb lock wait timeout expiration. This would result only to poor performance (by default 50 sec hang for each BF conflict), and could be somewhat difficult to diagnose. This pull request has following fixes to handle concurrent write load from multiple nodes: In lock_wait_wsrep_kill(), the victim trx was expected to be only in TRX_STATE_ACTIVE state. With the delayed BF conflict handling, it can happen that victim has advanced into pre commit state. This was fixed by choosing victim both in TRX_STATE_ACTIVE and TRX_STATE_PREPARED states. Victim transaction may be in several different states at the time of detected lock conflict, and due to delayed BF aborting practice in MDEV-24915, the victim may advance further before the actual BF aborting takes place. The BF aborting in MDEV-24915 did not wake the victim, if it was in the state of waiting for some other lock (than the one that was blocking the high priority thread). This anomaly caused the innodb lock wait timeout expiration delays and poor performance symptom. To fix this, lock_wait_wsrep_kill() now looks if victim is in lock waiting state, and uses lock_cancel_waiting_and_release() to cancel this lock wait. wsrep_bf_abort() checks if the victim has active transaction (in wsrep-lib), and starts a new transaction if there was no active transaction before. Due to late BF aborting, the victim may have e.g. failed in certification and is already aborting or has aborted at this stage. This has caused problems in testing where BF aborter tries to BF abort himself. The fix in wsrep_bf_abort() now skips the BF abort, if victim is aborting or has aborted. Victim may not have started transaction yet in wsrep context, but it may have acquired MDL locks (due to DDL execution), and this has caused BF conflict. Such case does not require aborting in wsrep or replication provider state. BF aborting could cause BF-BF conflict scenario, if victim was already aborted and changed to replayer having high priority as well. This BF-BF conflict scenario is now avoided in lock_wait_wsrep() where we now check if blocking lock holder is also high priority and is ordered before, caller should wait for the lock in this situation. The natural innodb deadlock resolving algorithm could pick BF thread as deadlock victim. This is fixed by giving max weigh to BF threads in Deadlock::report(). MDEV-24341 has changed excution paths in do_command() and this affects BF aborted victim execution. This PR fixes one assert in do_command(): DBUG_ASSERT(!thd->async_state.pending_ops()) Which fired if the thd was BF aborted earlier. This assert is now changed to allow pending_ops() if thd was BF aborted before. With these fixes, long term highly conflicting write load could be run against to node cluster. If binlogging is configured, log_slave_updates should be also set.	2021-04-13 14:58:54 +03:00
Jan Lindström	27d66d644c	MENT-411 : Implement wsrep_replicate_aria Introduced two new wsrep_mode options * REPLICATE_MYISAM * REPLICATE_ARIA Depracated wsrep_replicate_myisam parameter and we use wsrep_mode = REPLICATE_MYISAM instead. This required small refactoring of wsrep_check_mode_after_open_table so that both MyISAM and Aria are handled on required DML cases. Similarly, added Aria to wsrep_should_replicate_ddl to handle DDL for Aria tables using TOI. Added test cases and improved MyISAM testing. Changed use of wsrep_replicate_myisam to wsrep_mode = REPLICATE_MYISAM	2021-02-25 07:47:51 +02:00
Sergei Golubchik	f33e57a9e6	Merge branch '10.4' into 10.5	2021-02-23 13:06:22 +01:00
Sergei Golubchik	e841957416	Merge branch '10.3' into 10.4	2021-02-23 09:25:57 +01:00
Sergei Golubchik	0ab1e3914c	Merge branch '10.2' into 10.3	2021-02-22 22:42:27 +01:00
Sergei Golubchik	a638f1577a	Merge branch 'bb-10.2-release' into 10.2	2021-02-22 18:43:03 +01:00
Sergei Golubchik	3a8ca9096e	Merge branch 'bb-10.4-release' into bb-10.5-release	2021-02-19 10:37:51 +01:00
Sergei Golubchik	53123dfa3e	Merge branch 'bb-10.3-release' into bb-10.4-release	2021-02-19 00:19:42 +01:00
Sergei Golubchik	0d55b020e1	Merge branch 'bb-10.2-release' into bb-10.3-release	2021-02-18 22:09:53 +01:00
Sergei Golubchik	ce3a2a688d	make @@wsrep_provider and @@wsrep_notify_cmd read-only this should simplify run-time cluster management	2021-02-18 19:03:01 +01:00
Jan Lindström	4d300ab1a8	MDEV-24867 : wsrep.variables MTR failed: Result length mismatch Stabilize test case.	2021-02-17 10:28:37 +02:00
Sergei Golubchik	25d9d2e37f	Merge branch 'bb-10.4-release' into bb-10.5-release	2021-02-15 16:43:15 +01:00
Sergei Golubchik	00a313ecf3	Merge branch 'bb-10.3-release' into bb-10.4-release Note, the fix for "MDEV-23328 Server hang due to Galera lock conflict resolution" was null-merged. 10.4 version of the fix is coming up separately	2021-02-12 17:44:22 +01:00
Sergei Golubchik	60ea09eae6	Merge branch '10.2' into 10.3	2021-02-01 13:49:33 +01:00
Jan Lindström	900a14754a	Fix wsrep.variables	2021-01-26 14:21:33 +02:00
Marko Mäkelä	961c7938bb	Merge 10.4 into 10.5	2021-01-25 12:44:24 +02:00
Jan Lindström	be5fce16a0	MDEV-24596 : Assertion `state_ == s_exec \|\| state_ == s_quitting' failed in wsrep::client_state::disable_streaming There were multiple problems here * wsrep_trx_fragment_size should not be set when wsrep is disabled or provider is not loaded * wsrep_trx_fragment_unit should not be set when wsrep is disabled or provider is not loaded * wsrep_debug has no effect if wsrep is disabled or provider is not loaded * wsrep_start_position should not be set when wsrep is disabled or provider is not loaded any other value than default * wsrep_start_position should be changed only when we are joiner or initialized * wsrep_start_position should be allowed to set only a value that exits, thus we need to add error handling to wsrep_sst_complete	2021-01-21 11:41:29 +02:00
Marko Mäkelä	6a1e655cb0	Merge 10.4 into 10.5	2020-12-02 18:29:49 +02:00
Jan Lindström	6f50f51e60	MDEV-21494 : Galera test sporadic failure on galera.galera_defaults Make sure that we operate with correct Galera library version and do not print wsrep_provider_options field.	2020-11-19 12:42:54 +02:00
Marko Mäkelä	133b4b46fe	Merge 10.4 into 10.5	2020-11-03 16:24:47 +02:00
Marko Mäkelä	4b3690b504	fixup `67cb7ea22a`	2020-11-03 14:48:08 +02:00
Jan Lindström	67cb7ea22a	Clean up wsrep.variables	2020-11-03 09:12:06 +02:00
Jan Lindström	2391582ec3	Merge remote-tracking branch 10.2 into 10.3	2020-11-03 09:00:23 +02:00
Jan Lindström	94859d985e	Clean up wsrep.variables	2020-11-03 08:49:10 +02:00
Marko Mäkelä	898521e2dd	Merge 10.4 into 10.5	2020-10-30 11:15:30 +02:00
Teemu Ollakka	ec0e9d6f76	MDEV-22681 EXECUTE IMMEDIATE crashes server if wsrep is on. A wsrep transaction was started for EXECUTE IMMEDIATE, which caused assertion failure when the executed statement was CREATE TABLE which should be executed in TOI mode. As a fix, don't start wsrep transaction for EXECUTE IMMEDIATE to let the wsrep state logic to be handled from inside stored procedure codepath. Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>	2020-10-28 09:51:35 +02:00
Marko Mäkelä	1657b7a583	Merge 10.4 to 10.5	2020-10-22 17:08:49 +03:00
Marko Mäkelä	45e10d46d8	After-merge fix to wsrep.variables We must make the test depend on a debug version of Galera.	2020-10-22 13:50:06 +03:00
Marko Mäkelä	46957a6a77	Merge 10.3 into 10.4	2020-10-22 13:27:18 +03:00
Marko Mäkelä	e3d692aa09	Merge 10.2 into 10.3	2020-10-22 08:26:28 +03:00
Daniele Sciascia	fdf87973cb	MDEV-23081 Stray XA transactions at startup, with wsrep_on=OFF Change xarecover_handlerton so that transaction with WSREP prefixed xids are rolled back when Galera is disabled. Reviewd-by: Jan Lindström <jan.lindstrom@mariadb.com>	2020-10-21 16:29:07 +03:00
Jan Lindström	a3eddd9f11	MDEV-23659 : Update Galera disabled.def file Changes to be committed: modified: mysql-test/suite/galera/disabled.def modified: mysql-test/suite/wsrep/disabled.def	2020-10-10 08:50:50 +03:00
Jan Lindström	fc3b5c7db3	MDEV-17585 : wsrep.variables failed in buildbot with deadlock on CREATE USER Stabilize test by using correct galera library and restore original galera cluster at end.	2020-10-10 08:50:50 +03:00
Sujatha	25ede13611	Merge branch '10.4' into 10.5	2020-09-29 16:59:36 +05:30
Daniele Sciascia	7edfb72eff	Fix MTR test wsrep.variables Require galera_have_debug_sync.inc and re-record to include new variables exposed by latest galera library.	2020-09-28 12:17:05 +03:00
Marko Mäkelä	882ce206db	Merge 10.4 into 10.5	2020-09-23 11:32:43 +03:00
Jan Lindström	de76bebc57	MDEV-23659 : Update Galera disabled.def file	2020-09-14 18:21:48 +03:00
Jan Lindström	b69e980a38	MDEV-20581 Fix MTR test wsrep.variables Made the test work with --repeat option by adding galera_wait_ready.inc at the end of test. Recorded the test output.	2020-09-14 18:21:17 +03:00
Marko Mäkelä	0775717479	MDEV-23466: Fix the result for different GTID format in 10.5	2020-08-21 11:53:30 +03:00
Marko Mäkelä	2fa9f8c53a	Merge 10.3 into 10.4	2020-08-20 11:01:47 +03:00
Marko Mäkelä	de0e7cd72a	Merge 10.2 into 10.3	2020-08-20 09:12:16 +03:00
Marko Mäkelä	bfba2bce6a	Merge 10.1 into 10.2	2020-08-20 06:00:36 +03:00
Daniele Sciascia	f8bf5b0f84	MDEV-23466 SIGABRT on SELECT WSREP_LAST_SEEN_GTID SELECT WSREP_LAST_SEEN_GTID aborts the server if no provider is loaded.	2020-08-19 13:12:00 +03:00
Daniele Sciascia	fe3284b2cc	MDEV-23092 SIGABRT when setting invalid wsrep_provider Some invalid wsrep_provider paths may be interpreted as a valid directory. For example '/invalid/libgalera_smm.so' with UTF character set is interpreted as '/', which is a valid directory. A early check that wsrep_provider should not be a directory fixes it.	2020-08-19 13:12:00 +03:00
Daniele Sciascia	09dd06f14a	MDEV-22443 wsrep::runtime_error on START TRANSACTION This happens with global wsrep_on disabled and local wsrep_on enabled. The fix consists in avoiding sync wait when global wsrep_on is disabled.	2020-08-19 13:12:00 +03:00
Daniel Black	b970363acf	MDEV-23440: mysql_tzinfo_to_sql to use transactions Since MDEV-18778, timezone tables get changed to innodb to allow them to be replicated to other galera nodes. Even without galera, timezone tables could be declared innodb. With the standalone innodb tables, the mysql_tzinfo_to_sql takes approximately 27 seconds. With the transactions enabled in this patch, 1.2 seconds is the approximate load time. While explicit checks for the engine of the time zone tables could be done, or checks against !opt_skip_write_binlog, non-transactional storage engines will just ignore the transactional state without even a warning so its safe to enact globally. Leap seconds are pretty much ignored as they are a single insert statement and have gone out of favour as they have caused MariaDB stalls in the past.	2020-08-15 14:02:05 +10:00
Marko Mäkelä	eae968f62d	Merge 10.3 into 10.4	2020-08-10 21:08:46 +03:00
Marko Mäkelä	bafc5c1321	Merge 10.2 into 10.3	2020-08-10 18:40:57 +03:00
Jan Lindström	1dec60c795	MDEV-22626: mysql_tzinfo_to_sql not replicates timezone to galeranodes if only 1 timezone will be loaded. Move alter to InnoDB earlier to more correct place to handle also if only a one timezone file is loaded.	2020-08-07 09:06:13 +03:00
Sergei Golubchik	c2db9397c7	MDEV-18565 Galera mtr-suite fails if galera library is not installed revert/simplify `f5390eea9a` remove galera-specific checks from mtr and the main suite	2020-04-27 09:22:36 +02:00

1 2 3 4 5

229 Commits