The new statistics are enabled by adding the "engine", "innodb" or "full"
option to --log-slow-verbosity.
Example output:
# Pages_accessed: 184 Pages_read: 95 Pages_updated: 0 Old_rows_read: 1
# Pages_read_time: 17.0204 Engine_time: 248.1297
Pages_read_time is the time spent doing physical reads inside a storage engine.
(Writes cannot be tracked as these are usually done in the background).
Engine_time is the time spent inside the storage engine for the full
duration of the read/write/update calls. It uses the same code as
'analyze statement' for calculating the time spent.
The engine statistics are collected with a generic interface that should be
easy for any engine to use. It can also easily be extended to provide
even more statistics.
Currently only InnoDB has counters for the Pages_% and Undo_% statistics.
Engine_time works for all engines.
Implementation details:
class ha_handler_stats holds all engine stats. This class is included
in handler and THD classes.
While a query is running, all statistics are updated in the handler. In
close_thread_tables() the statistics are added to the THD.
handler::handler_stats is a pointer to where statistics should be
collected. This is set to point to handler::active_handler_stats if
stats are requested. If not, it is set to 0.
handler_stats also has an element, 'active', that is 1 if stats are
requested. This allows engines to avoid doing any 'if's while
updating the statistics.
Cloned or partition tables have the pointer set to that of the base table
if stats are requested.
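A minimal standalone C++ sketch of this pattern (names and fields are
simplified; the real ha_handler_stats has more counters and lives in the
server sources):

  #include <cstdint>

  struct ha_handler_stats_sketch
  {
    uint32_t active= 0;            // 1 if stats were requested for this query
    uint64_t pages_accessed= 0;
    uint64_t pages_read_count= 0;
    uint64_t pages_read_time= 0;   // time of physical reads, in microseconds
    uint64_t engine_time= 0;       // total time spent inside the engine

    void reset() { *this= ha_handler_stats_sketch(); }
    void add(const ha_handler_stats_sketch &s)
    {
      pages_accessed+=   s.pages_accessed;
      pages_read_count+= s.pages_read_count;
      pages_read_time+=  s.pages_read_time;
      engine_time+=      s.engine_time;
    }
  };

  struct handler_sketch
  {
    ha_handler_stats_sketch  active_handler_stats;    // per-handler counters
    ha_handler_stats_sketch *handler_stats= nullptr;  // engines update via this

    // Mirrors the pointer setup described above, done at statement start
    void setup_stats(bool stats_requested)
    {
      if (stats_requested)
      {
        active_handler_stats.reset();
        active_handler_stats.active= 1;
        handler_stats= &active_handler_stats;
      }
      else
        handler_stats= nullptr;    // "set to 0" when stats are not requested
    }
  };

In close_thread_tables() the per-handler counters are then added to the
THD's copy, which the add() helper above mirrors.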
There is a small performance impact when using --log-slow-verbosity=engine:
- All engine calls in 'select' will be timed.
- IO calls for InnoDB reads will be timed.
- Incrementing counters is done on local variables and accesses
are inline, so these should have very little impact.
- Statistics have to be reset for each statement for the THD and each
used handler. This is only 40 bytes, which should be negligible.
- For partition tables we have to loop over all partitions to update
the handler_stats as part of table_init() (see the sketch after this
list). This can be optimized in the future to only do this when
log-slow-verbosity changes. For that to work we have to update
handler_stats for all opened partitions and also for all partitions
opened in the future.
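A hypothetical sketch of that loop (m_file and m_tot_parts are real
ha_partition member names, but the surrounding types and the function
name here are illustrative only):

  struct handler_stub
  {
    void *handler_stats= nullptr;       // stats pointer shared with the owner
  };

  struct ha_partition_stub
  {
    handler_stub **m_file= nullptr;     // one handler object per partition
    unsigned       m_tot_parts= 0;
    void          *handler_stats= nullptr;

    // Run as part of table init: every partition shares the owner's stats
    void propagate_handler_stats()
    {
      for (unsigned i= 0; i < m_tot_parts; i++)
        m_file[i]->handler_stats= handler_stats;
    }
  };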
Other things:
- Added options 'engine' and 'full' to log-slow-verbosity.
- Some of the new files in the test suite come from Percona Server, which
has similar status information.
- buf_page_optimistic_get(): Do not increment any counter, since we are
only validating a pointer, not performing any buf_pool.page_hash lookup.
- Added THD argument to save_explain_data_intern().
- Switched arguments for save_explain_.*_data() to always have
THD first (this generates better code as other functions also have THD
first).
The optimizer implicitly assumed that if `a` in `a=b` is not NULL,
then it is safe to convert `a` to the type of `b` and search for the
result in index(b).
This is not always the case, as converting a non-NULL value to a
different type might produce NULL (for example, converting a string
that is not a valid date to a DATE yields NULL). Searching for NULL
in the index might then find NULL there, so NULL would compare equal
to NULL, making `a=b` behave as if it were `a<=>b`.
The fake_select_lex->join was prepared at the unit execution stage, so
the validation of fake_select_lex before the unit pushdown
was incomplete. That caused pushing down of statements having
an incorrect ORDER BY clause.
This commit moves the preparation of fake_select_lex->join to the unit
prepare() method, before the pushdown handler is initialized,
so incorrect clauses error out before being pushed down.
The reason for the crash was that the 'best splitting' optimization
predicted fewer rows to be found than opt_range did.
The code in apply_selectivity_for_table(), when using use_cond_selectivity=1,
was not prepared for this case, which caused an assert in debug builds.
Production builds are not affected.
The fix is to choose the smaller of the two row counts (see the sketch
below). It will have a minimal effect on costs when using
use_cond_selectivity=1 and should not cause any problems in production.
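A minimal sketch of that fix, assuming hypothetical names (the real
change is inside apply_selectivity_for_table()):

  #include <algorithm>

  static double choose_row_estimate(double rows_from_splitting,
                                    double rows_from_opt_range)
  {
    // The old code assumed the splitting estimate could never be lower
    // than the opt_range estimate; taking the smaller of the two is safe.
    return std::min(rows_from_splitting, rows_from_opt_range);
  }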
This test case exposed 2 different bugs:
- When replacing a range with an index scan on a covering key
in test_if_skip_sort_order() we didn't disable filtering.
Filtering does not make much sense in this case.
- Fixed by disabling filtering in this case.
- Range_rowid_filter::fill() did not take into account that keyread
could already be active, which caused an assert when it tried to
activate another keyread.
- Fixed by remembering old keyread state at start and restoring it
at end.
Other things:
- ha_start_keyread() allowed multiple calls. This is wrong, especially
as we do not check if the index changed!
I added an assert() to ensure that we do not call it while there is
already an active keyread (see the sketch after this list).
- ha_end_keyread() always called ha_extra(), even if keyread was not
active. Added a check to avoid the extra call.
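A simplified sketch of the tightened keyread protocol (stand-in members;
the real ha_start_keyread()/ha_end_keyread() also issue
extra_opt(HA_EXTRA_KEYREAD)/extra(HA_EXTRA_NO_KEYREAD) calls to the engine):

  #include <cassert>

  struct keyread_sketch
  {
    static const unsigned MAX_KEY= 64;  // "no keyread active" marker
    unsigned keyread= MAX_KEY;

    bool keyread_enabled() const { return keyread != MAX_KEY; }

    int ha_start_keyread(unsigned idx)
    {
      assert(!keyread_enabled());       // new assert: no stacked keyread calls
      keyread= idx;
      return 0;                         // real code also calls extra_opt()
    }

    int ha_end_keyread()
    {
      if (!keyread_enabled())           // new check: skip the extra() call
        return 0;
      keyread= MAX_KEY;
      return 0;                         // real code also calls extra()
    }
  };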
This patch also fixes
MDEV-31391 Assertion `((best.records_out) == 0.0 ... failed
Cost changes caused by this change:
- range queries with join buffer now have a notably smaller cost.
- ranges are a bit more expensive as the MULTI_RANGE_COST is now
properly applied to them in all cases (this extra cost is equal to a
key lookup).
- table scan cost is slightly smaller as we now assume data is cached in
the engine after the first scan pass. (We already did this for range
scans and other access methods).
- partition tables had wrong values for max_row_blocks and
max_index_blocks. Correcting this causes range access on
partitioned tables to have a slightly higher cost because of the
increased estimated IO.
- Using first match + join buffer caused 'filtered' to be calculated
wrongly. (This only affected EXPLAIN, not query costs).
- Added cost_without_join_buffer to optimizer_trace.
- check_quick_select() adjusted the number of rows according to persistent
statistics, but did not adjust cost. Now fixed.
The big changes in the patch are:
- In best_access_path(), we now store the cost in
'ALL_READ_COST cost' and only convert it to a double at the end.
This allows us to calculate the effect of the join_cache more exactly.
- In JOIN_TAB::estimate_scan_time(), the cost is also stored in an
ALL_READ_COST object.
One effect of this change is that when joining very small tables:
t1 some_access_method
t2 range
t3 ALL Use join buffer
This is switched to
t1 some_access_method
t3 ALL
t2 range use join buffer
Both plans have the same cost, but as a table scan in this case has a
lower cost than a range scan, the table scan will be considered first
and thus take precedence.
Test case changes:
- optimizer_trace - Addition of cost_without_join_buffer
- subselect_mat_cost_bugs - Small tables and scan versus range
- range & range_mrr_icp - Range + join_cache is faster than ref
- optimizer_trace - cost_without_join_buffer, smaller scan cost,
range setup cost.
- mrr - range+join_buffer used because of its smaller cost
Allow queries of multiple SELECTs combined together with
UNIONs/EXCEPTs/INTERSECTs to be pushed down to foreign engines.
If the foreign engine provides an interface method "create_unit"
and the UNIT is a top-level unit of the SQL query then the server
tries to push the whole SELECT_LEX_UNIT down to the engine for execution.
The engine should perform necessary checks and if they succeed,
execute the query. If the engine is unable to execute the whole unit,
then another attempt is made to push down SELECTs composing the unit
separately using the "create_select" interface method. In this case
the results of the separate SELECTs are combined on the server side,
thus composing the final result.
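A hedged sketch of that decision flow (the create_unit/create_select names
come from the text above; the types and the fallback logic below are
simplified stand-ins, not the server implementation):

  #include <memory>
  #include <vector>

  struct UnitHandler   {};   // would execute a whole UNION/EXCEPT/INTERSECT
  struct SelectHandler {};   // would execute a single SELECT of the unit

  struct ForeignEngine
  {
    virtual ~ForeignEngine()= default;
    // Return nullptr when the engine cannot execute the unit / the SELECT
    virtual std::unique_ptr<UnitHandler> create_unit(const void *unit)
    { return nullptr; }
    virtual std::unique_ptr<SelectHandler> create_select(const void *select)
    { return nullptr; }
  };

  // Try the whole top-level unit first; if that fails, push the individual
  // SELECTs and let the server combine their results into the final one.
  bool try_pushdown(ForeignEngine &engine, const void *unit,
                    const std::vector<const void*> &selects)
  {
    if (engine.create_unit(unit))
      return true;
    bool pushed_any= false;
    for (const void *select : selects)
      pushed_any|= (engine.create_select(select) != nullptr);
    return pushed_any;
  }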
The code in choose_best_splitting() assumed that the join prefix is
in join->positions[].
This is not necessarily the case. This function might be called when
the join prefix is in join->best_positions[], too.
Follow the approach from best_access_path(), which calls this function:
pass the current join prefix as an argument,
"const POSITION *join_positions", and use that (see the sketch below).
This is the 11.0 part of the fix: in 11.0, get_costs_for_tables() calls
best_access_path() for all possible tables and, for each call, saves
a POSITION object with the access method and a "loose_scan_pos"
POSITION object.
The latter is saved even if there is no possible LooseScan plan. Saving
is done by copying POSITION objects, which may generate a spurious
UBSan error.
EXPLAIN EXTENDED should always print the field item used in the left part
of an equality expression from the SET clause of an update statement as a
reference to a table column.
Approved by Oleksandr Byelkin <sanja@mariadb.com>
The problem was a wrong assert. I changed it to match the code in
best_access_path().
The given test case was a bit tricky for the optimizer, which first
decided on using an index scan (because of force index), but then
test_if_skip_sort_order() decided to use range anyway to handle
distinct.
This was caused by two minor issues:
- get_quick_record_count() returned the number of rows for the range with
the least cost, when it should have returned the minimum number of rows
over all ranges.
- When changing REF to RANGE, we also changed records_out, which
should not be done (number of records in the result will not
change).
The above change can cause a small change in row estimates where the
optimizer chooses a clustered key with more rows than a range on a
secondary key (an unlikely case).
The problem was that when JOIN_TAB::remove_duplicates() noticed there
could only be one possible row in the output, it adjusted the limits but
did not take into account any possible offset.
Fixed by not adjusting the limit offset when setting the one-row limit.
When a query does implicit grouping and the join operation produces an empty
result set, a NULL-complemented row combination is generated.
However, constant table fields still show non-NULL values.
What happens is that end_send_group() is called with a
const row but without any rows matching the WHERE clause.
This last part is shown by 'join->first_record' not being set.
This causes item->no_rows_in_result() to be called for all items to reset
all sum functions to their initial state. However, fields are not set
to NULL.
The fix used is to produce NULL-complemented records for constant tables
as well. Also, reset the constant table's records back in case we're
in a subquery which may get re-executed.
An alternative fix would have item->no_rows_in_result() also work
with Item_field objects.
There are some other issues with the code:
- join->no_rows_in_result_called is used but never set.
- Tables that are used with group functions are not properly marked as
maybe_null, which is required if the table rows should be regarded as
null-complemented (not existing).
- The code that tries to detect if mixed_implicit_grouping should be set
didn't take into account all usage of fields and sum functions.
- Item_func::restore_to_before_no_rows_in_result() called the wrong
function.
- join->clear() does not use a table_map argument to clear_tables(),
which caused it to ignore constant tables.
- unclear_tables() does not correctly restore the status to what it
was before clear_tables().
The main bug fix was to always use a table_map argument to clear_tables() and
to always use join->clear() and clear_tables() together with unclear_tables()
(see the sketch below).
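A hedged sketch of that pairing (stand-in types; the real functions operate
on the JOIN and its TABLE objects):

  #include <cstdint>

  typedef uint64_t table_map;

  struct table_sketch
  {
    bool null_row= false;     // true while the table is NULL-complemented
  };

  // Mark all tables, including const tables, as NULL-complemented and
  // remember in *cleared_tables which ones this call actually changed.
  static void clear_tables(table_sketch *tables, unsigned count,
                           table_map *cleared_tables)
  {
    for (unsigned i= 0; i < count; i++)
    {
      if (!tables[i].null_row)
      {
        tables[i].null_row= true;
        *cleared_tables|= table_map(1) << i;
      }
    }
  }

  // Restore exactly the tables cleared above, so that const table rows are
  // intact again if the (sub)query gets re-executed.
  static void unclear_tables(table_sketch *tables, unsigned count,
                             table_map cleared_tables)
  {
    for (unsigned i= 0; i < count; i++)
      if (cleared_tables & (table_map(1) << i))
        tables[i].null_row= false;
  }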
Other fixes:
- Fixed Item_func::restore_to_before_no_rows_in_result()
- Set 'join->no_rows_in_result_called' when no_rows_in_result_set()
is called.
- Removed an unused argument from setup_end_select_func().
- More code comments
- Ensure that end_send_group() modifies the same fields as are in the
result set.
- Changed return_zero_rows() to use pointers instead of references,
similar to the rest of the code.
Reviewer: Sergei Petrunia <sergey@mariadb.com>
The problem was trying to access JOIN_TAB::select, which is set to NULL
when filesort is used. The correct way is to access either
JOIN_TAB::select or JOIN_TAB::filesort->select depending on whether
filesort is used.
This commit introduces the member function JOIN_TAB::get_sql_select(),
encapsulating that check so that the code duplication is eliminated.
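The accessor plausibly reduces to something like the following (stand-in
types; only the shape of the check is shown):

  struct SQL_SELECT_sketch {};

  struct Filesort_sketch
  {
    SQL_SELECT_sketch *select= nullptr;
  };

  struct JOIN_TAB_sketch
  {
    SQL_SELECT_sketch *select=   nullptr;   // NULL when filesort owns the select
    Filesort_sketch   *filesort= nullptr;

    SQL_SELECT_sketch *get_sql_select() const
    {
      // Use filesort's select when filesort is used, otherwise our own
      return filesort ? filesort->select : select;
    }
  };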
The new condition (s->table->quick_keys.is_set(best_key->key))
was added to best_access_path() to eliminate a Valgrind error.
The cause of that error was using TRASH_ALLOC(quick_key_parts)
instead of bzero(quick_key_parts); hence, accessing
s->table->quick_key_parts[best_key->key] without first checking
for quick_keys.is_set() might have caused reading "dirty" memory.