mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-08-08 11:22:35 +03:00

Author	SHA1	Message	Date
Marko Mäkelä	ce6616aa28	Merge 10.9 into 10.10	2023-04-26 18:31:03 +03:00
Marko Mäkelä	818d5e4814	Merge 10.5 into 10.6	2023-04-25 13:10:33 +03:00
Oleksandr Byelkin	1d74927c58	Merge branch '10.4' into 10.5	2023-04-24 12:43:47 +02:00
Igor Babaev	6dc6c22c14	MDEV-31085 Crash when processing multi-update using view with optimizer_trace on This bug caused server crash when processing a multi-update statement that used views if optimizer tracing was enabled. The bug was introduced in the patch for MDEV-30539 that could incorrectly detect the most top level selects of queries if views were used in them. Approved by Oleksandr Byelkin <sanja@mariadb.com>	2023-04-22 12:32:38 -07:00
Sergei Petrunia	e4fbec1463	Make tests work with --view-protocol	2023-02-03 14:33:18 +03:00
Rex	07f21cfb14	MDEV-21092,MDEV-21095,MDEV-29997: Optimizer Trace for index condition pushdown, partition pruning, exists-to-in Add Optimizer Tracing for: - Index Condition Pushdown - Partition Pruning - Exists-to-IN optimization	2023-02-03 14:28:08 +03:00
Monty	727491b72a	Added test cases for preceding test This includes all test changes from "Changing all cost calculation to be given in milliseconds" and forwards. Some of the things that caused changes in the result files: - As part of fixing tests, I added 'echo' to some comments to be able to more easily find out where things were wrong. - MATERIALIZED now has a higher cost compared to X than before. Because of this some MATERIALIZED types have changed to DEPENDENT SUBQUERY. - Some test cases that required MATERIALIZED to repeat a bug were changed by adding more rows to force MATERIALIZED to happen. - 'Filtered' in SHOW EXPLAIN has in many cases changed from 100.00 to something smaller. This is because now filtered also takes into account the smallest possible ref access and filters, even if they were not used. Another reason for 'Filtered' being smaller is that we now also take into account implicit filtering done for subqueries using FIRSTMATCH. (main.subselect_no_exists_to_in) This is calculated in best_access_path() and stored in records_out. - Table orders have changed because of more accurate costs. - 'index' and 'ALL' for small tables have changed to use 'range' or 'ref' because of optimizer_scan_setup_cost. - index can be changed to 'range' as 'range' optimizer assumes we don't have to read the blocks from disk that range optimizer has already read. This can be confusing in the case where there is no obvious where clause but instead there is a hidden 'key_column > NULL' added by the optimizer. (main.subselect_no_exists_to_in) - Scan on primary clustered key does not report 'Using Index' anymore (It's a table scan, not an index scan). - For derived tables, the number of rows is now 100 instead of 2, which can be seen in EXPLAIN. - More tests have "Using index for group by" as the cost of this optimization is now more correct (lower). - A primary key could be preferred over a normal key, even if it would access more rows, as it's faster to do 1 lookup and 3 'index_next' on a clustered primary key than one lookup through a secondary. (main.stat_tables_innodb) Notes: - There were 4.7% more calls to best_extension_by_limited_search() in the main.greedy_optimizer test. However examining the test results it looked that the plans were slightly better (eq_ref were more chained together) so I assume this is ok. - I have verified a few test cases where there were notable/unexpected changes in the plan and in all cases the new optimizer plans were faster. (main.greedy_optimizer and some others)	2023-02-03 00:00:35 +03:00
Monty	013ba37ae2	Fix cost calculation in test_if_cheaper_ordering() to be cost based The original code was mostly rule based and preferred clustered or covering indexes independent of cost. There were a few test changes: - Some tests changed from using filesort to index or table scan. This happened when most of the rows had to be sorted and the ORDER BY could use covering or a clustered index (innodb_mysql, create_spatial_index). - Some tests changed range to filesort. This was mainly because the range was scanning most of the rows or using index scan + row lookup and filesort with table scan is cheaper. (order_by). - Change in join_cache was because sorting 2 rows is faster than retrieving 10 rows. - In selectivity_innodb.test one test changed to use a cheaper index.	2023-02-02 23:08:23 +03:00
Monty	2387ee9b45	Added 'records_out' and join_type to POSITION records_out is the number of rows expected to be accepted from a table. records_read is in contrast the number of rows that the optimizer expects to read from the engine. This patch causes no plan changes. The differences in test results come from renaming "records" to "records_read" and printing of records_out in the optimizer trace. Other things: - Renamed table_cond_selectivity() to table_after_join_selectivity() to make the purpose of the function more clear.	2023-02-02 22:25:24 +03:00
Lena Startseva	f9bf41632e	Merge branch 'bb-10.9-all-builders' into bb-10.10-all-builders	2022-09-28 09:40:17 +07:00
Lena Startseva	a9962580ab	MDEV-27691: make working view-protocol Update tests for version 10.6	2022-09-27 13:18:28 +07:00
Lena Startseva	f8f25b472e	Merge branch 'bb-10.5-all-builders' into bb-10.6-all-builders	2022-09-27 13:17:59 +07:00
Lena Startseva	2abf499c76	MDEV-27691: make working view-protocol Update tests for version 10.5	2022-09-26 10:25:41 +07:00
Lena Startseva	d444536e1d	Merge branch 'bb-10.4-all-builders' into bb-10.5-all-builders	2022-09-26 10:24:59 +07:00
Lena Startseva	184e65954b	MDEV-27691: make working view-protocol Update tests for version 10.4	2022-09-23 19:47:30 +07:00
Monty	515b9ad05a	Added EQ_REF chaining to the greedy_optimizer MDEV-28073 Slow query performance in MariaDB when using many tables The idea is to prefer and chain EQ_REF tables (tables that use a unique key to find a row) when searching for the best table combination. This significantly reduces row combinations that have to be examined. This optimization is enabled when setting optimizer_prune_level=2 (which is now default). Implementation: - optimizer_prune_level has a new level, 2, which enables EQ_REF optimization in addition to the pruning done by level 1. Level 2 is now default. - Added JOIN::eq_ref_tables that contains bits of tables that could potentially use EQ_REF access in the query. This is calculated in sort_and_filter_keyuse() Under optimizer_prune_level=2: - When the greedy_optimizer notices that the preceding table was an EQ_REF table, it tries to add an EQ_REF table next. If an EQ_REF table exists, only this one will be considered at this level. We also collect all EQ_REF tables chained by the next levels and these are ignored on the starting level as we have already examined these. If no EQ_REF table exists, we continue as normal. This optimization speeds up the greedy_optimizer combination test by ~25% Other things: - I ported the changes in MySQL 5.7 to greedy_optimizer.test to MariaDB to be able to ensure we can handle all cases that MySQL can do. - I have run all tests with --mysqld=--optimizer_prune_level=1 to verify that there were no test changes.	2022-07-26 22:27:29 +07:00
Monty	b3c74bdc1f	Improve pruning in greedy_search by sorting tables during search MDEV-28073 Slow query performance in MariaDB when using many tables The faster we can find a good query plan, the more options we have for finding and pruning (ignoring) bad plans. This patch adds sorting of plans to best_extension_by_limited_search(). The plans, from best_access_path() are sorted according to the numbers of found rows. This allows us to faster find 'good tables' and we are thus able to eliminate 'bad plans' faster. One side effect of this patch is that if two tables have equal cost, the table which was used earlier in the query is preferred. This allows users to improve plans by reordering eq_ref tables in the order they would like them to be used. Result changes caused by the patch: - Traces are different as now we print the cost for using tables before we start considering them in the plan. - Table order is changed for some plans. In most cases this is because the plans are equal and tables are in this case sorted according to their usage in the original query. - A few plans were changed as the optimizer was able to find a better plan (that was pruned by the original code). Other things: - Added a new statistic variable: "optimizer_join_prefixes_check_calls", which counts number of calls to best_extension_by_limited_search(). This can be used to check the prune efficiency in greedy_search(). - Added variable "JOIN_TAB::embedded_dependent" to be able to handle XX IN (SELECT..) in the greedy_optimizer. The idea is that we should prune a table if any of the tables in embedded_dependent is not yet read. - When using many tables in a query, there will be some additional memory usage as we need to pre-allocate table of table_count*table_count*sizeof(POSITION) objects (POSITION is 312 bytes for now) to hold the pre-calculated best_access_path() information. This memory usage is offset by the expected performance improvement when using many tables in a query. - Removed the code from an earlier patch to keep the table order in join->best_ref in the original order. This is not needed anymore as we are now sorting the tables for each best_extension_by_limited_search() call.	2022-07-26 22:27:28 +07:00
Marko Mäkelä	cd751f0259	Work around MDEV-27421 ./mtr --ps-protocol main.opt_trace	2022-01-04 15:53:02 +02:00
Marko Mäkelä	3f5726768f	Merge 10.5 into 10.6	2022-01-04 09:26:38 +02:00
Julius Goryavsky	55bb933a88	Merge branch 10.4 into 10.5	2021-12-26 12:51:04 +01:00
Sergei Petrunia	397f5cf71e	MDEV-27238: Assertion `got_name == named_item_expected()' failed in Json_writer make_join_select() calls const_cond->val_int(). There are edge cases where const_cond may have a not-yet optimized subquery. (The subquery will have used_tables() covered by join->const_tables. It will still have const_item()==false, so other parts of the optimizer will not try to evaluate it. We should probably mark such subqueries as constant but that is outside the scope of this MDEV)	2021-12-23 14:08:43 +03:00
Sergei Petrunia	32692140e1	MDEV-27306: SET STATEMENT optimizer_trace=1 Doesn't save the trace In mysql_execute_command(), move optimizer trace initialization to be after run_set_statement_if_requested() call. Unfortunately, mysql_execute_command() code uses "goto error" a lot, and this means optimizer trace code cannot use RAII objects. Work this around by: - Make Opt_trace_start a non-RAII object, add init() method. - Move the code that writes the top-level object and array into Opt_trace_start::init().	2021-12-19 17:19:02 +03:00
Monty	607b14c4dc	Add --optimizer_trace option to mysqltest This enables optimizer_trace output for the next SQL command. Identical as if one would have done: - Store value of @@optimizer_trace - Set @optimizer_trace="enabled=on" - Run query - SELECT * from OPTIMIZER_TRACE - Restore value of @@optimizer_trace This is a great time saver when one wants to quickly check the optimizer trace for a query in a mtr test.	2021-12-15 19:11:25 +02:00
Sergei Petrunia	5f22e83a29	Make the Optimizer Trace of regular query and PS EXECUTE be identical Print this piece when we've just made the choice to convert to semi-join. Also, print it when we've already made that choice before: "transformation": { "select_id": 2, "from": "IN (SELECT)", "to": "semijoin", "chosen": true }	2021-11-29 16:25:27 +03:00
Marko Mäkelä	25ac047baf	Merge 10.5 into 10.6	2021-11-09 09:11:50 +02:00
Marko Mäkelä	9c18b96603	Merge 10.4 into 10.5	2021-11-09 08:50:33 +02:00
Sergei Krivonos	fcca0c67b6	MDEV-26929: fixed opt_trace test for --mysqld=--optimizer_trace=enabled=on	2021-10-28 18:41:05 +03:00
Marko Mäkelä	d4a89b9262	Merge 10.5 into 10.6	2021-10-27 10:06:02 +03:00
Marko Mäkelä	44f9736e0b	Merge 10.4 into 10.5	2021-10-27 09:48:22 +03:00
Alexander Barkov	05a0eae335	MDEV-22380 Assertion `name.length == strlen(name.str)' failed .. w/optimizer_trace enabled Adding 10.4 specific tests.	2021-10-27 07:21:34 +04:00
Dmitry Shulga	461cac8901	MDEV-26150: The test main.opt_trace fails in case it is run in PS mode In case the test main.opt_trace is run with the option --ps-protocol it fails since querying from the table INFORMATION_SCHEMA.OPTIMIZER_TRACE produces an output that differed from the expected one in the following way: @@ -2829,14 +2829,6 @@ } }, { - "transformation": { - "select_id": 2, - "from": "IN (SELECT)", - "to": "semijoin", - "chosen": true - } - }, - { "expanded_query": "/* select#2 */ select t10.pk from t10" } The table INFORMATION_SCHEMA.OPTIMIZER_TRACE is filled when optimizer_trace is on. The reason for the missing pieces in the query result set is that the C++ macro OPT_TRACE_TRANSFORM(thd, trace_wrapper, trace_transform, select_lex->select_number, "IN (SELECT)", "semijoin"); located in the standalone function check_and_do_in_subquery_rewrites() is executed twice in case the statement explain extended select * from t1 where a in (select pk from t10); is run in PS mode. The first time it is executed on PREPARE phase and the second time on EXECUTE phase. The output produced by this macro on EXECUTE phase rewrites the output produced on PREPARE phase. As a result the test failed in case it was run in PS mode. To make test output uniform regardless of whether the test is run in PS or normal mode the operator '--source include/protocol.inc' has been added to the file opt_trace.test and extra opt_trace,ps.rdiff file has been added. Additionally, added operators --enable_prepared_warnings/--disable_prepared_warnings in order to store warnings in result file that are received on PREPARE phase during running the statement 'SELECT INTO'.	2021-07-16 09:30:36 +07:00
Dmitry Shulga	510662e81b	MDEV-16708: more fixes to test cases	2021-06-17 19:30:24 +02:00
Alexey Botchkov	e9fd327ee3	MDEV-17399 Add support for JSON_TABLE. The specific table handler for the table functions was introduced, and used to implement JSON_TABLE.	2021-04-21 10:21:43 +04:00
Sergei Petrunia	bd43f39bd5	MDEV-24325: Optimizer trace doesn't cover LATERAL DERIVED Provide basic coverage in the Optimizer Trace	2021-03-29 12:54:06 +03:00
Sergei Petrunia	b3c470a3c7	MDEV-23646: Optimizer trace: optimize_cond() should show ON expression processing Print the build_equal_items() step for ON expression processing	2021-03-19 18:12:26 +03:00
Sergei Petrunia	b9a45ba40f	MDEV-23645: Optimizer trace: print conditions after substitute_for_best_equal_field Print the conditions for WHERE, HAVING, and ON.	2021-03-19 17:37:38 +03:00
Sergei Petrunia	2b3fd5dff0	MDEV-23677: Optimizer trace: remove "no predicate for first keypart" (not) Don't remove (reasons given in Jira), instead add test coverage. Improve other printout in best_access_path.	2021-03-18 21:04:33 +03:00
Marko Mäkelä	a4b7232b2c	Merge 10.4 into 10.5	2021-03-11 20:09:34 +02:00
Sergei Golubchik	01a0d739c8	MDEV-24975 Server consumes extra 4G memory upon querying INFORMATION_SCHEMA.OPTIIMIZER_TRACE if a query used no fields from an I_S table, we were creating a temp table with one, first, field (as a table cannot have zero fields), with its length truncated to 1. Now - force also this dummy field to be a normal field, not a BLOB	2021-03-08 15:00:45 +01:00
Sergei Petrunia	29a6d23622	MDEV-23767: IN-to-subquery conversion is not visible in optimizer trace Add the printout	2020-09-20 00:07:37 +03:00
Marko Mäkelä	1813d92d0c	Merge 10.4 into 10.5	2020-07-02 09:41:44 +03:00
Varun Gupta	cc0dca3663	MDEV-22910: SIGSEGV in Opt_trace_context::is_started & SIGSEGV in Json_writer::add_table_name (on optimized builds) Make sure to initialize members of TABLE::reginfo when TABLE::init is called. In this case the problem was that table->reginfo.join_tab was set for the SELECT query and then was reused by the UPDATE query. This case occurred only when the SELECT query had a degenerate join.	2020-06-30 18:29:02 +05:30
Varun Gupta	4c3cbe2392	MDEV-22665: Print ranges in the optimizer trace created for non-indexed columns when optimizer_use_condition_selectivity >2 Now the optimizer trace shows the ranges constructed while getting estimates from EITS	2020-06-18 20:15:06 +05:30
Sergei Petrunia	517e9334f2	MDEV-22891: Optimizer trace: const tables are not clearly visible Make mark_join_nest_as_const() print its action into the trace.	2020-06-15 13:00:43 +03:00
Marko Mäkelä	6877ef9a7c	Merge 10.4 into 10.5	2020-06-05 20:36:43 +03:00
Varun Gupta	6404645980	MDEV-21626: Optimizer misses the details about the picked join order Added cost of sorting estimate to the optimizer trace	2020-06-04 20:03:22 +05:30
Marko Mäkelä	4337a3b5f9	Merge 10.4 into 10.5	2020-05-04 18:43:00 +03:00
Marko Mäkelä	50f3a38e89	Add an end marker to a test	2020-05-04 18:31:30 +03:00
Sergei Petrunia	7bc6735736	MDEV-22401: Optimizer trace: multi-component range is not printed correctly KEY_MULTI_RANGE::range_flag does not have correct flag bits for per-endpoint flags (NEAR_MIN, NEAR_MAX, NO_MIN_RANGE, NO_MAX_RANGE). It only has bits for flags that describe both endpoints. So - Document this. - Switch optimizer trace to using {start|end}_key.flag values, instead. This fixes the bug. - Switch records_in_column_ranges() to doing that too. (This used to work, because KEY_MULTI_RANGE::range_flag had correct flag value for the last key component, and EITS only uses one-component pseudo-indexes)	2020-04-29 16:31:16 +03:00
Monty	27d9986c1b	Added more digits to JSON output of double sprintf() format of double changed from '%lg' to '%-.11lg' The change was to make it easier to read optimizer trace output with tables that have millions of records.	2020-04-19 17:33:52 +03:00
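
The effect of the format change described in commit 27d9986c1b can be sketched as follows. This is only an illustration, not the server's code: it uses Python's printf-style `%` formatting as a stand-in for C `sprintf()` (Python follows the same rules for the `g` conversion and ignores the `l` length modifier). The helper name `format_double` and the sample value are hypothetical.

```python
# Illustration only: Python's "%" operator stands in for C sprintf().
# The two format strings are the ones quoted in the commit message.
OLD_FORMAT = "%lg"       # default precision: 6 significant digits
NEW_FORMAT = "%-.11lg"   # 11 significant digits, left-justified


def format_double(fmt: str, value: float) -> str:
    """Format a double with a printf-style format string (hypothetical helper)."""
    return fmt % value


if __name__ == "__main__":
    rows = 1234567.891  # hypothetical row estimate for a table with millions of records
    print(format_double(OLD_FORMAT, rows))  # falls back to exponent notation
    print(format_double(NEW_FORMAT, rows))  # keeps the plain decimal form
```

With only six significant digits, a value in the millions is printed in exponent notation and loses its low-order digits; eleven significant digits keep such values readable as plain decimals.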


110 Commits