wsrep-lib

mirror of https://github.com/codership/wsrep-lib.git synced 2025-12-21 01:22:01 +03:00

Author	SHA1	Message	Date
Teemu Ollakka	ff94dfd8a7	Handle the possibility of client command that cannot return results This patch adds the possibility to have client commands that do not return results from DBMS. While processing such commands we must be able to preserve errors until the next interaction with client. Specifically if the transaction is bf aborted while processing such a non-returning command, then we have to keep the deadlock error until the client issues a command that may return the error. To handle such cases, client_state::before_command() now takes parameter keep_command_error. The DBMS is supposed set keep_command_error true to instruct wsrep-lib to preserve errors (if any) until the next command which sets keep_command_error false. Dealing with a case where current client command does not return result. Work in progress. Fix typo and add assertions in keep_command_error() Make keep_command_error a parameter to before_commit() Fix comment about keep_command_error Handle keep_command_error with s_must_abort in wsrep_before_command() Fix unit test	2020-11-27 11:17:39 +01:00
Daniele Sciascia	b12bbd059c	Support for replaying prepared XA transactions This patch implments replaying for prepared XA transactions. Replay may happen in the following cases: 1) The transaction is BF aborted in prepared state and is idle. In that case, the transaction is handed over to rollbacker for replay. 2) The transaction is BF aborted while executing the commit (i.e. before or after successful certification). In which case the transaction replays itself from fragment storage. 3) The transaction is BF aborted while certifying its commit fragment. This case is handled like replay for streaming transactions, where the provider is directly involved and re-delivers the last fragment.	2020-10-26 14:22:22 +01:00
Daniele Sciascia	965642eded	Support for detaching prepared XA transactions Add support for detaching XA transactions. This is useful for handling the case where the DBMS client has a transaction in prepared state and disconnects. Before disconnect, the DBMS calls the newly introduced client_state::xa_detach(), to cleanup the local transaction and convert it to a high priority transaction. The DBMS may later attempt to terminate the transaction through client_state::commit_by_xid() or client_state::rollback_by_xid(). Also in this patch: - Fix client_state::close() so that it does not rollback transactions in prepared state - Changed class wsrep::xid representation to hold enough information so that DBMS can convert to its native representation - Fix potential infinite loop in server_state::find_streaming_applier(wsrep:xid&) - Append SR keys on prepare fragment and make it pa_unsafe - Handle one phase commit (simply fall back to two phase) - Do not rollback prepared streaming clients in server_state::close_orphaned_transactions()	2020-10-26 14:20:21 +01:00
Teemu Ollakka	3e5a28df32	codership/wsrep-lib#135 Fix wrong assertion in before_command(). An assertion `server_state_.rollback_mode() == wsrep::server_state::rm_async` fired in `client_state::before_command()` if a BF abort happened between calls to wait_rollback_complete_and_acquire_ownership() and before_command(). This commit adds a test to reproduce the assertion and verify the correct behavior, as well as removes the incorrect assertion to fix the issue.	2020-07-24 10:46:48 +03:00
Leandro Pacheco	57523eea75	enter_toi polling fix	2019-12-08 12:52:36 +02:00
Teemu Ollakka	7d8583983f	Fixes to review comments - Increased loop sleep in poll_enter_toi() - Fixed typos in comments - Got rid of unnecessary ostringstreams	2019-12-08 12:52:36 +02:00
Teemu Ollakka	3fd20c4e4d	Fixed compilation for gcc 4.4	2019-12-08 12:52:36 +02:00
Leandro Pacheco	043ff7a7e9	remove has_error arg from begin_nbo_phase_two	2019-12-08 12:52:36 +02:00
Leandro Pacheco	3389b7ad3c	better error handling for NBO failures when losing error voting: - if NBO has failed locally (DBMS side), don't override original DBMS error so it gets reported to the client - otherwise, report "query interrupted" instead of "error during commit"	2019-12-08 12:52:36 +02:00
Leandro Pacheco	b63e753aec	removed unnecessary leave_toi and related TODO	2019-12-08 12:52:36 +02:00
Leandro Pacheco	f27f549479	poll_enter_toi timeout handlign	2019-12-08 12:52:36 +02:00
Leandro Pacheco	e0f9550967	handle certification error explicitly when entering TOI	2019-12-08 12:52:36 +02:00
Teemu Ollakka	922ce579c7	Clear NBO meta on failure, reset current error status after command.	2019-12-08 12:52:36 +02:00
Teemu Ollakka	086c466637	- Added wait-until parameter for begin_nbo_phase_two(). - Retry enter_toi() in poll_enter_toi() also for error_connection_failed which means that the connectivity to the cluster has been lost, a.k.a non-prim.	2019-12-08 12:52:36 +02:00
Teemu Ollakka	750052b640	Fixed timeout condition in poll_enter_toi()	2019-12-08 12:52:36 +02:00
Teemu Ollakka	3a1b194741	Pass certification keys also for NBO end. Certification keys are needed for NBO end to resolve dependencies for the write sets which follow NBO end. Without keys the following write sets do not detect dependency to NBO event and may start applying too early.	2019-12-08 12:52:36 +02:00
Teemu Ollakka	58cea10577	Release TOI critical section in poll_enter_toi() in case of error.	2019-12-08 12:52:36 +02:00
Teemu Ollakka	4ff55088b1	Fix NBO error handling - Set both current error and current error status if provider enter_toi() or leave_toi() fails. - Leave NBO mode if TOI cannot be entered in begin_nbo_phase_two().	2019-12-08 12:52:36 +02:00
Teemu Ollakka	e700ce8c79	Added short sleep between calls to enter_toi().	2019-12-08 12:52:36 +02:00
Teemu Ollakka	aaa92e130b	Made gcc 4.4 work.	2019-12-08 12:52:36 +02:00
Teemu Ollakka	b05abb005f	Chrono definitions to work around g++ 4.4 C++11 incompatibilities.	2019-12-08 12:52:36 +02:00
Teemu Ollakka	55fdbb7a05	Added timeout option to enter_toi_local() and begin_nbo_phase_one() If timeout option is give, enter_toi_local() and begin_nbo_phase_one() retry provider::enter_toi() as long as return status indicates certification failure, given timeout expires or the client is interrupted.	2019-12-08 12:52:36 +02:00
Leandro Pacheco	5298d2340e	error parameter to nbo calls and m_undefined for toi_mode toi_mode is set only when actually inside phase one and two. In between it goes back to m_undefined.	2019-12-08 12:52:36 +02:00
Teemu Ollakka	0b12869715	NBO begin error handling, unit test	2019-12-08 12:52:36 +02:00
Teemu Ollakka	e9bd950ee6	Fixed nbo_meta handling, release commit order for NBO begin.	2019-12-08 12:52:36 +02:00
Teemu Ollakka	24ad144db3	- Remove unneeded keys from nbo phase two begin. - Save nbo meta for phase two - Assign trx_meta in mutable_ws_meta	2019-12-08 12:52:36 +02:00
Teemu Ollakka	85a03394cc	NBO applying - High priority interface method to apply NBO begin, separate from apply_toi() in order to avoid implementation to force interpreting ws_meta flags. - Method to put client_state into NBO mode when applying NBO begin. The client_state will process in m_local mode. - Unit tests for applying NBO	2019-12-08 12:52:36 +02:00
Teemu Ollakka	1267e29b8f	Implementation of client_state NBO operations. - Implemented calls to enter and leave NBO phase one and two - Extended client_state mode checking to include m_nbo - Changed client_state state and mode change sanity checks to print a warning and assert() instead of throwing exceptions to be more graceful in release builds.	2019-12-08 12:52:36 +02:00
Leandro Pacheco	a9987aa970	s_prepared state for XA transactions After the XA PREPARE, the XA transactions stay s_prepared until commit/rollback	2019-10-16 10:15:55 +02:00
Teemu Ollakka	0c54cbd3f8	codership/wsrep-lib#106 Relaxed assumptions about threading model Sanity checks to detect concurrency bugs were assuming a threading model where each client state would always be processed within single thread of execution. This however may be too strong assumption if the application uses some kind of thread pooling. This patch relaxes those assumptions by removing current_thread_id_ from client_state and relaxing assertions against owning_thread_id_. This patch also adds a new method wait_rollback_complete_and_acquire_ownership() into client_state. This method is idempotent and can be used to gain control to client_state before before_command() is called. The method will wait until possible background rollback process is over and marks the state to s_exec to protect the state against new background rollbacks. Other fixes/improvements: - High priority globals state is restored after discarding streaming. - Allowed server_state transition donor -> synced. - Client state method store_globals() was renamed to acquire_ownership() to better describe the intent. Method store_globals() was left for backwards compatibility and marked deprecated.	2019-08-05 15:12:44 +03:00
Alexey Yurchenko	0f676bd893	codership/wsrep-lib#104 Error voting support - populate and pass real error description buffer to provider in case of applying error - return 0 from server_state::on_apply() if error voting confirmed consistency - remove fragments and rollback after fragment applying failure - always release streaming applier on commit or rollback	2019-07-15 03:48:55 +03:00
Teemu Ollakka	fd66bdef0b	codership/wsrep-lib#107 Replace exceptions with assertions Replaced exceptions thrown on debug level sanity checks with assertions to be more graceful with release builds.	2019-07-12 16:15:13 +03:00
Teemu Ollakka	0b09871ad5	Reset client state gtid state in client_state::open() If the application uses caching for client sessions, the client_state object may be reused. This will cause the opened client session to have unexpected value for sync_wait_gtid and last_written_gtid. In order to work around the problem, reset sync_wait_gtid and last_written_gtid in client_state::open().	2019-02-18 15:57:16 +02:00
mkaruza	be98517cb3	Debug log level implementation Debug log will now filter output based on debug level that is enabled.	2019-02-13 13:05:45 +02:00
Teemu Ollakka	20b52ff1dd	Allow direct manipulation of streaming context parameters. Added a method to change streaming context fragment unit and size. The method has a side effect of resetting unit counter.	2019-02-11 16:50:08 +02:00
Daniele Sciascia	a9e2fdccfc	Disable streaming on client_state::close()	2019-01-03 12:18:34 +01:00
Daniele Sciascia	cb93aaa77b	Lost e_error_during_commit if fragment size exceeds maximum size If the size of a SR fragment exceeds the maximum size that the replication provider allows us to replicate, then we are expected to set the client error code to e_error_during_commit. However, client_state::after_statement() unconditionally overrides it to error e_deadlock_error. Fixes client_state::after_statement() so that it overrided the error only if noerror has been set yet.	2018-12-05 16:47:56 +01:00
sjaakola	cfcf34e70f	codership/wsrep-lib#23 before_command() wait for ongoing rollbacks leaks Storing information that background rollbacker in ongoing in client state has_rollback_ This can be used for detecting if there is ongoing background rollback, and client should keep waiting in before_command() entry to avoid conflicts in accessing client state during background rollbacking. transaction::bf_abort() is modified to set has_rollback_ flag when backgroung rollbacking has been assigned for the client sync_rollback_complete() method has been modified to reset the backround rollbacker flag	2018-11-27 12:26:14 +02:00
Teemu Ollakka	7c6ee3f61f	In order to avoid potential deadlocks, release client_state lock when calling server state methods which may acquire server_state mutex. Fixed compilation errors in release mode.	2018-10-15 16:35:19 +03:00
Teemu Ollakka	c0c977f9ab	Added GPLv2 licence and copyright headers.	2018-10-15 15:14:22 +03:00
Teemu Ollakka	0410deee3a	Fixes to streaming rollback. * Check error code from fragment release * Always call streaming rollback from must abort if the transaction is in executing phase. This is needed to ensure that rollback fragment replication happens before rollback starts * Initiate streaming rollback from certify fragment if BF abort happens after fragment certification.	2018-07-19 01:11:02 +03:00
Teemu Ollakka	b02200b1ef	Fixes to streaming rollback * Check fragment removal error code in prepare phase. It is possible that the transaction gets BF aborted during fragment removal. * Mark fragment certified in certify_fragment() even if the provider returns cert failed error. With current wsrep-API error codes it may not be possible to distinquish certification failure and BF abort during fragment replication. This may also be a provider bug. As a result rollback fragment may sometimes be replicated when it would not be necessary.	2018-07-17 14:34:24 +03:00
Teemu Ollakka	9f153be277	Fixes to streaming rollback processing * Count separately fragments certified and fragments stored in streaming context. Storing the fragment may ultimately fail due to BF abort even if the fragment was succesfully certified. Therefore we need to have separate counter for certified fragments to determine if the transaction is streaming and seqnos of fragments which have been succesfully stored. * Provider release is called only after succesful fragment certification and fragment store. * Fixed handling of write sets with rollback flag set in apply_write_set()	2018-07-16 10:07:46 +03:00
Teemu Ollakka	3b9e9e0d0c	SR Rollback handling fixes * Handle BF rollback also in after_statement() call. * Added missing after_apply() call when handling rollback fragment. * Fixed state changes when rollback is starated during preparing state.	2018-07-11 11:39:55 +03:00
Teemu Ollakka	80ca03daaf	Implemented SR transaction rollback.	2018-07-10 14:01:41 +03:00
Teemu Ollakka	95dbab4c08	Made transaction streaming context private and provided accessor method.	2018-07-09 08:49:29 +03:00
Teemu Ollakka	2913aecebd	Pass transaction id instead of client id to storage service append_fragment()	2018-07-07 21:34:58 +03:00
Teemu Ollakka	a8be09161c	Replaced replicating mode with local. The intended purpose for local mode was local storage access without entering replication hooks. However, the same can be achieved with high priority mode. Removed replicating mode and use local instead to denote locally processing clients.	2018-07-07 12:01:14 +03:00
Teemu Ollakka	af18a10a49	Removed is_autocommi() from client_service interface as it is not quite useful as there might not be enough information for it after the statement has been processed. Better to handle retrying on DBMS side. Also removed after_statement_result enumeration and return plain int from after_statement().	2018-07-06 19:48:48 +03:00
Teemu Ollakka	e876418ed3	* Renamed client service rollback() to bf_rollback() to better describe its purpose. * Raise deadlock error for BF aborted transaction in after_statement() call if the error is not set yet.	2018-07-06 15:42:03 +03:00

1 2

65 Commits