In general, the position at which the storage recovers after an SST
cannot be known until the recovery process is over. This in turn
means that the position cannot be known when the server_state
sst_received() method is called. Worked around the problem by
introducing a get_position() method into the server service, which
can be used to read the position from stable storage after the SST
has completed and the state has been recovered.
Instead of handling the error case at the beginning, execute the
middle of the method body only in the success case, leaving a single
call to provider().sst_received() at the end.
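A self-contained sketch of the resulting control flow, using
simplified stand-in types; the real wsrep-lib signatures differ in
detail, and get_position() here is modeled after the description
above:

    // Simplified stand-ins for the wsrep-lib types involved.
    struct gtid { static gtid undefined() { return gtid(); } };
    struct client_service {};
    struct server_service_t
    {
        // Reads the recovered position from stable storage; available
        // only after the SST has completed.
        gtid get_position(client_service&) { return gtid(); }
    };
    struct provider_t
    {
        int sst_received(const gtid&, int error) { return error; }
    };

    struct server_state_sketch
    {
        server_service_t server_service_;
        provider_t provider_;
        provider_t& provider() { return provider_; }

        int sst_received(client_service& cs, int error)
        {
            gtid position(gtid::undefined());
            if (error == 0)
            {
                // Success path in the middle: the position is queried
                // from stable storage instead of being passed in.
                position = server_service_.get_position(cs);
                // ... further success-path processing ...
            }
            // Single exit: notify the provider in both cases.
            return provider().sst_received(position, error);
        }
    };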
Introduced server_state::interrupt_state_waiters() to interrupt
all waiters inside server_state::wait_until_state(). This mechanism
is needed when an error is encountered during state change processing
and waiting threads may need to be interrupted to check and handle
the error condition.
Made server_state::wait_until_state() throw an exception if the
wait was interrupted and the new server state is either disconnecting
or disconnected, which usually indicates an error condition.
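A minimal sketch of the waiter/interrupt mechanism, assuming a plain
std::condition_variable; the real wsrep-lib implementation uses its
own lock and exception types:

    #include <condition_variable>
    #include <mutex>
    #include <stdexcept>

    enum state { s_connected, s_joined, s_disconnecting, s_disconnected };

    class server_state_sketch
    {
    public:
        // Wake all threads blocked in wait_until_state() so they can
        // re-check the server state and handle a possible error.
        void interrupt_state_waiters()
        {
            std::lock_guard<std::mutex> lock(mutex_);
            interrupted_ = true;
            cond_.notify_all();
        }

        void wait_until_state(state desired)
        {
            std::unique_lock<std::mutex> lock(mutex_);
            cond_.wait(lock, [this, desired]()
                       { return state_ == desired || interrupted_; });
            // An interrupted wait while disconnecting/disconnected
            // usually indicates an error, so throw.
            if (interrupted_ &&
                (state_ == s_disconnecting || state_ == s_disconnected))
            {
                throw std::runtime_error("state wait interrupted");
            }
        }
    private:
        std::mutex mutex_;
        std::condition_variable cond_;
        state state_{s_connected};
        bool interrupted_{false};
    };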
Init first join crashed with
server: s1 unallowed state transition: joined -> joined
This was due to a missing check of the current state in
on_primary_view() before changing to the joined state. Added the
appropriate check.
Implemented unit tests for simple IST scenarios.
Replaced all references to provider_ in server_state methods with
calls to provider(), which is virtual and can be overridden by test
classes. The provider pointer may not yet be initialized during unit
tests.
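A sketch of the override pattern under simplified, hypothetical names
(provider_t, mock_provider); tests substitute a mock even though the
real pointer is never set:

    #include <cassert>

    struct provider_t { virtual ~provider_t() {} };
    struct mock_provider : provider_t {};

    class server_state_sketch
    {
    public:
        virtual ~server_state_sketch() {}
        // Methods call provider() instead of dereferencing provider_
        // directly, so test classes can substitute their own instance.
        virtual provider_t& provider()
        {
            assert(provider_);
            return *provider_;
        }
    protected:
        provider_t* provider_{nullptr}; // may still be 0 in unit tests
    };

    class test_server_state : public server_state_sketch
    {
    public:
        provider_t& provider() override { return mock_; }
    private:
        mock_provider mock_;
    };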
The transition joiner -> disconnecting may happen when the joiner
failed to receive the SST successfully. Because the system is in an
undefined state at this point, skip most of the processing in
sst_received() and return control to the caller after notifying the
provider about the failure.
The transition from the server_state connected state to disconnecting
must be allowed in order to deal with errors during server startup.
Added SST first test cases for server_state transitions:
* Successful join via SST
* Error in connect state
* Error in joiner state
Provider desync may return an error if the provider cannot communicate
with the rest of the cluster. However, this is acceptable for example
if the node has dropped from the primary view. Instead of returning an
error immediately after a failed desync(), attempt to pause the
provider regardless of the error. If the pause operation fails, an
error is returned.
In order to avoid a resync in resume_and_resync() in the case where
desync failed in desync_and_pause(), a new member variable
desynced_on_pause_ was introduced to decide whether to resync in
resume_and_resync(). This variable is protected by the pause()/resume()
calls, since they do not allow concurrent pause/resume operations.
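A self-contained sketch of the pairing, with the provider calls
stubbed out; the real methods carry locking and richer error handling:

    class server_state_sketch
    {
    public:
        int desync_and_pause()
        {
            // Desync may legitimately fail, for example when the node
            // has dropped from the primary view; remember the outcome
            // instead of returning early.
            const bool desynced(desync() == 0);
            if (pause() != 0)
            {
                if (desynced) resync();
                return 1; // only a failed pause is treated as an error
            }
            // No extra locking needed here: pause()/resume() do not
            // allow concurrent pause/resume operations.
            desynced_on_pause_ = desynced;
            return 0;
        }

        void resume_and_resync()
        {
            const bool desynced(desynced_on_pause_);
            desynced_on_pause_ = false;
            resume();
            // Skip the resync if the preceding desync never succeeded.
            if (desynced) resync();
        }
    private:
        int desync() { return 0; }  // stub for provider().desync()
        int pause() { return 0; }   // stub for provider().pause()
        void resume() {}            // stub for provider().resume()
        void resync() {}            // stub for provider().resync()
        bool desynced_on_pause_{false};
    };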
Method wsrep::server_state::convert_streaming_client_to_applier() may
insert an entry into the streaming_appliers_ map which contains an
undefined server_id. This happens if the method is called while in
non-primary state, when server_state::id_ is undefined.
The fix is to use the server_id which is recorded in the client's
transaction object.
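An illustrative sketch of the fix with simplified stand-in types; the
real map key and applier types in wsrep-lib are richer:

    #include <map>
    #include <string>
    #include <utility>

    struct id
    {
        std::string value;
        bool operator<(const id& other) const { return value < other.value; }
    };
    struct transaction { id server_id; long long id_; };
    struct client_state { transaction trx; };
    struct high_priority_applier {};

    std::map<std::pair<id, long long>, high_priority_applier>
        streaming_appliers_;

    void convert_streaming_client_to_applier(client_state& client)
    {
        // Key by the server ID recorded in the client's transaction:
        // well defined even when this node is in non-primary state
        // and server_state::id_ is undefined.
        streaming_appliers_.insert(
            std::make_pair(std::make_pair(client.trx.server_id,
                                          client.trx.id_),
                           high_priority_applier()));
    }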
- fixed node ID assertion in on_connect() method, and fixed the
"sanity checks" to allow reconnection to the primary component
- fixed code duplication in on_view() method
Added a call to log_view() to do the internal initializations that
need to be done on receiving a new view. Note however that this is not
a view *event*. Here we only need to configure the application to
comply with a new state it has received, so that it can go on to apply
replication events and catch up with the cluster.
When a member joins the group and needs to receive an SST, it won't
receive the corresponding membership view event, because the SST
happens after the event and already includes the effects of all events
ordered before it. The view must therefore be recovered from the
received state.
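A sketch of that recovery step under assumed names (get_view(),
log_view()); the actual wsrep-lib hooks and signatures may differ:

    // Simplified stand-ins; names are assumptions, not the real API.
    struct view {};
    struct client_service {};
    struct server_service_t
    {
        // Reads the membership view back from the state delivered by
        // the SST.
        view get_view(client_service&) { return view(); }
        // Performs the internal initializations for the new view;
        // this is state synchronization, not a view event.
        void log_view(const view&) {}
    };

    void on_sst_complete(server_service_t& service, client_service& cs)
    {
        view recovered(service.get_view(cs));
        service.log_view(recovered);
    }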
Minor renames and cleanups.
References codership/wsrep-lib#18
it on disconnect.
- Don't rely on the node's own index from the view, because the view
may come from another member (IST/SST); instead, always determine the
own index from the own ID.
Refs codership/wsrep-lib#13
Moved SR fragment removal for total order BF-aborted SR transactions
into the after_rollback() call, to avoid deadlocking while trying to
access storage before rolling back the transaction.
* Release server lock temporarily when BF aborting local SR
transaction during view event processing
* Check transaction state for BF aborts in before_prepare() after
the lock has been acquired again after fragment removal
* Send rollback fragment only from streaming_rollback()
* Check fragment removal error code in prepare phase. It is possible
that the transaction gets BF aborted during fragment removal.
* Mark fragment certified in certify_fragment() even if the provider
returns cert failed error. With current wsrep-API error codes
it may not be possible to distinguish certification failure
from BF abort during fragment replication. This may also be a
provider bug. As a result a rollback fragment may sometimes be
replicated when it would not be necessary.
* Count fragments certified and fragments stored separately in
streaming context (see the sketch after this list). Storing the
fragment may ultimately fail due to BF abort even if the fragment
was successfully certified. Therefore we need a separate counter for
certified fragments to determine whether the transaction is streaming,
and the seqnos of fragments which have been successfully stored.
* Provider release is called only after successful fragment
certification and fragment store.
* Fixed handling of write sets with rollback flag set in apply_write_set()
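A sketch of the separate counters mentioned above, with simplified
stand-in types; the real streaming context in wsrep-lib tracks more
state:

    #include <cstddef>
    #include <vector>

    class streaming_context_sketch
    {
    public:
        // Called after a fragment has passed certification.
        void certified() { ++fragments_certified_; }
        // Called after a fragment has also been stored successfully.
        void stored(long long seqno) { fragments_.push_back(seqno); }

        // The transaction counts as streaming once at least one
        // fragment has been certified, even if none was stored yet.
        std::size_t fragments_certified() const
        { return fragments_certified_; }
        // Seqnos of successfully stored fragments, e.g. for removal.
        const std::vector<long long>& fragments() const
        { return fragments_; }
    private:
        std::size_t fragments_certified_{0};
        std::vector<long long> fragments_;
    };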
SR transactions are BF aborted or rolled back on primary view
changes according to the following rules:
* Ongoing local SR transactions are BF aborted if the processing
server is not found in the current view.
* All remote SR transactions whose origin server is not included in the
current view are rolled back.
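A sketch of applying the two rules on a primary view change; the
membership lookup and the abort/rollback calls are stubbed
assumptions:

    #include <algorithm>
    #include <string>
    #include <vector>

    struct id
    {
        std::string value;
        bool operator==(const id& other) const
        { return value == other.value; }
    };
    struct sr_transaction { id origin; };

    static bool in_view(const std::vector<id>& members, const id& server)
    {
        return std::find(members.begin(), members.end(), server)
            != members.end();
    }

    static void bf_abort_local_sr(sr_transaction&) {}  // stub
    static void rollback_remote_sr(sr_transaction&) {} // stub

    void on_primary_view(const std::vector<id>& members, const id& own_id,
                         std::vector<sr_transaction>& local_sr,
                         std::vector<sr_transaction>& remote_sr)
    {
        // Rule 1: BF abort ongoing local SR transactions if this
        // server is not found in the current view.
        if (!in_view(members, own_id))
            for (sr_transaction& trx : local_sr)
                bf_abort_local_sr(trx);
        // Rule 2: roll back remote SR transactions whose origin
        // server is not included in the current view.
        for (sr_transaction& trx : remote_sr)
            if (!in_view(members, trx.origin))
                rollback_remote_sr(trx);
    }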
* Enable codepath to BF abort high priority SR applier
* Pass ws_handle, ws_meta to high priority service rollback
call to allow total ordering of rollback process
* Added server_id into transaction in order to be able to stop
streaming applier during high priority BF abort
* Added missing commit fragment applying
* Don't clear fragments for replaying SR transaction
* Handle BF rollback also in after_statement() call.
* Added missing after_apply() call when handling rollback fragment.
* Fixed state changes when rollback is started during preparing state.
The write set handle and meta data are needed for SR transactions
where the commit context is not known when the transaction starts.
The handle and meta data can be set through the client_state
prepare_for_ordering() call before performing the commit.
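A sketch of the intended call sequence with simplified stand-in types;
the exact prepare_for_ordering() signature in wsrep-lib may differ:

    struct ws_handle {};
    struct ws_meta {};

    class client_state_sketch
    {
    public:
        // Record the ordering context for a transaction whose commit
        // context was not known when the transaction started.
        void prepare_for_ordering(const ws_handle& handle,
                                  const ws_meta& meta,
                                  bool is_ordered)
        {
            ws_handle_ = handle;
            ws_meta_ = meta;
            is_ordered_ = is_ordered;
        }
        // ... before_commit() etc. then use the stored handle/meta ...
    private:
        ws_handle ws_handle_;
        ws_meta ws_meta_;
        bool is_ordered_{false};
    };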
The interface method can be used to notify the DBMS implementation
about state changes in a well defined order. The call is made under
server_state mutex protection.
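A sketch of the notification pattern; the callback name
log_state_change() and the surrounding types are simplified
assumptions:

    #include <mutex>

    enum state { s_connected, s_joined, s_synced };

    struct server_service_t
    {
        // Notified for every state change, always under the
        // server_state mutex, so changes arrive in a well defined
        // order.
        virtual void log_state_change(state prev, state next) = 0;
        virtual ~server_service_t() {}
    };

    class server_state_sketch
    {
    public:
        explicit server_state_sketch(server_service_t& service)
            : service_(service) { }
        // Caller holds mutex_ via `lock`; the notification happens
        // before the lock is released.
        void change_state(std::unique_lock<std::mutex>& lock, state next)
        {
            (void)lock; // assumed to hold mutex_
            state prev(state_);
            state_ = next;
            service_.log_state_change(prev, next);
        }
    private:
        server_service_t& service_;
        std::mutex mutex_;
        state state_{s_connected};
    };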