1
0
mirror of https://github.com/postgres/postgres.git synced 2025-10-25 13:17:41 +03:00

Allow altering of two_phase option of a SUBSCRIPTION.

The two_phase option is controlled by both the publisher (as a slot
option) and the subscriber (as a subscription option), so the slot option
must also be modified.

Changing the 'two_phase' option for a subscription from 'true' to 'false'
is permitted only when there are no pending prepared transactions
corresponding to that subscription. Otherwise, the changes of already
prepared transactions can be replicated again along with their corresponding
commit leading to duplicate data or errors.

To avoid data loss, the 'two_phase' option for a subscription can only be
changed from 'false' to 'true' once the initial data synchronization is
completed. Therefore this is performed later by the logical replication worker.

Author: Hayato Kuroda, Ajin Cherian, Amit Kapila
Reviewed-by: Peter Smith, Hou Zhijie, Amit Kapila, Vitaly Davydov, Vignesh C
Discussion: https://postgr.es/m/8fab8-65d74c80-1-2f28e880@39088166
This commit is contained in:
Amit Kapila
2024-07-24 10:13:36 +05:30
parent 774d47b6c0
commit 1462aad2e4
17 changed files with 458 additions and 120 deletions

View File

@@ -2192,7 +2192,23 @@ psql "dbname=postgres replication=database" -c "IDENTIFY_SYSTEM;"
</varlistentry> </varlistentry>
</variablelist> </variablelist>
<para>The following option is supported:</para> <para>The following options are supported:</para>
<variablelist>
<varlistentry>
<term><literal>TWO_PHASE [ <replaceable class="parameter">boolean</replaceable> ]</literal></term>
<listitem>
<para>
If true, this logical replication slot supports decoding of two-phase
commit. With this option, commands related to two-phase commit such as
<literal>PREPARE TRANSACTION</literal>, <literal>COMMIT PREPARED</literal>
and <literal>ROLLBACK PREPARED</literal> are decoded and transmitted.
The transaction will be decoded and transmitted at
<literal>PREPARE TRANSACTION</literal> time.
</para>
</listitem>
</varlistentry>
</variablelist>
<variablelist> <variablelist>
<varlistentry> <varlistentry>

View File

@@ -68,8 +68,9 @@ ALTER SUBSCRIPTION <replaceable class="parameter">name</replaceable> RENAME TO <
<para> <para>
Commands <command>ALTER SUBSCRIPTION ... REFRESH PUBLICATION</command>, Commands <command>ALTER SUBSCRIPTION ... REFRESH PUBLICATION</command>,
<command>ALTER SUBSCRIPTION ... {SET|ADD|DROP} PUBLICATION ...</command> <command>ALTER SUBSCRIPTION ... {SET|ADD|DROP} PUBLICATION ...</command>
with <literal>refresh</literal> option as <literal>true</literal> and with <literal>refresh</literal> option as <literal>true</literal>,
<command>ALTER SUBSCRIPTION ... SET (failover = true|false)</command> <command>ALTER SUBSCRIPTION ... SET (failover = true|false)</command> and
<command>ALTER SUBSCRIPTION ... SET (two_phase = false)</command>
cannot be executed inside a transaction block. cannot be executed inside a transaction block.
These commands also cannot be executed when the subscription has These commands also cannot be executed when the subscription has
@@ -228,8 +229,9 @@ ALTER SUBSCRIPTION <replaceable class="parameter">name</replaceable> RENAME TO <
<link linkend="sql-createsubscription-params-with-disable-on-error"><literal>disable_on_error</literal></link>, <link linkend="sql-createsubscription-params-with-disable-on-error"><literal>disable_on_error</literal></link>,
<link linkend="sql-createsubscription-params-with-password-required"><literal>password_required</literal></link>, <link linkend="sql-createsubscription-params-with-password-required"><literal>password_required</literal></link>,
<link linkend="sql-createsubscription-params-with-run-as-owner"><literal>run_as_owner</literal></link>, <link linkend="sql-createsubscription-params-with-run-as-owner"><literal>run_as_owner</literal></link>,
<link linkend="sql-createsubscription-params-with-origin"><literal>origin</literal></link>, and <link linkend="sql-createsubscription-params-with-origin"><literal>origin</literal></link>,
<link linkend="sql-createsubscription-params-with-failover"><literal>failover</literal></link>. <link linkend="sql-createsubscription-params-with-failover"><literal>failover</literal></link>, and
<link linkend="sql-createsubscription-params-with-two-phase"><literal>two_phase</literal></link>.
Only a superuser can set <literal>password_required = false</literal>. Only a superuser can set <literal>password_required = false</literal>.
</para> </para>
@@ -252,6 +254,32 @@ ALTER SUBSCRIPTION <replaceable class="parameter">name</replaceable> RENAME TO <
<link linkend="sql-createsubscription-params-with-failover"><literal>failover</literal></link> <link linkend="sql-createsubscription-params-with-failover"><literal>failover</literal></link>
option is enabled. option is enabled.
</para> </para>
<para>
The <link linkend="sql-createsubscription-params-with-failover"><literal>failover</literal></link>
and <link linkend="sql-createsubscription-params-with-two-phase"><literal>two_phase</literal></link>
parameters can only be altered when the subscription is disabled.
</para>
<para>
When altering <link linkend="sql-createsubscription-params-with-two-phase"><literal>two_phase</literal></link>
from <literal>true</literal> to <literal>false</literal>, the backend
process reports an error if any prepared transactions done by the
logical replication worker (from when <literal>two_phase</literal>
parameter was still <literal>true</literal>) are found. You can resolve
prepared transactions on the publisher node, or manually roll back them
on the subscriber, and then try again. The transactions prepared by
logical replication worker corresponding to a particular subscription have
the following pattern: <quote><literal>pg_gid_%u_%u</literal></quote>
(parameters: subscription <parameter>oid</parameter>, remote transaction id <parameter>xid</parameter>).
To resolve such transactions manually, you need to roll back all
the prepared transactions with corresponding subscription IDs in their
names. Applications can check
<link linkend="view-pg-prepared-xacts"><structname>pg_prepared_xacts</structname></link>
to find the required prepared transactions. After the <literal>two_phase</literal>
option is changed from <literal>true</literal> to <literal>false</literal>,
the publisher will replicate the transactions again when they are committed.
</para>
</listitem> </listitem>
</varlistentry> </varlistentry>

View File

@@ -2681,3 +2681,82 @@ LookupGXact(const char *gid, XLogRecPtr prepare_end_lsn,
LWLockRelease(TwoPhaseStateLock); LWLockRelease(TwoPhaseStateLock);
return found; return found;
} }
/*
* TwoPhaseTransactionGid
* Form the prepared transaction GID for two_phase transactions.
*
* Return the GID in the supplied buffer.
*/
void
TwoPhaseTransactionGid(Oid subid, TransactionId xid, char *gid_res, int szgid)
{
Assert(OidIsValid(subid));
if (!TransactionIdIsValid(xid))
ereport(ERROR,
(errcode(ERRCODE_PROTOCOL_VIOLATION),
errmsg_internal("invalid two-phase transaction ID")));
snprintf(gid_res, szgid, "pg_gid_%u_%u", subid, xid);
}
/*
* IsTwoPhaseTransactionGidForSubid
* Check whether the given GID (as formed by TwoPhaseTransactionGid) is
* for the specified 'subid'.
*/
static bool
IsTwoPhaseTransactionGidForSubid(Oid subid, char *gid)
{
int ret;
Oid subid_from_gid;
TransactionId xid_from_gid;
char gid_tmp[GIDSIZE];
/* Extract the subid and xid from the given GID */
ret = sscanf(gid, "pg_gid_%u_%u", &subid_from_gid, &xid_from_gid);
/*
* Check that the given GID has expected format, and at least the subid
* matches.
*/
if (ret != 2 || subid != subid_from_gid)
return false;
/*
* Reconstruct a temporary GID based on the subid and xid extracted from
* the given GID and check whether the temporary GID and the given GID
* match.
*/
TwoPhaseTransactionGid(subid, xid_from_gid, gid_tmp, sizeof(gid_tmp));
return strcmp(gid, gid_tmp) == 0;
}
/*
* LookupGXactBySubid
* Check if the prepared transaction done by apply worker exists.
*/
bool
LookupGXactBySubid(Oid subid)
{
bool found = false;
LWLockAcquire(TwoPhaseStateLock, LW_SHARED);
for (int i = 0; i < TwoPhaseState->numPrepXacts; i++)
{
GlobalTransaction gxact = TwoPhaseState->prepXacts[i];
/* Ignore not-yet-valid GIDs. */
if (gxact->valid &&
IsTwoPhaseTransactionGidForSubid(subid, gxact->gid))
{
found = true;
break;
}
}
LWLockRelease(TwoPhaseStateLock);
return found;
}

View File

@@ -16,6 +16,7 @@
#include "access/htup_details.h" #include "access/htup_details.h"
#include "access/table.h" #include "access/table.h"
#include "access/twophase.h"
#include "access/xact.h" #include "access/xact.h"
#include "catalog/catalog.h" #include "catalog/catalog.h"
#include "catalog/dependency.h" #include "catalog/dependency.h"
@@ -109,6 +110,8 @@ static void check_publications_origin(WalReceiverConn *wrconn,
static void check_duplicates_in_publist(List *publist, Datum *datums); static void check_duplicates_in_publist(List *publist, Datum *datums);
static List *merge_publications(List *oldpublist, List *newpublist, bool addpub, const char *subname); static List *merge_publications(List *oldpublist, List *newpublist, bool addpub, const char *subname);
static void ReportSlotConnectionError(List *rstates, Oid subid, char *slotname, char *err); static void ReportSlotConnectionError(List *rstates, Oid subid, char *slotname, char *err);
static void CheckAlterSubOption(Subscription *sub, const char *option,
bool slot_needs_update, bool isTopLevel);
/* /*
@@ -259,21 +262,9 @@ parse_subscription_options(ParseState *pstate, List *stmt_options,
opts->specified_opts |= SUBOPT_STREAMING; opts->specified_opts |= SUBOPT_STREAMING;
opts->streaming = defGetStreamingMode(defel); opts->streaming = defGetStreamingMode(defel);
} }
else if (strcmp(defel->defname, "two_phase") == 0) else if (IsSet(supported_opts, SUBOPT_TWOPHASE_COMMIT) &&
strcmp(defel->defname, "two_phase") == 0)
{ {
/*
* Do not allow toggling of two_phase option. Doing so could cause
* missing of transactions and lead to an inconsistent replica.
* See comments atop worker.c
*
* Note: Unsupported twophase indicates that this call originated
* from AlterSubscription.
*/
if (!IsSet(supported_opts, SUBOPT_TWOPHASE_COMMIT))
ereport(ERROR,
(errcode(ERRCODE_SYNTAX_ERROR),
errmsg("unrecognized subscription parameter: \"%s\"", defel->defname)));
if (IsSet(opts->specified_opts, SUBOPT_TWOPHASE_COMMIT)) if (IsSet(opts->specified_opts, SUBOPT_TWOPHASE_COMMIT))
errorConflictingDefElem(defel, pstate); errorConflictingDefElem(defel, pstate);
@@ -1079,6 +1070,60 @@ AlterSubscription_refresh(Subscription *sub, bool copy_data,
table_close(rel, NoLock); table_close(rel, NoLock);
} }
/*
* Common checks for altering failover and two_phase options.
*/
static void
CheckAlterSubOption(Subscription *sub, const char *option,
bool slot_needs_update, bool isTopLevel)
{
/*
* The checks in this function are required only for failover and
* two_phase options.
*/
Assert(strcmp(option, "failover") == 0 ||
strcmp(option, "two_phase") == 0);
/*
* Do not allow changing the option if the subscription is enabled. This
* is because both failover and two_phase options of the slot on the
* publisher cannot be modified if the slot is currently acquired by the
* existing walsender.
*
* Note that two_phase is enabled (aka changed from 'false' to 'true') on
* the publisher by the existing walsender, so we could have allowed that
* even when the subscription is enabled. But we kept this restriction for
* the sake of consistency and simplicity.
*/
if (sub->enabled)
ereport(ERROR,
(errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
errmsg("cannot set %s for enabled subscription",
option)));
if (slot_needs_update)
{
StringInfoData cmd;
/*
* A valid slot must be associated with the subscription for us to
* modify any of the slot's properties.
*/
if (!sub->slotname)
ereport(ERROR,
(errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
errmsg("cannot set %s for a subscription that does not have a slot name",
option)));
/* The changed option of the slot can't be rolled back. */
initStringInfo(&cmd);
appendStringInfo(&cmd, "ALTER SUBSCRIPTION ... SET (%s)", option);
PreventInTransactionBlock(isTopLevel, cmd.data);
pfree(cmd.data);
}
}
/* /*
* Alter the existing subscription. * Alter the existing subscription.
*/ */
@@ -1094,6 +1139,8 @@ AlterSubscription(ParseState *pstate, AlterSubscriptionStmt *stmt,
HeapTuple tup; HeapTuple tup;
Oid subid; Oid subid;
bool update_tuple = false; bool update_tuple = false;
bool update_failover = false;
bool update_two_phase = false;
Subscription *sub; Subscription *sub;
Form_pg_subscription form; Form_pg_subscription form;
bits32 supported_opts; bits32 supported_opts;
@@ -1145,7 +1192,8 @@ AlterSubscription(ParseState *pstate, AlterSubscriptionStmt *stmt,
{ {
supported_opts = (SUBOPT_SLOT_NAME | supported_opts = (SUBOPT_SLOT_NAME |
SUBOPT_SYNCHRONOUS_COMMIT | SUBOPT_BINARY | SUBOPT_SYNCHRONOUS_COMMIT | SUBOPT_BINARY |
SUBOPT_STREAMING | SUBOPT_DISABLE_ON_ERR | SUBOPT_STREAMING | SUBOPT_TWOPHASE_COMMIT |
SUBOPT_DISABLE_ON_ERR |
SUBOPT_PASSWORD_REQUIRED | SUBOPT_PASSWORD_REQUIRED |
SUBOPT_RUN_AS_OWNER | SUBOPT_FAILOVER | SUBOPT_RUN_AS_OWNER | SUBOPT_FAILOVER |
SUBOPT_ORIGIN); SUBOPT_ORIGIN);
@@ -1227,31 +1275,81 @@ AlterSubscription(ParseState *pstate, AlterSubscriptionStmt *stmt,
replaces[Anum_pg_subscription_subrunasowner - 1] = true; replaces[Anum_pg_subscription_subrunasowner - 1] = true;
} }
if (IsSet(opts.specified_opts, SUBOPT_TWOPHASE_COMMIT))
{
/*
* We need to update both the slot and the subscription
* for the two_phase option. We can enable the two_phase
* option for a slot only once the initial data
* synchronization is done. This is to avoid missing some
* data as explained in comments atop worker.c.
*/
update_two_phase = !opts.twophase;
CheckAlterSubOption(sub, "two_phase", update_two_phase,
isTopLevel);
/*
* Modifying the two_phase slot option requires a slot
* lookup by slot name, so changing the slot name at the
* same time is not allowed.
*/
if (update_two_phase &&
IsSet(opts.specified_opts, SUBOPT_SLOT_NAME))
ereport(ERROR,
(errcode(ERRCODE_SYNTAX_ERROR),
errmsg("slot_name and two_phase cannot be altered at the same time")));
/*
* Note that workers may still survive even if the
* subscription has been disabled.
*
* Ensure workers have already been exited to avoid
* getting prepared transactions while we are disabling
* the two_phase option. Otherwise, the changes of an
* already prepared transaction can be replicated again
* along with its corresponding commit, leading to
* duplicate data or errors.
*/
if (logicalrep_workers_find(subid, true, true))
ereport(ERROR,
(errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
errmsg("cannot alter two_phase when logical replication worker is still running"),
errhint("Try again after some time.")));
/*
* two_phase cannot be disabled if there are any
* uncommitted prepared transactions present otherwise it
* can lead to duplicate data or errors as explained in
* the comment above.
*/
if (update_two_phase &&
sub->twophasestate == LOGICALREP_TWOPHASE_STATE_ENABLED &&
LookupGXactBySubid(subid))
ereport(ERROR,
(errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
errmsg("cannot disable two_phase when prepared transactions are present"),
errhint("Resolve these transactions and try again.")));
/* Change system catalog accordingly */
values[Anum_pg_subscription_subtwophasestate - 1] =
CharGetDatum(opts.twophase ?
LOGICALREP_TWOPHASE_STATE_PENDING :
LOGICALREP_TWOPHASE_STATE_DISABLED);
replaces[Anum_pg_subscription_subtwophasestate - 1] = true;
}
if (IsSet(opts.specified_opts, SUBOPT_FAILOVER)) if (IsSet(opts.specified_opts, SUBOPT_FAILOVER))
{ {
if (!sub->slotname)
ereport(ERROR,
(errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
errmsg("cannot set %s for a subscription that does not have a slot name",
"failover")));
/* /*
* Do not allow changing the failover state if the * Similar to the two_phase case above, we need to update
* subscription is enabled. This is because the failover * the failover option for both the slot and the
* state of the slot on the publisher cannot be modified * subscription.
* if the slot is currently acquired by the apply worker.
*/ */
if (sub->enabled) update_failover = true;
ereport(ERROR,
(errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
errmsg("cannot set %s for enabled subscription",
"failover")));
/* CheckAlterSubOption(sub, "failover", update_failover,
* The changed failover option of the slot can't be rolled isTopLevel);
* back.
*/
PreventInTransactionBlock(isTopLevel, "ALTER SUBSCRIPTION ... SET (failover)");
values[Anum_pg_subscription_subfailover - 1] = values[Anum_pg_subscription_subfailover - 1] =
BoolGetDatum(opts.failover); BoolGetDatum(opts.failover);
@@ -1501,13 +1599,14 @@ AlterSubscription(ParseState *pstate, AlterSubscriptionStmt *stmt,
} }
/* /*
* Try to acquire the connection necessary for altering slot. * Try to acquire the connection necessary for altering the slot, if
* needed.
* *
* This has to be at the end because otherwise if there is an error while * This has to be at the end because otherwise if there is an error while
* doing the database operations we won't be able to rollback altered * doing the database operations we won't be able to rollback altered
* slot. * slot.
*/ */
if (replaces[Anum_pg_subscription_subfailover - 1]) if (update_failover || update_two_phase)
{ {
bool must_use_password; bool must_use_password;
char *err; char *err;
@@ -1528,7 +1627,9 @@ AlterSubscription(ParseState *pstate, AlterSubscriptionStmt *stmt,
PG_TRY(); PG_TRY();
{ {
walrcv_alter_slot(wrconn, sub->slotname, opts.failover); walrcv_alter_slot(wrconn, sub->slotname,
update_failover ? &opts.failover : NULL,
update_two_phase ? &opts.twophase : NULL);
} }
PG_FINALLY(); PG_FINALLY();
{ {
@@ -1675,9 +1776,7 @@ DropSubscription(DropSubscriptionStmt *stmt, bool isTopLevel)
* New workers won't be started because we hold an exclusive lock on the * New workers won't be started because we hold an exclusive lock on the
* subscription till the end of the transaction. * subscription till the end of the transaction.
*/ */
LWLockAcquire(LogicalRepWorkerLock, LW_SHARED); subworkers = logicalrep_workers_find(subid, false, true);
subworkers = logicalrep_workers_find(subid, false);
LWLockRelease(LogicalRepWorkerLock);
foreach(lc, subworkers) foreach(lc, subworkers)
{ {
LogicalRepWorker *w = (LogicalRepWorker *) lfirst(lc); LogicalRepWorker *w = (LogicalRepWorker *) lfirst(lc);

View File

@@ -80,7 +80,7 @@ static char *libpqrcv_create_slot(WalReceiverConn *conn,
CRSSnapshotAction snapshot_action, CRSSnapshotAction snapshot_action,
XLogRecPtr *lsn); XLogRecPtr *lsn);
static void libpqrcv_alter_slot(WalReceiverConn *conn, const char *slotname, static void libpqrcv_alter_slot(WalReceiverConn *conn, const char *slotname,
bool failover); const bool *failover, const bool *two_phase);
static pid_t libpqrcv_get_backend_pid(WalReceiverConn *conn); static pid_t libpqrcv_get_backend_pid(WalReceiverConn *conn);
static WalRcvExecResult *libpqrcv_exec(WalReceiverConn *conn, static WalRcvExecResult *libpqrcv_exec(WalReceiverConn *conn,
const char *query, const char *query,
@@ -1121,15 +1121,27 @@ libpqrcv_create_slot(WalReceiverConn *conn, const char *slotname,
*/ */
static void static void
libpqrcv_alter_slot(WalReceiverConn *conn, const char *slotname, libpqrcv_alter_slot(WalReceiverConn *conn, const char *slotname,
bool failover) const bool *failover, const bool *two_phase)
{ {
StringInfoData cmd; StringInfoData cmd;
PGresult *res; PGresult *res;
initStringInfo(&cmd); initStringInfo(&cmd);
appendStringInfo(&cmd, "ALTER_REPLICATION_SLOT %s ( FAILOVER %s )", appendStringInfo(&cmd, "ALTER_REPLICATION_SLOT %s ( ",
quote_identifier(slotname), quote_identifier(slotname));
failover ? "true" : "false");
if (failover)
appendStringInfo(&cmd, "FAILOVER %s",
*failover ? "true" : "false");
if (failover && two_phase)
appendStringInfo(&cmd, ", ");
if (two_phase)
appendStringInfo(&cmd, "TWO_PHASE %s",
*two_phase ? "true" : "false");
appendStringInfoString(&cmd, " );");
res = libpqrcv_PQexec(conn->streamConn, cmd.data); res = libpqrcv_PQexec(conn->streamConn, cmd.data);
pfree(cmd.data); pfree(cmd.data);

View File

@@ -272,11 +272,14 @@ logicalrep_worker_find(Oid subid, Oid relid, bool only_running)
* the subscription, instead of just one. * the subscription, instead of just one.
*/ */
List * List *
logicalrep_workers_find(Oid subid, bool only_running) logicalrep_workers_find(Oid subid, bool only_running, bool acquire_lock)
{ {
int i; int i;
List *res = NIL; List *res = NIL;
if (acquire_lock)
LWLockAcquire(LogicalRepWorkerLock, LW_SHARED);
Assert(LWLockHeldByMe(LogicalRepWorkerLock)); Assert(LWLockHeldByMe(LogicalRepWorkerLock));
/* Search for attached worker for a given subscription id. */ /* Search for attached worker for a given subscription id. */
@@ -288,6 +291,9 @@ logicalrep_workers_find(Oid subid, bool only_running)
res = lappend(res, w); res = lappend(res, w);
} }
if (acquire_lock)
LWLockRelease(LogicalRepWorkerLock);
return res; return res;
} }
@@ -759,7 +765,7 @@ logicalrep_worker_detach(void)
LWLockAcquire(LogicalRepWorkerLock, LW_SHARED); LWLockAcquire(LogicalRepWorkerLock, LW_SHARED);
workers = logicalrep_workers_find(MyLogicalRepWorker->subid, true); workers = logicalrep_workers_find(MyLogicalRepWorker->subid, true, false);
foreach(lc, workers) foreach(lc, workers)
{ {
LogicalRepWorker *w = (LogicalRepWorker *) lfirst(lc); LogicalRepWorker *w = (LogicalRepWorker *) lfirst(lc);

View File

@@ -401,9 +401,6 @@ static void apply_handle_tuple_routing(ApplyExecutionData *edata,
LogicalRepTupleData *newtup, LogicalRepTupleData *newtup,
CmdType operation); CmdType operation);
/* Compute GID for two_phase transactions */
static void TwoPhaseTransactionGid(Oid subid, TransactionId xid, char *gid, int szgid);
/* Functions for skipping changes */ /* Functions for skipping changes */
static void maybe_start_skipping_changes(XLogRecPtr finish_lsn); static void maybe_start_skipping_changes(XLogRecPtr finish_lsn);
static void stop_skipping_changes(void); static void stop_skipping_changes(void);
@@ -3911,7 +3908,7 @@ maybe_reread_subscription(void)
/* !slotname should never happen when enabled is true. */ /* !slotname should never happen when enabled is true. */
Assert(newsub->slotname); Assert(newsub->slotname);
/* two-phase should not be altered */ /* two-phase cannot be altered while the worker is running */
Assert(newsub->twophasestate == MySubscription->twophasestate); Assert(newsub->twophasestate == MySubscription->twophasestate);
/* /*
@@ -4396,24 +4393,6 @@ cleanup_subxact_info()
subxact_data.nsubxacts_max = 0; subxact_data.nsubxacts_max = 0;
} }
/*
* Form the prepared transaction GID for two_phase transactions.
*
* Return the GID in the supplied buffer.
*/
static void
TwoPhaseTransactionGid(Oid subid, TransactionId xid, char *gid, int szgid)
{
Assert(subid != InvalidRepOriginId);
if (!TransactionIdIsValid(xid))
ereport(ERROR,
(errcode(ERRCODE_PROTOCOL_VIOLATION),
errmsg_internal("invalid two-phase transaction ID")));
snprintf(gid, szgid, "pg_gid_%u_%u", subid, xid);
}
/* /*
* Common function to run the apply loop with error handling. Disable the * Common function to run the apply loop with error handling. Disable the
* subscription, if necessary. * subscription, if necessary.
@@ -5014,7 +4993,7 @@ AtEOXact_LogicalRepWorkers(bool isCommit)
List *workers; List *workers;
ListCell *lc2; ListCell *lc2;
workers = logicalrep_workers_find(subid, true); workers = logicalrep_workers_find(subid, true, false);
foreach(lc2, workers) foreach(lc2, workers)
{ {
LogicalRepWorker *worker = (LogicalRepWorker *) lfirst(lc2); LogicalRepWorker *worker = (LogicalRepWorker *) lfirst(lc2);

View File

@@ -804,9 +804,13 @@ ReplicationSlotDrop(const char *name, bool nowait)
* Change the definition of the slot identified by the specified name. * Change the definition of the slot identified by the specified name.
*/ */
void void
ReplicationSlotAlter(const char *name, bool failover) ReplicationSlotAlter(const char *name, const bool *failover,
const bool *two_phase)
{ {
bool update_slot = false;
Assert(MyReplicationSlot == NULL); Assert(MyReplicationSlot == NULL);
Assert(failover || two_phase);
ReplicationSlotAcquire(name, false); ReplicationSlotAcquire(name, false);
@@ -832,28 +836,45 @@ ReplicationSlotAlter(const char *name, bool failover)
* Do not allow users to enable failover on the standby as we do not * Do not allow users to enable failover on the standby as we do not
* support sync to the cascading standby. * support sync to the cascading standby.
*/ */
if (failover) if (failover && *failover)
ereport(ERROR, ereport(ERROR,
errcode(ERRCODE_FEATURE_NOT_SUPPORTED), errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
errmsg("cannot enable failover for a replication slot" errmsg("cannot enable failover for a replication slot"
" on the standby")); " on the standby"));
} }
/* if (failover)
* Do not allow users to enable failover for temporary slots as we do not {
* support syncing temporary slots to the standby. /*
*/ * Do not allow users to enable failover for temporary slots as we do
if (failover && MyReplicationSlot->data.persistency == RS_TEMPORARY) * not support syncing temporary slots to the standby.
ereport(ERROR, */
errcode(ERRCODE_FEATURE_NOT_SUPPORTED), if (*failover && MyReplicationSlot->data.persistency == RS_TEMPORARY)
errmsg("cannot enable failover for a temporary replication slot")); ereport(ERROR,
errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
errmsg("cannot enable failover for a temporary replication slot"));
if (MyReplicationSlot->data.failover != failover) if (MyReplicationSlot->data.failover != *failover)
{
SpinLockAcquire(&MyReplicationSlot->mutex);
MyReplicationSlot->data.failover = *failover;
SpinLockRelease(&MyReplicationSlot->mutex);
update_slot = true;
}
}
if (two_phase && MyReplicationSlot->data.two_phase != *two_phase)
{ {
SpinLockAcquire(&MyReplicationSlot->mutex); SpinLockAcquire(&MyReplicationSlot->mutex);
MyReplicationSlot->data.failover = failover; MyReplicationSlot->data.two_phase = *two_phase;
SpinLockRelease(&MyReplicationSlot->mutex); SpinLockRelease(&MyReplicationSlot->mutex);
update_slot = true;
}
if (update_slot)
{
ReplicationSlotMarkDirty(); ReplicationSlotMarkDirty();
ReplicationSlotSave(); ReplicationSlotSave();
} }

View File

@@ -1407,12 +1407,15 @@ DropReplicationSlot(DropReplicationSlotCmd *cmd)
} }
/* /*
* Process extra options given to ALTER_REPLICATION_SLOT. * Change the definition of a replication slot.
*/ */
static void static void
ParseAlterReplSlotOptions(AlterReplicationSlotCmd *cmd, bool *failover) AlterReplicationSlot(AlterReplicationSlotCmd *cmd)
{ {
bool failover_given = false; bool failover_given = false;
bool two_phase_given = false;
bool failover;
bool two_phase;
/* Parse options */ /* Parse options */
foreach_ptr(DefElem, defel, cmd->options) foreach_ptr(DefElem, defel, cmd->options)
@@ -1424,23 +1427,24 @@ ParseAlterReplSlotOptions(AlterReplicationSlotCmd *cmd, bool *failover)
(errcode(ERRCODE_SYNTAX_ERROR), (errcode(ERRCODE_SYNTAX_ERROR),
errmsg("conflicting or redundant options"))); errmsg("conflicting or redundant options")));
failover_given = true; failover_given = true;
*failover = defGetBoolean(defel); failover = defGetBoolean(defel);
}
else if (strcmp(defel->defname, "two_phase") == 0)
{
if (two_phase_given)
ereport(ERROR,
(errcode(ERRCODE_SYNTAX_ERROR),
errmsg("conflicting or redundant options")));
two_phase_given = true;
two_phase = defGetBoolean(defel);
} }
else else
elog(ERROR, "unrecognized option: %s", defel->defname); elog(ERROR, "unrecognized option: %s", defel->defname);
} }
}
/* ReplicationSlotAlter(cmd->slotname,
* Change the definition of a replication slot. failover_given ? &failover : NULL,
*/ two_phase_given ? &two_phase : NULL);
static void
AlterReplicationSlot(AlterReplicationSlotCmd *cmd)
{
bool failover = false;
ParseAlterReplSlotOptions(cmd, &failover);
ReplicationSlotAlter(cmd->slotname, failover);
} }
/* /*

View File

@@ -1948,7 +1948,7 @@ psql_completion(const char *text, int start, int end)
else if (HeadMatches("ALTER", "SUBSCRIPTION", MatchAny) && TailMatches("SET", "(")) else if (HeadMatches("ALTER", "SUBSCRIPTION", MatchAny) && TailMatches("SET", "("))
COMPLETE_WITH("binary", "disable_on_error", "failover", "origin", COMPLETE_WITH("binary", "disable_on_error", "failover", "origin",
"password_required", "run_as_owner", "slot_name", "password_required", "run_as_owner", "slot_name",
"streaming", "synchronous_commit"); "streaming", "synchronous_commit", "two_phase");
/* ALTER SUBSCRIPTION <name> SKIP ( */ /* ALTER SUBSCRIPTION <name> SKIP ( */
else if (HeadMatches("ALTER", "SUBSCRIPTION", MatchAny) && TailMatches("SKIP", "(")) else if (HeadMatches("ALTER", "SUBSCRIPTION", MatchAny) && TailMatches("SKIP", "("))
COMPLETE_WITH("lsn"); COMPLETE_WITH("lsn");

View File

@@ -62,4 +62,9 @@ extern void PrepareRedoRemove(TransactionId xid, bool giveWarning);
extern void restoreTwoPhaseData(void); extern void restoreTwoPhaseData(void);
extern bool LookupGXact(const char *gid, XLogRecPtr prepare_end_lsn, extern bool LookupGXact(const char *gid, XLogRecPtr prepare_end_lsn,
TimestampTz origin_prepare_timestamp); TimestampTz origin_prepare_timestamp);
extern void TwoPhaseTransactionGid(Oid subid, TransactionId xid, char *gid_res,
int szgid);
extern bool LookupGXactBySubid(Oid subid);
#endif /* TWOPHASE_H */ #endif /* TWOPHASE_H */

View File

@@ -243,7 +243,8 @@ extern void ReplicationSlotCreate(const char *name, bool db_specific,
extern void ReplicationSlotPersist(void); extern void ReplicationSlotPersist(void);
extern void ReplicationSlotDrop(const char *name, bool nowait); extern void ReplicationSlotDrop(const char *name, bool nowait);
extern void ReplicationSlotDropAcquired(void); extern void ReplicationSlotDropAcquired(void);
extern void ReplicationSlotAlter(const char *name, bool failover); extern void ReplicationSlotAlter(const char *name, const bool *failover,
const bool *two_phase);
extern void ReplicationSlotAcquire(const char *name, bool nowait); extern void ReplicationSlotAcquire(const char *name, bool nowait);
extern void ReplicationSlotRelease(void); extern void ReplicationSlotRelease(void);

View File

@@ -372,12 +372,14 @@ typedef char *(*walrcv_create_slot_fn) (WalReceiverConn *conn,
/* /*
* walrcv_alter_slot_fn * walrcv_alter_slot_fn
* *
* Change the definition of a replication slot. Currently, it only supports * Change the definition of a replication slot. Currently, it supports
* changing the failover property of the slot. * changing the failover and two_phase properties of the slot.
*/ */
typedef void (*walrcv_alter_slot_fn) (WalReceiverConn *conn, typedef void (*walrcv_alter_slot_fn) (WalReceiverConn *conn,
const char *slotname, const char *slotname,
bool failover); const bool *failover,
const bool *two_phase);
/* /*
* walrcv_get_backend_pid_fn * walrcv_get_backend_pid_fn
@@ -455,8 +457,8 @@ extern PGDLLIMPORT WalReceiverFunctionsType *WalReceiverFunctions;
WalReceiverFunctions->walrcv_send(conn, buffer, nbytes) WalReceiverFunctions->walrcv_send(conn, buffer, nbytes)
#define walrcv_create_slot(conn, slotname, temporary, two_phase, failover, snapshot_action, lsn) \ #define walrcv_create_slot(conn, slotname, temporary, two_phase, failover, snapshot_action, lsn) \
WalReceiverFunctions->walrcv_create_slot(conn, slotname, temporary, two_phase, failover, snapshot_action, lsn) WalReceiverFunctions->walrcv_create_slot(conn, slotname, temporary, two_phase, failover, snapshot_action, lsn)
#define walrcv_alter_slot(conn, slotname, failover) \ #define walrcv_alter_slot(conn, slotname, failover, two_phase) \
WalReceiverFunctions->walrcv_alter_slot(conn, slotname, failover) WalReceiverFunctions->walrcv_alter_slot(conn, slotname, failover, two_phase)
#define walrcv_get_backend_pid(conn) \ #define walrcv_get_backend_pid(conn) \
WalReceiverFunctions->walrcv_get_backend_pid(conn) WalReceiverFunctions->walrcv_get_backend_pid(conn)
#define walrcv_exec(conn, exec, nRetTypes, retTypes) \ #define walrcv_exec(conn, exec, nRetTypes, retTypes) \

View File

@@ -240,7 +240,8 @@ extern PGDLLIMPORT bool InitializingApplyWorker;
extern void logicalrep_worker_attach(int slot); extern void logicalrep_worker_attach(int slot);
extern LogicalRepWorker *logicalrep_worker_find(Oid subid, Oid relid, extern LogicalRepWorker *logicalrep_worker_find(Oid subid, Oid relid,
bool only_running); bool only_running);
extern List *logicalrep_workers_find(Oid subid, bool only_running); extern List *logicalrep_workers_find(Oid subid, bool only_running,
bool acquire_lock);
extern bool logicalrep_worker_launch(LogicalRepWorkerType wtype, extern bool logicalrep_worker_launch(LogicalRepWorkerType wtype,
Oid dbid, Oid subid, const char *subname, Oid dbid, Oid subid, const char *subname,
Oid userid, Oid relid, Oid userid, Oid relid,

View File

@@ -377,10 +377,7 @@ HINT: To initiate replication, you must manually create the replication slot, e
regress_testsub | regress_subscription_user | f | {testpub} | f | off | p | f | any | t | f | f | off | dbname=regress_doesnotexist | 0/0 regress_testsub | regress_subscription_user | f | {testpub} | f | off | p | f | any | t | f | f | off | dbname=regress_doesnotexist | 0/0
(1 row) (1 row)
--fail - alter of two_phase option not supported. -- we can alter streaming when two_phase enabled
ALTER SUBSCRIPTION regress_testsub SET (two_phase = false);
ERROR: unrecognized subscription parameter: "two_phase"
-- but can alter streaming when two_phase enabled
ALTER SUBSCRIPTION regress_testsub SET (streaming = true); ALTER SUBSCRIPTION regress_testsub SET (streaming = true);
\dRs+ \dRs+
List of subscriptions List of subscriptions

View File

@@ -256,10 +256,7 @@ CREATE SUBSCRIPTION regress_testsub CONNECTION 'dbname=regress_doesnotexist' PUB
CREATE SUBSCRIPTION regress_testsub CONNECTION 'dbname=regress_doesnotexist' PUBLICATION testpub WITH (connect = false, two_phase = true); CREATE SUBSCRIPTION regress_testsub CONNECTION 'dbname=regress_doesnotexist' PUBLICATION testpub WITH (connect = false, two_phase = true);
\dRs+ \dRs+
--fail - alter of two_phase option not supported. -- we can alter streaming when two_phase enabled
ALTER SUBSCRIPTION regress_testsub SET (two_phase = false);
-- but can alter streaming when two_phase enabled
ALTER SUBSCRIPTION regress_testsub SET (streaming = true); ALTER SUBSCRIPTION regress_testsub SET (streaming = true);
\dRs+ \dRs+

View File

@@ -367,6 +367,99 @@ $result =
$node_subscriber->safe_psql('postgres', "SELECT count(*) FROM tab_copy;"); $node_subscriber->safe_psql('postgres', "SELECT count(*) FROM tab_copy;");
is($result, qq(2), 'replicated data in subscriber table'); is($result, qq(2), 'replicated data in subscriber table');
# Clean up
$node_subscriber->safe_psql('postgres', "DROP SUBSCRIPTION tap_sub");
###############################
# Alter the subscription to set two_phase to false.
# Verify that the altered subscription reflects the new two_phase option.
###############################
# Confirm that the two-phase slot option is enabled before altering
$result = $node_publisher->safe_psql('postgres',
"SELECT two_phase FROM pg_replication_slots WHERE slot_name = 'tap_sub_copy';"
);
is($result, qq(t), 'two-phase is enabled');
# Alter subscription two_phase to false
$node_subscriber->safe_psql('postgres',
"ALTER SUBSCRIPTION tap_sub_copy DISABLE;");
$node_subscriber->poll_query_until('postgres',
"SELECT count(*) = 0 FROM pg_stat_activity WHERE backend_type = 'logical replication worker'"
);
$node_subscriber->safe_psql(
'postgres', "
ALTER SUBSCRIPTION tap_sub_copy SET (two_phase = false);
ALTER SUBSCRIPTION tap_sub_copy ENABLE;");
# Wait for subscription startup
$node_subscriber->wait_for_subscription_sync($node_publisher, $appname_copy);
# Make sure that the two-phase is disabled on the subscriber
$result = $node_subscriber->safe_psql('postgres',
"SELECT subtwophasestate FROM pg_subscription WHERE subname = 'tap_sub_copy';"
);
is($result, qq(d), 'two-phase subscription option should be disabled');
# Make sure that the two-phase slot option is also disabled
$result = $node_publisher->safe_psql('postgres',
"SELECT two_phase FROM pg_replication_slots WHERE slot_name = 'tap_sub_copy';"
);
is($result, qq(f), 'two-phase slot option should be disabled');
###############################
# Now do a prepare on the publisher and verify that it is not replicated.
###############################
$node_publisher->safe_psql(
'postgres', qq{
BEGIN;
INSERT INTO tab_copy VALUES (100);
PREPARE TRANSACTION 'newgid';
});
# Wait for the subscriber to catchup
$node_publisher->wait_for_catchup($appname_copy);
# Make sure there are no prepared transactions on the subscriber
$result = $node_subscriber->safe_psql('postgres',
"SELECT count(*) FROM pg_prepared_xacts;");
is($result, qq(0), 'should be no prepared transactions on subscriber');
###############################
# Set two_phase to "true" and failover to "true" before the COMMIT PREPARED.
#
# This tests the scenario where both two_phase and failover are altered
# simultaneously.
###############################
$node_subscriber->safe_psql('postgres',
"ALTER SUBSCRIPTION tap_sub_copy DISABLE;");
$node_subscriber->poll_query_until('postgres',
"SELECT count(*) = 0 FROM pg_stat_activity WHERE backend_type = 'logical replication worker'"
);
$node_subscriber->safe_psql(
'postgres', "
ALTER SUBSCRIPTION tap_sub_copy SET (two_phase = true, failover = true);
ALTER SUBSCRIPTION tap_sub_copy ENABLE;");
###############################
# Now commit the insert and verify that it is replicated.
###############################
$node_publisher->safe_psql('postgres', "COMMIT PREPARED 'newgid';");
# Wait for the subscriber to catchup
$node_publisher->wait_for_catchup($appname_copy);
# Make sure that the committed transaction is replicated.
$result =
$node_subscriber->safe_psql('postgres', "SELECT count(*) FROM tab_copy;");
is($result, qq(3), 'replicated data in subscriber table');
# Make sure that the two-phase is enabled on the subscriber
$result = $node_subscriber->safe_psql('postgres',
"SELECT subtwophasestate FROM pg_subscription WHERE subname = 'tap_sub_copy';"
);
is($result, qq(e), 'two-phase should be enabled');
$node_subscriber->safe_psql('postgres', "DROP SUBSCRIPTION tap_sub_copy;"); $node_subscriber->safe_psql('postgres', "DROP SUBSCRIPTION tap_sub_copy;");
$node_publisher->safe_psql('postgres', "DROP PUBLICATION tap_pub_copy;"); $node_publisher->safe_psql('postgres', "DROP PUBLICATION tap_pub_copy;");
@@ -374,8 +467,6 @@ $node_publisher->safe_psql('postgres', "DROP PUBLICATION tap_pub_copy;");
# check all the cleanup # check all the cleanup
############################### ###############################
$node_subscriber->safe_psql('postgres', "DROP SUBSCRIPTION tap_sub");
$result = $node_subscriber->safe_psql('postgres', $result = $node_subscriber->safe_psql('postgres',
"SELECT count(*) FROM pg_subscription"); "SELECT count(*) FROM pg_subscription");
is($result, qq(0), 'check subscription was dropped on subscriber'); is($result, qq(0), 'check subscription was dropped on subscriber');